MO-Ant#
Action Space |
Box(-1.0, 1.0, (8,), float32) |
Observation Shape |
(27,) |
Observation High |
inf |
Observation Low |
-inf |
Reward Shape |
(3,) |
Reward High |
[inf inf inf] |
Reward Low |
[-inf -inf -inf] |
Import |
|
Description#
Multi-objective version of the AntEnv environment.
See Gymnasium’s env for more information.
Reward Space#
The reward is 2- or 3-dimensional:
0: x-velocity
1: y-velocity
2: Control cost of the action If the cost_objective flag is set to False, the reward is 2-dimensional, and the cost is added to other objectives. A healthy reward is added to all objectives.