MO-Humanoid#
Action Space |
Box(-0.4, 0.4, (17,), float32) |
Observation Shape |
(376,) |
Observation High |
inf |
Observation Low |
-inf |
Reward Shape |
(2,) |
Reward High |
[inf inf] |
Reward Low |
[-inf -inf] |
Import |
|
Description#
Multi-objective version of the HumanoidEnv environment.
See Gymnasium’s env for more information.
Reward Space#
The reward is 2-dimensional:
0: Reward for running forward (x-velocity)
1: Control cost of the action