MO-Swimmer
Action Space | Box(-1.0, 1.0, (2,), float32)
Observation Shape | (8,)
Observation High | inf
Observation Low | -inf
Reward Shape | (2,)
Reward High | [inf inf]
Reward Low | [-inf -inf]
Import |
Description
Multi-objective version of the SwimmerEnv environment.
See the Gymnasium Swimmer environment documentation for more information.
The reward of the original Gymnasium 'Swimmer-v5' environment is recovered by the following linear scalarization:
    env = mo_gym.make('mo-swimmer-v5')
    LinearReward(env, weight=np.array([1.0, 1e-4]))
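To make the arithmetic behind the scalarization concrete, here is a minimal standard-library sketch of a weighted sum over a 2-dimensional reward vector. It is not the library's implementation: `linear_scalarize` is a hypothetical helper, the reward values are made up, and the sketch assumes the control-cost component is already reported as a negative number (a penalty).

```python
from typing import Sequence


def linear_scalarize(vector_reward: Sequence[float], weight: Sequence[float]) -> float:
    """Collapse a reward vector into one scalar via a weighted sum (dot product)."""
    if len(vector_reward) != len(weight):
        raise ValueError("reward and weight must have the same length")
    return sum(w * r for w, r in zip(weight, vector_reward))


# Hypothetical reward vector: [forward reward, control-cost term].
# Assumption: the control-cost term is already negative (a penalty).
example_reward = [0.5, -2.0]

# Weights taken from the scalarization shown above.
swimmer_weight = [1.0, 1e-4]

scalar = linear_scalarize(example_reward, swimmer_weight)
print(scalar)
```

With these weights the forward term dominates and the control term contributes only a small correction, which is why the second weight is so small.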
Reward Space
The reward is 2-dimensional:
0: Reward for moving forward (x-velocity)
1: Control cost of the action
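Because each step yields a vector rather than a scalar, per-objective returns are usually tracked separately. The following sketch sums each component over an episode using the indices listed above. The helper `vector_return` and the per-step values are hypothetical illustrations, not output from the environment.

```python
from typing import Iterable, List, Sequence

# Indices into the reward vector, matching the list above.
FORWARD, CONTROL = 0, 1


def vector_return(rewards: Iterable[Sequence[float]], n_objectives: int = 2) -> List[float]:
    """Sum each reward component independently over an episode (undiscounted)."""
    totals = [0.0] * n_objectives
    for step_reward in rewards:
        for i in range(n_objectives):
            totals[i] += step_reward[i]
    return totals


# Hypothetical per-step reward vectors, not real environment output.
episode_rewards = [[0.25, -0.5], [0.5, -1.0], [0.25, -0.5]]

totals = vector_return(episode_rewards)
print("forward return:", totals[FORWARD])
print("control return:", totals[CONTROL])
```

Keeping the components separate until the end lets the same episode data be re-scored under different weight vectors.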
Version History:
See https://gymnasium.farama.org/main/environments/mujoco/swimmer/#version-history