MO-Mountaincar#
![../../_images/mo-mountaincar.gif](../../_images/mo-mountaincar.gif)
| Property | Value |
|---|---|
| Action Space | Discrete(3) |
| Observation Shape | (2,) |
| Observation High | [0.6 0.07] |
| Observation Low | [-1.2 -0.07] |
| Reward Shape | (3,) |
| Reward High | [-1. 0. 0.] |
| Reward Low | [-1. -1. -1.] |
| Import | |
A multi-objective version of the MountainCar environment, where the goal is to reach the top of the mountain.
See Gymnasium's documentation for the original MountainCar environment for details on the dynamics, observations, and actions.
Reward space:#
The reward is a 3-dimensional vector containing a time penalty, a penalty for reversing, and a penalty for going forward:
- time penalty: -1.0 for each time step
- reverse penalty: -1.0 for each time step the action is 0 (reverse)
- forward penalty: -1.0 for each time step the action is 2 (forward)
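
The three components above can be sketched as a small standalone function. This is only an illustration of the rules as stated on this page, not the environment's actual source code: the name `reward_vector` and the action constants are hypothetical, and any special handling of the final (terminal) step is ignored.

```python
from typing import List

# Action indices as described in the reward list above.
# The remaining action (1) is neither reverse nor forward.
REVERSE = 0
FORWARD = 2


def reward_vector(action: int) -> List[float]:
    """Return [time, reverse, forward] penalties for a single time step.

    Hypothetical helper that mirrors the reward description on this page.
    """
    if action not in (0, 1, 2):
        raise ValueError(f"action must be 0, 1, or 2, got {action}")
    time_penalty = -1.0  # charged on every time step
    reverse_penalty = -1.0 if action == REVERSE else 0.0
    forward_penalty = -1.0 if action == FORWARD else 0.0
    return [time_penalty, reverse_penalty, forward_penalty]


for action in (0, 1, 2):
    print(action, reward_vector(action))
# 0 [-1.0, -1.0, 0.0]
# 1 [-1.0, 0.0, 0.0]
# 2 [-1.0, 0.0, -1.0]
```

Under these rules the second and third components can never both be -1.0 in the same step, since a single action cannot be both reverse and forward.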