MO-Mountaincar#

../../_images/mo-mountaincar.gif

Action Space

Discrete(3)

Observation Shape

(2,)

Observation High

[0.6 0.07]

Observation Low

[-1.2 -0.07]

Reward Shape

(3,)

Reward High

[-1. 0. 0.]

Reward Low

[-1. -1. -1.]

Import

mo_gymnasium.make("mo-mountaincar-v0")

A multi-objective version of the MountainCar environment, where the goal is to reach the top of the mountain.

See Gymnasium’s env for more information.

Reward space:#

The reward space is a 3D vector containing the time penalty, and penalties for reversing and going forward.

  • time penalty: -1.0 for each time step

  • reverse penalty: -1.0 for each time step the action is 0 (reverse)

  • forward penalty: -1.0 for each time step the action is 2 (forward)