Grid-World

Environments with discrete observation spaces, e.g., grid-worlds.

| Env | Obs/Action spaces | Objectives | Description |
| --- | --- | --- | --- |
| deep-sea-treasure-v0 | Discrete / Discrete | [treasure, time_penalty] | Agent is a submarine that must collect a treasure while taking into account a time penalty. Treasure values taken from Yang et al. 2019. |
| deep-sea-treasure-concave-v0 | Discrete / Discrete | [treasure, time_penalty] | Agent is a submarine that must collect a treasure while taking into account a time penalty. Treasure values taken from Vamplew et al. 2010. |
| deep-sea-treasure-mirrored-v0 | Discrete / Discrete | [treasure, time_penalty] | Harder version of the concave DST. From Felten et al. 2022. |
| resource-gathering-v0 | Discrete / Discrete | [enemy, gold, gem] | Agent must collect gold or gems. Enemies have a 10% chance of killing the agent. From Barrett & Narayanan 2008. |
| fishwood-v0 | Discrete / Discrete | [fish_amount, wood_amount] | ESR (expected scalarised returns) environment: the agent must collect fish and wood to light a fire and eat. From Roijers et al. 2018. |
| breakable-bottles-v0 | Discrete (Dictionary) / Discrete | [time_penalty, bottles_delivered, potential] | Gridworld with 5 cells. The agent must collect bottles from the source location and deliver them to the destination. From Vamplew et al. 2021. |
| fruit-tree-v0 | Discrete / Discrete | [nutri1, ..., nutri6] | Full binary tree of depth d = 5, 6, or 7. Every leaf contains a fruit with a value for the nutrients Protein, Carbs, Fats, Vitamins, Minerals and Water. From Yang et al. 2019. |
| four-room-v0 | Discrete / Discrete | [item1, item2, item3] | Agent must collect three different types of items in the map and reach the goal. From Alegre et al. 2022. |
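
The Objectives entries listed above give the order of the components in each environment's vector reward. As a minimal sketch of how that ordering might be used, the snippet below attaches objective names to a reward vector and collapses it into a scalar with linear weights. The `OBJECTIVES` dictionary, the helper names `label_reward` and `scalarize`, and the example reward values are hypothetical illustrations, not part of any library API; only the environment ids and objective names come from the entries above.

```python
from typing import Dict, List, Sequence

# Objective names per environment id, copied from the entries above.
# Only a subset is listed; this mapping is a hypothetical helper.
OBJECTIVES: Dict[str, List[str]] = {
    "deep-sea-treasure-v0": ["treasure", "time_penalty"],
    "resource-gathering-v0": ["enemy", "gold", "gem"],
    "fishwood-v0": ["fish_amount", "wood_amount"],
    "four-room-v0": ["item1", "item2", "item3"],
}


def label_reward(env_id: str, reward: Sequence[float]) -> Dict[str, float]:
    """Pair each component of a vector reward with its objective name."""
    names = OBJECTIVES[env_id]
    if len(names) != len(reward):
        raise ValueError(
            f"{env_id} has {len(names)} objectives, got {len(reward)} values"
        )
    return dict(zip(names, reward))


def scalarize(reward: Sequence[float], weights: Sequence[float]) -> float:
    """Linear scalarization: weighted sum of the reward components."""
    if len(reward) != len(weights):
        raise ValueError("reward and weights must have the same length")
    return sum(w * r for w, r in zip(weights, reward))


if __name__ == "__main__":
    # Made-up reward vector for illustration: [treasure, time_penalty].
    reward = [10.0, -4.0]
    print(label_reward("deep-sea-treasure-v0", reward))
    print(scalarize(reward, [0.5, 0.5]))
```

Linear weighting is only one possible choice of utility function; it is used here because it is the simplest way to show how the objective order matters.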