gold_explorer

gold_explorer#

class prt_sim.jhu.gold_explorer.GoldExplorer(render_mode: str | None = 'rgb_array')[source]#

The Gold Explorer puzzle

Action space: integer representing a discrete action described in the table below

Observation space: integer between 0 and 127 representing the state as an octal number, <gold bit><row><column>

Reward: +15 for obtaining gold coins, +30 for obtaining the motherlode, -30 for entering a mine field, -1 for every other location

execute_action(action: int) → Tuple[int, float, bool][source]#

Executes an action for the explorer.

Returns:

get_number_of_actions() → int[source]#

Returns the number of discrete actions in the puzzle

get_number_of_states() → int[source]#

Returns the number of states in the puzzle

reset(seed: int | None = None, randomize_start: bool | None = False) → int[source]#

Resets the environment to the initial state.

Parameters:

seed (int, optional) – Random seed. Defaults to None.
randomize_start (bool, optional) – Whether to randomize the starting state. Not all environments will support this. Defaults to False.

Returns:

current state value

Return type:

int