plot
Functions
Creates a plot of the percentage of optimal actions over the training episodes.
Plots the mean and variance of the bandit probabilities.
Plots the rewards received by the agent(s) playing the bandits game.
- prt_sim.jhu.plot.plot_bandit_percent_optimal_action(optimal_bandits: ndarray, actions: ndarray) -> None
Creates a plot of the percentage of optimal actions over the training episodes.
- Parameters:
optimal_bandits (np.ndarray) – array of optimal bandit indexes
actions (np.ndarray) – actions chosen by the agent(s) with shape (# agents, # episodes)
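As a rough illustration of the quantity this function plots, the sketch below computes a percent-optimal-action curve with NumPy alone. It does not call prt_sim. The helper name `percent_optimal_action` is hypothetical, and it assumes `optimal_bandits` holds one optimal index per agent with shape (# agents,), which the parameter description above does not state.

```python
import numpy as np


def percent_optimal_action(optimal_bandits: np.ndarray, actions: np.ndarray) -> np.ndarray:
    """Return, per episode, the percentage of agents that chose their optimal bandit.

    Assumption: optimal_bandits has shape (# agents,) and
    actions has shape (# agents, # episodes).
    """
    optimal_bandits = np.asarray(optimal_bandits)
    actions = np.asarray(actions)
    # Compare each action with that agent's optimal index (broadcast over episodes).
    is_optimal = actions == optimal_bandits[:, np.newaxis]
    # Average over agents (axis 0) and convert the fraction to a percentage.
    return 100.0 * is_optimal.mean(axis=0)


# Two agents, three episodes: agent 0's optimal arm is 1, agent 1's is 0.
optimal = np.array([1, 0])
chosen = np.array([[1, 0, 1],
                   [1, 0, 0]])
curve = percent_optimal_action(optimal, chosen)
```

Here `curve` has one entry per episode, so it could be passed directly to a line plot with the episode index on the x-axis.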
- prt_sim.jhu.plot.plot_bandit_probabilities(env: KArmBandits) -> None
Plots the mean and variance of the bandit probabilities.
- Parameters:
env (KArmBandits) – bandits environment
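How `KArmBandits` stores its per-arm parameters is not shown here, so the following is only a sketch with stand-in data. It assumes each arm pays a normally distributed reward with its own mean and variance, and gathers the values a mean-with-spread plot would need. The names `arm_means`, `arm_variances`, and `arm_summary` are hypothetical and are not part of prt_sim.

```python
import numpy as np

# Hypothetical stand-in for a 3-armed bandit: one mean and one variance per arm.
arm_means = np.array([0.2, -0.5, 1.0])
arm_variances = np.array([1.0, 0.25, 4.0])


def arm_summary(means, variances):
    """Return (arm index, mean, standard deviation) for each arm."""
    means = np.asarray(means, dtype=float)
    # Error bars are usually drawn in standard deviations, not variances.
    stds = np.sqrt(np.asarray(variances, dtype=float))
    return [(i, float(m), float(s)) for i, (m, s) in enumerate(zip(means, stds))]


summary = arm_summary(arm_means, arm_variances)
# The optimal arm is the one with the highest mean reward.
best_arm = int(np.argmax(arm_means))
```

Each tuple in `summary` gives the centre and half-width of one arm's error bar, and `best_arm` is the index an agent would ideally converge to.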