open_ai

Range of RL projects using OpenAI gym package

Autonomous Taxi

Goal: Have a taxi agent learn how to navigate a grid world to pick up and drop off a passenger using tabular q-learning

This notebook was created to teach coworkers about Q-learning

Lunar Lander

Goal: Teach a DQN agent to learn how to land a ship on a landing pad.

Experiments:

Memory size (memory): experiment_results/avg_reward_dqn_11_23_2021_13_04.png

Observations: Nearing 250 episodes, agents trained with both 100k and 1E6 experience replay buffer size observed better 52CC rolling average rewards. Nearly double 1E7 memory size.

Learning rate (alpha): experiment_results/avg_reward_dqn_11_25_2021_01_53.png

Observations: Agent learned in a more stable fashion at alpha=0.001. Maneuvers created by agent learning at 0.01 seemed much more risky (swinging wildly from side to side) opposed to the conservative upright landing style of the 0.001 agent.

Discount rate (gamma): experiment_results/avg_reward_dqn_11_24_2021_20_27.png

Observations:

Note: Best results would be observed if multiple experiments were performed and variance was calculated.

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
.idea		.idea
autonomous_taxi		autonomous_taxi
lunar_lander		lunar_lander
.DS_Store		.DS_Store
.gitattributes		.gitattributes
README.md		README.md
avg_reward_random.png		avg_reward_random.png
total_reward_dqn.png		total_reward_dqn.png
total_reward_random.png		total_reward_random.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

open_ai

Autonomous Taxi

Lunar Lander

Experiments:

About

Uh oh!

Releases

Packages

Languages

megforr/open_ai

Folders and files

Latest commit

History

Repository files navigation

open_ai

Autonomous Taxi

Lunar Lander

Experiments:

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages