8000 GitHub - gao370829/TF2-RL: Reinforcement learning algorithms implemented for Tensorflow 2.0+ [DQN, DDPG, AE-DDPG, SAC]
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
forked from anita-hu/TF2-RL

Reinforcement learning algorithms implemented for Tensorflow 2.0+ [DQN, DDPG, AE-DDPG, SAC]

Notifications You must be signed in to change notification settings

gao370829/TF2-RL

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

29 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Reinforcement Learning Agents

Implemented for Tensorflow 2.0+

New Updates!

  • All agents have tensorboard logs during training!!
  • SAC
  • DDPG OU Noise

Future Plans

  • SAC Discrete

Usage

  • Install dependancies imported (my tf2 conda env as reference)
  • Each file contains example code that runs training on CartPole env
  • Training: python3 TF2_DDPG_LSTM.py
  • Tensorboard: tensorboard --logdir=DDPG/logs

Hyperparameter tuning

Agents

All agents tested using CartPole env

Name On/off policy Model Action space support Exploration method
DQN off-policy Dense, LSTM discrete e-greedy
DDPG off-policy Dense, LSTM discrete, continuous OU or Gaussian noise
AE-DDPG off-policy Dense discrete, continuous Random walk noise
SAC off-policy Dense continuous Maximum entropy

Models

Models used to generate the demos are included in the repo, you can also find q value and reward graphs

Demos

DQN Basic, time step = 4, 500 reward DQN LSTM, time step = 4, 500 reward
DDPG Basic, 222 reward DDPG LSTM, time step = 5, 500 reward
AE-DDPG Basic, 500 reward

About

Reinforcement learning algorithms implemented for Tensorflow 2.0+ [DQN, DDPG, AE-DDPG, SAC]

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%
0