Implemented for TensorFlow 2.0+
- All agents write TensorBoard logs during training!
- SAC
- DDPG OU Noise
- SAC Discrete
- Install the dependencies imported by each file (my TF2 conda env is included as a reference)
- Each file contains example code that runs training on the CartPole env
- Training:
python3 TF2_DDPG_LSTM.py
- TensorBoard:
tensorboard --logdir=DDPG/logs
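Under the hood, TF2 training logs like these are written with `tf.summary` (a minimal sketch; the log directory and tag names below are illustrative, not necessarily the exact ones used in the repo):

```python
import tensorflow as tf

# Hypothetical log directory; the agents here write under e.g. DDPG/logs
writer = tf.summary.create_file_writer("logs/demo_run")

with writer.as_default():
    for episode in range(3):
        episode_reward = 100.0 + episode  # placeholder metric
        # Each scalar shows up as a curve in TensorBoard, indexed by step
        tf.summary.scalar("episode_reward", episode_reward, step=episode)
    writer.flush()
```

Point `tensorboard --logdir` at the parent directory to browse the curves.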
- Install hyperopt https://github.com/hyperopt/hyperopt
- Optional: switch the agent used and configure the parameter space in
  hyperparam_tune.py
- Run:
python3 hyperparam_tune.py
All agents were tested using the CartPole env
Name | On/off policy | Model | Action space support | Exploration method |
---|---|---|---|---|
DQN | off-policy | Dense, LSTM | discrete | e-greedy |
DDPG | off-policy | Dense, LSTM | discrete, continuous | OU or Gaussian noise |
AE-DDPG | off-policy | Dense | discrete, continuous | Random walk noise |
SAC | off-policy | Dense | continuous | Maximum entropy |
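The Ornstein-Uhlenbeck (OU) noise listed for DDPG is temporally correlated noise that reverts toward a mean, added to the actor's output for exploration. A minimal NumPy sketch (the parameter defaults are common values from the DDPG literature, not necessarily the repo's):

```python
import numpy as np

class OUNoise:
    """Ornstein-Uhlenbeck process for DDPG-style action exploration."""

    def __init__(self, action_dim, mu=0.0, theta=0.15, sigma=0.2, dt=1e-2):
        self.mu = mu * np.ones(action_dim)
        self.theta = theta  # mean-reversion rate
        self.sigma = sigma  # noise scale
        self.dt = dt
        self.reset()

    def reset(self):
        # Restart the process at the mean (call at the start of each episode)
        self.state = np.copy(self.mu)

    def sample(self):
        # dx = theta * (mu - x) * dt + sigma * sqrt(dt) * N(0, 1)
        dx = (self.theta * (self.mu - self.state) * self.dt
              + self.sigma * np.sqrt(self.dt) * np.random.randn(*self.state.shape))
        self.state = self.state + dx
        return self.state
```

Typical use: `action = np.clip(actor_output + noise.sample(), -1, 1)`, with `noise.reset()` at each episode start.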
Models used to generate the demos are included in the repo; you can also find Q-value and reward graphs.
Demos:
- DQN Basic, time step = 4, 500 reward
- DQN LSTM, time step = 4, 500 reward
- DDPG Basic, 222 reward
- DDPG LSTM, time step = 5, 500 reward
- AE-DDPG Basic, 500 reward