Easy21

python version: 3.10.13 torch version: 2.1.1+cu121

This is the Easy21 Assignment for David Silver's 2015 COMPM050/COMPGI13, Reinforcement Learning, which is a REALLY REALLY EXCELLENT course.
If you encounter difficulties while trying to complete its assignments, you can refer to my implementation.

非常推荐David Silver的这个课程，有许多在十年后的今天看来仍不落后的观点和讲解，如果你想要实现大作业Easy21却觉得无从下手，或是觉得别人的代码缺少解释，可以参考我的实现:D

File construct:
├─policy_gradient
├─tabular_method
└─utils

在policy_gradient中，有一个简单的linear_approximation实现，与一个REINFORCE算法的实现在tabular_method中，包含了原始的Easy21所要求的实现，以及一个百万次迭代后的Q值数组

Citation:

Hands-on-RL

Advanced Topics 2015 (COMPM050/COMPGI13)

Easy21 by hereismari

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
policy_gradient		policy_gradient
tabular_method		tabular_method
utils		utils
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Easy21

About

Uh oh!

Releases

Packages

Languages

yuanzihao/Easy21

Folders and files

Latest commit

History

Repository files navigation

Easy21

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages