yuanzihao/Easy21

Easy21

Python version: 3.10.13
torch version: 2.1.1+cu121


This is the Easy21 assignment from David Silver's 2015 Reinforcement Learning course (COMPM050/COMPGI13), which is a really, really excellent course. I highly recommend it: many of its ideas and explanations still hold up ten years later.

If you want to implement the Easy21 assignment but don't know where to start, or find that other people's code lacks explanation, you can refer to my implementation :D

File structure:
├─policy_gradient
├─tabular_method
└─utils

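policy_gradient contains a simple linear_approximation implementation and an implementation of the REINFORCE algorithm.
tabular_method contains the implementations required by the original Easy21 assignment, along with a Q-value array obtained after one million iterations.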
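For orientation, here is a minimal sketch of the Easy21 environment that both the tabular and policy-gradient methods interact with. It follows the rules as stated in the course assignment (cards 1 to 10, black with probability 2/3 and red with probability 1/3, dealer sticks on 17 or more). It is not the code from this repository: the names `draw_card`, `is_bust`, `reset`, `step`, `HIT` and `STICK` are hypothetical and chosen only for illustration.

```python
import random

# Hypothetical action encoding, used only in this sketch.
HIT, STICK = 0, 1


def draw_card(rng, force_black=False):
    """Return a signed card value: black cards add, red cards subtract."""
    value = rng.randint(1, 10)  # card values are uniform on 1..10
    if force_black or rng.random() < 2 / 3:
        return value  # black card
    return -value  # red card


def is_bust(total):
    """A hand goes bust when its sum leaves the range 1..21."""
    return total < 1 or total > 21


def reset(rng):
    """Start a game: dealer and player each draw one black card."""
    dealer = draw_card(rng, force_black=True)
    player = draw_card(rng, force_black=True)
    return dealer, player


def step(state, action, rng):
    """Apply one player action and return (next_state, reward, done)."""
    dealer, player = state
    if action == HIT:
        player += draw_card(rng)
        if is_bust(player):
            return (dealer, player), -1, True
        return (dealer, player), 0, False

    # STICK: the dealer plays out, hitting until reaching 17 or going bust.
    while not is_bust(dealer) and dealer < 17:
        dealer += draw_card(rng)

    if is_bust(dealer) or player > dealer:
        reward = 1
    elif player < dealer:
        reward = -1
    else:
        reward = 0
    return (dealer, player), reward, True
```

A state here is the pair (dealer's first card, player's sum). An episode is played by calling `reset(rng)` once and then `step(state, action, rng)` until `done` is true, with `rng = random.Random(seed)` for reproducibility.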


Citation:

  1. Hands-on-RL
  2. Advanced Topics 2015 (COMPM050/COMPGI13)
  3. Easy21 by hereismari
