8000 GitHub - WskThomas/projetrl
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

WskThomas/projetrl

Repository files navigation

Project RL - AplaZero General

Implementation of different sets (nim, hex and pentago) based on the baseline you can find here. The provided baseline has been simplified (games such as tictactoe othello have been removed) in order to focus on the games that are required.

To use a game of your choice, subclass the classes in Game.py and NeuralNet.py and implement their functions. Example implementations for Othello can be found in hex/hexGame.py and hex/pytorch/NNet.py.

Coach.py contains the core training loop and MCTS.py performs the Monte Carlo Tree Search. The parameters for the self-play can be specified in main.py. Additional neural network parameters are in hex/pytorch/NNet.py (cuda flag, batch size, epochs, learning rate etc.).

To start training a model for hex:

python main.py

Choose your framework and game in main.py.

To play against a pretrained model:

python pit.py

Choose your framework and game in pit.py.

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published
0