Stars
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC
PyTorch implementation of the TabNet paper: https://arxiv.org/pdf/1908.07442.pdf
Lviv Data Science Summer School 2019 lecture on Automated Machine Learning