D2L-Exercises Public
Forked from Chandan-h-509/D2L-Exercises
Solutions to the exercises in Dive into Deep Learning, in PyTorch
Jupyter Notebook Updated Nov 27, 2021
-
This is the repo where Suraj and Eric tackle LeetCode challenges following neetcode.io. We aim to solve each challenge with as many methods as possible.
-
GPG Public
Forked from AMAP-ML/GPG
GPG: A Simple and Strong Reinforcement Learning Baseline for Model Reasoning
Python Updated Apr 8, 2025
-
jaxued Public
Forked from DramaCow/jaxued
forked from jaxued
Python Apache License 2.0 Updated Dec 28, 2024
-
Kinetix Public
Forked from FLAIROx/Kinetix
ACCEL experiments
Python MIT License Updated Mar 24, 2025
-
ML-Side-Project-Stock-Algo Public
This is the repo for my Master's thesis on a stock trading algorithm
Jupyter Notebook MIT License Updated Apr 18, 2023
-
oat Public
Forked from sail-sg/oat
🌾 OAT: A research-friendly framework for LLM online alignment, including preference learning, reinforcement learning, etc.
Python Apache License 2.0 Updated May 7, 2025
-
PRIMAL Public
Forked from gsartoretti/PRIMAL
PRIMAL: Pathfinding via Reinforcement and Imitation Multi-Agent Learning -- Distributed RL/IL code for Multi-Agent Path Finding (MAPF)
Python MIT License Updated Mar 27, 2023
-
recurrent-ppo-truncated-bptt Public
Forked from MarcoMeter/recurrent-ppo-truncated-bptt
Baseline implementation of recurrent PPO using truncated BPTT
Python MIT License Updated Apr 28, 2024
-
sampling-for-learnability Public
Forked from amacrutherford/sampling-for-learnability
Official codebase for "Sampling For Learnability", published at NeurIPS 2024
Jupyter Notebook Apache License 2.0 Updated Jan 7, 2025
-
An implementation of the HTS forecasting paper on the M5 dataset
-
TuoTuo Public
TuoTuo is a Topic Modeling library for Researchers and Engineers
-
ued_llm_jax Public
Forked from facebookresearch/minimax
Efficient baselines for autocurricula in JAX.
Python Apache License 2.0 Updated Aug 31, 2024
-
understand-r1-zero Public
Forked from sail-sg/understand-r1-zero
Understanding R1-Zero-Like Training: A Critical Perspective
Python MIT License Updated Apr 4, 2025
-
webgames Public
Forked from convergence-ai/webgames
Challenges for general-purpose web-browsing AI agents
TypeScript Other Updated Feb 7, 2025