Stars
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
Awesome-LLM: a curated list of Large Language Models
Awesome RL Reasoning Recipes ("Triple R")
Awesome Reasoning LLM Tutorial/Survey/Guide
Minimal reproduction of DeepSeek R1-Zero
[IJCV] Official repository for "Image Captions are Natural Prompts for Text-to-Image Models"
[Preprint] Official repository for "Short-length Adversarial Training Helps LLMs Defend Long-length Jailbreak Attacks: Theoretical and Empirical Evidence"
Automatically generate your styled QR code in your web app.
[NeurIPS 2024] Official repository for "Offline Behavior Distillation"
PyTorch implementations of density estimation algorithms: BNAF, Glow, MAF, RealNVP, planar flows
PyTorch implementations of algorithms for density estimation
[TNNLS] Official repository for "Attentive Learning Facilitates Generalization of Neural Networks"
A curated list of reinforcement learning with human feedback resources (continually updated)
Robust recipes to align language models with human and AI preferences
RewardBench: the first evaluation tool for reward models.
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
Isaac Gym Reinforcement Learning Environments
Python Implementation of Reinforcement Learning: An Introduction
Repo for offline reinforcement learning methods
An educational resource to help anyone learn deep reinforcement learning.
An index of algorithms for offline reinforcement learning (offline-rl)
Public recipe files for Apptainer containers used on CSC HPC environments
An open-source remote desktop application designed for self-hosting, as an alternative to TeamViewer.
ReDo: The Dormant Neuron Phenomenon in Deep Reinforcement Learning (PyTorch)
SCOPE-RL: A Python library for offline reinforcement learning, off-policy evaluation, and selection