Stars
Open-source Multi-agent Poster Generation from Papers
A Framework for LLM-based Multi-Agent Reinforced Training and Inference
A collection of papers on discrete diffusion models
[CVPR 2025] LLaVA-ST: A Multimodal Large Language Model for Fine-Grained Spatial-Temporal Understanding
No fortress, purely open ground. OpenManus is Coming.
A fast communication-overlapping library for tensor/expert parallelism on GPUs.
🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation
Awesome Reasoning LLM Tutorial/Survey/Guide
🔥CVPR 2025 Multimodal Large Language Models Paper List
Explore the Multimodal “Aha Moment” on 2B Model
R1-onevision, a visual language model capable of deep CoT reasoning.
Muon: An optimizer for hidden layers in neural networks
Wan: Open and Advanced Large-Scale Video Generative Models
A library for advanced large language model reasoning
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
FlashMLA: Efficient MLA decoding kernels
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
A CPU Realtime VLM in 500M. Surpassed Moondream2 and SmolVLM. Training from scratch with ease.
Solve Visual Understanding with Reinforced VLMs
[ICLR 2025] MoE++: Accelerating Mixture-of-Experts Methods with Zero-Computation Experts
Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
Official Repo for Open-Reasoner-Zero
Train transformer language models with reinforcement learning.
Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.
verl: Volcano Engine Reinforcement Learning for LLMs
Collect every awesome work about r1!