Stars
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels
SGLang is a fast serving framework for large language models and vision language models.
[ACL 2024]Official GitHub repo for OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scientific Problems.
Official repo for paper: "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't"
[CVPR 2023 Best Paper Award] Planning-oriented Autonomous Driving
Everything about the SmolLM2 and SmolVLM family of models
lightonai / mamba-amd
Forked from state-spaces/mambaPort of Mamba to run and run efficiently on AMD.
Fully open reproduction of DeepSeek-R1
DiffClass: Diffusion-Based Class Incremental Learning
[NeurIPS 2024] Exploring Token Pruning in Vision State Space Models
A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.
Open-Sora: Democratizing Efficient Video Production for All
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
HunyuanVideo: A Systematic Framework For Large Video Generation Model
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.
Scalable toolkit for efficient model alignment
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Train transformer language models with reinforcement learning.
本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)
[NeurIPS2024] Fast and Memory-Efficient Video Diffusion Using Streamlined Inference