Stars
Wan: Open and Advanced Large-Scale Video Generative Models
FlashMLA: Efficient MLA decoding kernels
MoBA: Mixture of Block Attention for Long-Context LLMs
A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
Curated list of datasets and tools for post-training.
Democratizing Reinforcement Learning for LLMs
verl: Volcano Engine Reinforcement Learning for LLMs
🍼 Official implementation of Dynamic Data Mixing Maximizes Instruction Tuning for Mixture-of-Experts
Machine Learning Engineering Open Book
Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷
🧑🚀 Summary of the world's best LLM resources (video generation, agents, coding assistance, data processing, model training, model inference, o1 models, MCP, small language models, vision-language models).
An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & RFT & Dynamic Sampling & Async Agent RL)
This repository contains the code for SFT, RLHF, and DPO, designed for vision-based LLMs, including the LLaVA models and the LLaMA-3.2-vision models.
A repository sharing literature on long-context large language models, including methodologies and evaluation benchmarks.
GLM-4 series: Open Multilingual Multimodal Chat LMs
A generative speech model for daily dialogue.
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal chat model approaching GPT-4o performance.
lencx / Noi
🚀 Power Your World with AI - Explore, Extend, Empower.
A latent text-to-image diffusion model
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.