More
10000
Lists (16)
Sort Name ascending (A-Z)
Stars
A Distributed Attention Towards Linear Scalability for Ultra-Long Context, Heterogeneous Data Training
MAGI-1: Autoregressive Video Generation at Scale
An Open-source RL System from ByteDance Seed and Tsinghua AIR
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
[ICLR 2025 Oral] The official implementation of "Diffusion-Based Planning for Autonomous Driving with Flexible Guidance"
verl: Volcano Engine Reinforcement Learning for LLMs
Large Concept Models: Language modeling in a sentence representation space
Efficient Triton Kernels for LLM Training
The official repository of the Omni-MATH benchmark.
OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models
Solutions of Reinforcement Learning, An Introduction
An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)
An educational resource to help anyone learn deep reinforcement learning.
Python Implementation of Reinforcement Learning: An Introduction
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
DSIR large-scale data selection framework for language model training
Library for fast text representation and classification.
OLMoE: Open Mixture-of-Experts Language Models
[NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other models
[ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning
Measuring Massive Multitask Language Understanding | ICLR 2021