Stars
PyTorch native quantization and sparsity for training and inference
Triton-based implementation of Sparse Mixture of Experts.
GPU operators for sparse tensor operations
Accelerating Diffusion Transformers with Token-wise Feature Caching
terashuf shuffles multi-terabyte text files using limited memory
Cramming the training of a (BERT-type) language model into limited compute.
Minimal pretraining script for language modeling in PyTorch. Supports torch compilation and DDP, and includes a model implementation and data preprocessing.
A framework for few-shot evaluation of language models.
Measuring Massive Multitask Language Understanding | ICLR 2021
🎬 3.7× faster video generation E2E 🖼️ 1.6× faster image generation E2E ⚡ ColumnSparseAttn 9.3× vs FlashAttn‑3 💨 ColumnSparseGEMM 2.5× vs cuBLAS
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
Official inference repo for FLUX.1 models
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
Implementations of attention with the softpick function, naive and FlashAttention-2
ian4hu / Clipy
Forked from Clipy/Clipy. Clipboard extension app for macOS.
EleutherAI / nanoGPT-mup
Forked from karpathy/nanoGPT. The simplest, fastest repository for training/finetuning medium-sized GPTs.