-
Megatron-LM Public
Forked from NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
Python Other Updated Jun 6, 2025 -
verl Public
Forked from volcengine/verl
veRL: Volcano Engine Reinforcement Learning for LLM
Python Apache License 2.0 Updated May 15, 2025 -
flash-linear-attention Public
Forked from fla-org/flash-linear-attention
🚀 Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton
Python MIT License Updated Feb 4, 2025 -
flame Public
Forked from fla-org/flame
🔥 A minimal training framework for scaling FLA models
Python MIT License Updated Feb 4, 2025 -
vllm Public
Forked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
-
torchtitan Public
Forked from pytorch/torchtitan
A PyTorch native library for large model training
Python BSD 3-Clause "New" or "Revised" License Updated Jan 2, 2025 -
hands-on-pmpp Public
A collection of my solutions and code implementations for exercises from 'Programming Massively Parallel Processors' (PMPP)
Cuda Updated Dec 9, 2024 -
hqq Public
Forked from mobiusml/hqq
Official implementation of Half-Quadratic Quantization (HQQ)
Python Apache License 2.0 Updated Nov 21, 2024 -
tennis-interview-reader Public
Search for and read summaries of player interviews instead of watching full videos
Python Updated Sep 16, 2024 -
causal-conv1d Public
Forked from Dao-AILab/causal-conv1d
Causal depthwise conv1d in CUDA, with a PyTorch interface
Cuda BSD 3-Clause "New" or "Revised" License Updated Aug 12, 2024 -
grouped_gemm Public
Forked from fanshiqing/grouped_gemm
PyTorch bindings for CUTLASS grouped GEMM.
Cuda Apache License 2.0 Updated Jun 3, 2024 -
chat-terminal Public
chat-terminal is a CLI tool designed to simplify the process of using repetitive prompts with ChatGPT.
Rust Updated Mar 26, 2024 -
Triton-Puzzles Public
Forked from srush/Triton-Puzzles
Puzzles for learning Triton
Jupyter Notebook Apache License 2.0 Updated Mar 26, 2024 -
self-rewarding-lm-pytorch Public
Forked from lucidrains/self-rewarding-lm-pytorch
Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI
Python MIT License Updated Jan 26, 2024 -
nanoGPT Public
Forked from karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Python MIT License Updated Jan 14, 2024 -
A series of large language models trained from scratch by developers @01-ai
Python Apache License 2.0 Updated Jan 11, 2024 -
OpenRLHF Public
Forked from OpenRLHF/OpenRLHF
A Ray-based High-performance RLHF framework (for 7B on RTX4090 and 34B on A100)
Python Apache License 2.0 Updated Nov 18, 2023 -
lox Public
Another implementation of the Lox language from the book Crafting Interpreters
-
salsa Public
Forked from salsa-rs/salsa
A generic framework for on-demand, incrementalized computation. Inspired by adapton, glimmer, and rustc's query system.
Rust Apache License 2.0 Updated Nov 10, 2023 -
DeepSpeedExamples Public
Forked from deepspeedai/DeepSpeedExamples
Example models using DeepSpeed
Python Apache License 2.0 Updated Nov 7, 2023 -
dada Public
Forked from dada-lang/dada
I speak only of myself since I do not wish to convince, I have no right to drag others into my river, I oblige no one to follow me and everybody practices his art in his own way.
JavaScript Apache License 2.0 Updated Sep 11, 2023