nil0x9 (Tianyang Lin) / Starred · GitHub
🎯
Focusing

Distributed Training Over-The-Internet

946 41 Updated May 15, 2025

DeMo: Decoupled Momentum Optimization

Python 189 9 Updated Dec 2, 2024

PyTorch native quantization and sparsity for training and inference

Python 2,156 294 Updated Jul 8, 2025

Triton-based implementation of Sparse Mixture of Experts.

Python 225 18 Updated Nov 28, 2024

A Quirky Assortment of CuTe Kernels

Python 126 5 Updated Jul 4, 2025
Python 88 4 Updated May 22, 2025

Go ahead and axolotl questions

Python 9,842 1,064 Updated Jul 8, 2025

kernels, of the mega variety

Python 432 22 Updated Jun 2, 2025

GPU operators for sparse tensor operations

Python 33 1 Updated Mar 11, 2024
Python 330 41 Updated Apr 2, 2024
Python 136 8 Updated Feb 15, 2025
Python 138 15 Updated Jul 21, 2024

Accelerating Diffusion Transformers with Token-wise Feature Caching

Python 162 7 Updated Mar 14, 2025

terashuf shuffles multi-terabyte text files using limited memory

C++ 222 15 Updated Feb 5, 2023

Cramming the training of a (BERT-type) language model into limited compute.

Python 1,338 101 Updated Jun 13, 2024

Minimal pretraining script for language modeling in PyTorch. Supporting torch compilation and DDP. It includes a model implementation and a data preprocessing.

Python 27 6 Updated Jul 3, 2025

DataComp for Language Models

HTML 1,323 122 Updated Mar 19, 2025

A framework for few-shot evaluation of language models.

Python 9,474 2,519 Updated Jul 7, 2025

Measuring Massive Multitask Language Understanding | ICLR 2021

Python 1,447 105 Updated May 28, 2023

🎬 3.7× faster video generation E2E 🖼️ 1.6× faster image generation E2E ⚡ ColumnSparseAttn 9.3× vs FlashAttn‑3 💨 ColumnSparseGEMM 2.5× vs cuBLAS

Cuda 74 2 Updated Jun 26, 2025

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,641 263 Updated Jun 18, 2025

Official inference repo for FLUX.1 models

Python 23,309 1,676 Updated Jul 1, 2025

Python logging made (stupidly) simple

Python 22,117 741 Updated Jul 5, 2025
Python 42 5 Updated Jul 2, 2025
Python 33 1 Updated Mar 12, 2025

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Python 1,574 159 Updated Oct 28, 2024

Implementations of attention with the softpick function, naive and FlashAttention-2

Python 80 5 Updated Apr 30, 2025

Clipboard extension app for macOS.

Swift 270 19 Updated Dec 11, 2021

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 141 11 Updated Jun 27, 2025