Lists (1)
Sort Name ascending (A-Z)
Stars
Code accompanying the paper "Generalized Interpolating Discrete Diffusion"
Minimalistic large language model 3D-parallelism training
TransMLA: Multi-Head Latent Attention Is All You Need
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Efficient Triton Kernels for LLM Training
PyTorch building blocks for the OLMo ecosystem
A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.
Democratizing Reinforcement Learning for LLMs
wolfecameron / nanoMoE
Forked from karpathy/nanoGPTAn extension of the nanoGPT repository for training small MOE models.
Stanford Drone Dataset with non-convex Constraints
Sum-of-squares Non-monotonic Probabilistic Circuits
A computer algebra system written in pure Python
Code for "TabZilla: When Do Neural Nets Outperform Boosted Trees on Tabular Data?"
Official implementation of E(n)-equivariant Graph Neural Cellular Automata
A New Modeling Framework for Continuous, Sequential Domains
Code release for Hoogeboom, Emiel, Jorn WT Peters, Rianne van den Berg, and Max Welling. "Integer Discrete Flows and Lossless Compression." Conference on Neural Information Processing Systems (2019).
Squared Non-monotonic Probabilistic Circuits
Vector (and Scalar) Quantization, in Pytorch
Tabular Deep Learning Library for PyTorch
a python framework to build, learn and reason about probabilistic circuits and tensor networks
Pytorch implementation of Block Neural Autoregressive Flow