Yifei-Zuo

Yifei Zuo Yifei-Zuo

50 followers · 119 following

Northwestern University
Evanston, IL
https://yifei-zuo.github.io/
in/yifei-zuo-5a6138235

Achievements

Highlights

Organizations

Starred repositories

pytorch / torchtune

PyTorch native post-training library

Python 5,171 599 Updated May 11, 2025

vectozavr / llm-hessian

Using PyTorch autograd to compute Hessian of Perplexity for Large Language Models

Python 21 1 Updated Apr 17, 2025

saprmarks / dictionary_learning

Python 288 68 Updated Feb 12, 2025

pytorch-labs / LeanRL

LeanRL is a fork of CleanRL, where selected PyTorch scripts optimized for performance using compile and cudagraphs.

Python 566 25 Updated Oct 26, 2024

Dongxiaojie996 / xiyingdong_phd_thesis

董袭莹的博士论文

181 20 Updated Apr 30, 2025

wolfecameron / nanoMoE

Forked from karpathy/nanoGPT

An extension of the nanoGPT repository for training small MOE models.

Python 140 16 Updated Mar 9, 2025

ekinakyurek / marc

Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning"

Python 307 28 Updated Nov 19, 2024

kyleliang919 / Super_Muon

Python 54 4 Updated Mar 21, 2025

huggingface / Math-Verify

Python 685 28 Updated Apr 28, 2025

MoonshotAI / Kimina-Prover-Preview

Technical report of Kimina-Prover Preview.

278 8 Updated May 10, 2025

huggingface / diffusers

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.

Python 28,922 5,945 Updated May 11, 2025

cvxpy / cvxpy

A Python-embedded modeling language for convex optimization problems.

C++ 5,746 1,101 Updated May 11, 2025

test-time-training / ttt-tk

Cuda 28 1 Updated Apr 7, 2025

pytorch / torchtitan

A PyTorch native library for large-scale model training

Python 3,678 356 Updated May 10, 2025

google / learned_optimization

Python 776 65 Updated Apr 24, 2025

ByteDance-Seed / Triton-distributed

Distributed Triton for Parallel Systems

MLIR 677 41 Updated May 2, 2025

hyperopt / hyperopt

Distributed Asynchronous Hyperparameter Optimization in Python

Python 7,406 1,064 Updated Feb 4, 2025

shtoshni / learning-chess-blindfolded

AAAI 2022 Paper: Bet even Beth Harmon couldn't learn chess like that :)

Jupyter Notebook 38 7 Updated Mar 3, 2021

NX-AI / mlstm_kernels

Tiled Flash Linear Attention library for fast and efficient mLSTM Kernels.

Jupyter Notebook 56 2 Updated May 11, 2025

lixilinx / psgd_torch

Pytorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation preconditioner and more)

Python 174 12 Updated May 10, 2025

ai-dynamo / dynamo

A Datacenter Scale Distributed Inference Serving Framework

Rust 3,975 352 Updated May 11, 2025

buoyancy99 / diffusion-forcing

code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"

Python 852 40 Updated Apr 1, 2025

Zymrael / savanna

Pretraining infrastructure for multi-hybrid AI model architectures

Python 155 13 Updated May 7, 2025

kyleliang919 / Online-Subspace-Descent

This repo is based on https://github.com/jiaweizzhao/GaLore

Python 27 Updated Sep 18, 2024

lmgame-org / GamingAgent

Computer gaming agents that run on your PC and laptops.

Python 589 62 Updated May 10, 2025

deepseek-ai / DeepGEMM

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Python 5,314 585 Updated May 9, 2025

tile-ai / tilelang

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

C++ 1,110 84 Updated May 11, 2025

pytorch / tensordict

TensorDict is a pytorch dedicated tensor container.

Python 924 89 Updated May 9, 2025

deepseek-ai / DeepEP

DeepEP: an efficient expert-parallel communication library

Cuda 7,616 753 Updated May 8, 2025

deepseek-ai / FlashMLA

FlashMLA: Efficient MLA decoding kernels

Cuda 11,531 831 Updated Apr 29, 2025

Yifei Zuo Yifei-Zuo

Highlights

Organizations

Starred repositories

Machine learning

C++