8000 Yifei-Zuo (Yifei Zuo) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View Yifei-Zuo's full-sized avatar

Highlights

  • Pro

Organizations

@uwsampl

Block or report Yifei-Zuo

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

PyTorch native post-training library

Python 5,171 599 Updated May 11, 2025

Using PyTorch autograd to compute Hessian of Perplexity for Large Language Models

Python 21 1 Updated Apr 17, 2025

LeanRL is a fork of CleanRL, where selected PyTorch scripts optimized for performance using compile and cudagraphs.

Python 566 25 Updated Oct 26, 2024

董袭莹的博士论文

181 20 Updated Apr 30, 2025

An extension of the nanoGPT repository for training small MOE models.

Python 140 16 Updated Mar 9, 2025

Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning"

Python 307 28 Updated Nov 19, 2024
Python 54 4 Updated Mar 21, 2025
Python 685 28 Updated Apr 28, 2025

Technical report of Kimina-Prover Preview.

278 8 Updated May 10, 2025

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.

Python 28,922 5,945 Updated May 11, 2025

A Python-embedded modeling language for convex optimization problems.

C++ 5,746 1,101 Updated May 11, 2025

A PyTorch native library for large-scale model training

Python 3,678 356 Updated May 10, 2025

Distributed Triton for Parallel Systems

MLIR 677 41 Updated May 2, 2025

Distributed Asynchronous Hyperparameter Optimization in Python

Python 7,406 1,064 Updated Feb 4, 2025

AAAI 2022 Paper: Bet even Beth Harmon couldn't learn chess like that :)

Jupyter Notebook 38 7 Updated Mar 3, 2021

Tiled Flash Linear Attention library for fast and efficient mLSTM Kernels.

Jupyter Notebook 56 2 Updated May 11, 2025

Pytorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation preconditioner and more)

Python 174 12 Updated May 10, 2025

A Datacenter Scale Distributed Inference Serving Framework

Rust 3,975 352 Updated May 11, 2025

code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"

Python 852 40 Updated Apr 1, 2025

Pretraining infrastructure for multi-hybrid AI model architectures

Python 155 13 Updated May 7, 2025

This repo is based on https://github.com/jiaweizzhao/GaLore

Python 27 Updated Sep 18, 2024

Computer gaming agents that run on your PC and laptops.

Python 589 62 Updated May 10, 2025

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Python 5,314 585 Updated May 9, 2025

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

C++ 1,110 84 Updated May 11, 2025

TensorDict is a pytorch dedicated tensor container.

Python 924 89 Updated May 9, 2025

DeepEP: an efficient expert-parallel communication library

Cuda 7,616 753 Updated May 8, 2025

FlashMLA: Efficient MLA decoding kernels

Cuda 11,531 831 Updated Apr 29, 2025
Next
0