8000 qingquansong (Qingquan Song) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View qingquansong's full-sized avatar

Block or report qingquansong

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

prime-rl is a codebase for decentralized async RL training at scale

Python 356 51 Updated Jul 5, 2025

Official PyTorch implementation for "Large Language Diffusion Models"

Python 2,484 164 Updated Jun 17, 2025

Official implementation of "Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding"

Python 270 15 Updated Jul 6, 2025

3x Faster Inference; Unofficial implementation of EAGLE Speculative Decoding

Python 69 11 Updated Jul 3, 2025

所有小初高、大学PDF教材。

Roff 43,662 9,749 Updated May 18, 2025

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 48,432 5,329 Updated Jul 2, 2025

Robust Speech Recognition via Large-Scale Weak Supervision

Python 84,425 10,297 Updated Jun 26, 2025

Train transformer language models with reinforcement learning.

Python 14,459 2,015 Updated Jul 4, 2025

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Python 3,366 267 Updated Jul 5, 2025

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,852 279 Updated May 15, 2025

🐳 Efficient Triton implementations for "Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention"

Python 712 33 Updated Mar 19, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 10,380 1,725 Updated Jul 5, 2025

High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.

Python 1,154 77 Updated Jul 5, 2025

A fast communication-overlapping library for tensor/expert parallelism on GPUs.

C++ 998 67 Updated May 28, 2025

Expert Parallelism Load Balancer

Python 1,227 195 Updated Mar 24, 2025

Collection of best practices, reference architectures, model training examples and utilities to train large models on AWS.

Shell 318 127 Updated Jul 4, 2025

Best practices & guides on how to write distributed pytorch training code

Python 444 37 Updated Feb 24, 2025

Puzzles for learning Triton

Jupyter Notebook 1,741 138 Updated Nov 18, 2024

LLM training parallelisms (DP, FSDP, TP, PP) in pure C

C 7 Updated Dec 27, 2024

💯 Curated coding interview preparation materials for busy software engineers

TypeScript 127,337 15,513 Updated Jun 6, 2025

本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)

HTML 19,104 2,276 Updated Jul 3, 2025
Python 1,378 200 Updated Jun 26, 2025

A Telegram bot to recommend arXiv papers

Python 275 24 Updated Apr 12, 2025

JAX Implementation of Liger Kernels

Python 9 1 Updated Oct 31, 2024

LLM training code for Databricks foundation models

Python 4,274 568 Updated Jun 30, 2025

Mirage: Automatically Generating Fast GPU Kernels without Programming in Triton/CUDA

C++ 1,518 91 Updated Jul 2, 2025

AIHawk aims to easy job hunt process by automating the job application process. Utilizing artificial intelligence, it enables users to apply for multiple jobs in a tailored way.

Python 28,405 4,288 Updated May 28, 2025

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 6,651 568 Updated Jul 4, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 15,744 2,271 Updated Jul 6, 2025

AirLLM 70B inference with single 4GB GPU

Jupyter Notebook 5,810 459 Updated May 6, 2025
Next
0