Stars
prime-rl is a codebase for decentralized async RL training at scale
Official PyTorch implementation for "Large Language Diffusion Models"
Official implementation of "Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding"
3x Faster Inference; Unofficial implementation of EAGLE Speculative Decoding
One minute of voice data can be used to train a good TTS model! (few-shot voice cloning)
Robust Speech Recognition via Large-Scale Weak Supervision
Train transformer language models with reinforcement learning.
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
🐳 Efficient Triton implementations for "Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention"
verl: Volcano Engine Reinforcement Learning for LLMs
High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.
A fast communication-overlapping library for tensor/expert parallelism on GPUs.
Collection of best practices, reference architectures, model training examples and utilities to train large models on AWS.
Best practices & guides on how to write distributed pytorch training code
LLM training parallelisms (DP, FSDP, TP, PP) in pure C
💯 Curated coding interview preparation materials for busy software engineers
This project shares technical principles and hands-on experience with large language models (LLM engineering and production deployment of LLM applications).
A Telegram bot to recommend arXiv papers
LLM training code for Databricks foundation models
Mirage: Automatically Generating Fast GPU Kernels without Programming in Triton/CUDA
AIHawk aims to ease the job-hunting process by automating job applications. Using artificial intelligence, it enables users to apply to multiple jobs in a tailored way.
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
SGLang is a fast serving framework for large language models and vision language models.
AirLLM: 70B model inference on a single 4GB GPU