- Beijing
- https://scholar.google.com/citations?hl=zh-CN&user=MBR97ZIAAAAJ
Stars
Code for paper: [ICLR2025 Oral] FlexPrefill: A Context-Aware Sparse Attention Mechanism for Efficient Long-Sequence Inference
Radial Attention Official Implementation
Genai-bench is a powerful benchmark tool designed for comprehensive token-level performance evaluation of large language model (LLM) serving systems.
[Arxiv] Discrete Diffusion in Large Language and Multimodal Models: A Survey
Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL
NoakLiu / FastCache-xDiT
Forked from xdit-project/xDiT
FastCache: Fast Caching for Diffusion Transformer Through Learnable Linear Approximation [Efficient ML Model]
Official implementation of "Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding"
A Collection of Papers on Diffusion Language Models
Official PyTorch implementation for "Large Language Diffusion Models"
Train your Agent model via our easy and efficient framework
[ICML2025] Sparse VideoGen: Accelerating Video Diffusion Transformers with Spatial-Temporal Sparsity
This is a repo to track the latest autoregressive visual generation papers.
Chinese-language reinforcement learning tutorial (the "Mushroom Book" 🍄); read online at: https://datawhalechina.github.io/easy-rl/
[ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
A sparse attention kernel supporting mixed sparse patterns
[ECCV24] MixDQ: Memory-Efficient Few-Step Text-to-Image Diffusion Models with Metric-Decoupled Mixed Precision Quantization
[ICLR'25] ViDiT-Q: Efficient and Accurate Quantization of Diffusion Transformers for Image and Video Generation
📄 Awesome CV is a LaTeX template for your outstanding job application
SpargeAttention: A training-free sparse attention that can accelerate any model inference.
MAGI-1: Autoregressive Video Generation at Scale
A Distributed Attention Towards Linear Scalability for Ultra-Long Context, Heterogeneous Data Training
Multi-SWE-bench: A Multilingual Benchmark for Issue Resolving