HandH1998 (HandH1998) / Starred · GitHub

The true story of the hardship and darkness behind the development of the Noah Pangu large model.

6,529 899 Updated Jul 7, 2025

Code for paper: [ICLR2025 Oral] FlexPrefill: A Context-Aware Sparse Attention Mechanism for Efficient Long-Sequence Inference

Python 117 7 Updated May 19, 2025

Radial Attention Official Implementation

Python 279 11 Updated Jul 6, 2025

Genai-bench is a powerful benchmark tool designed for comprehensive token-level performance evaluation of large language model (LLM) serving systems.

Python 157 9 Updated Jul 3, 2025

A Quirky Assortment of CuTe Kernels

Python 126 5 Updated Jul 4, 2025

[Arxiv] Discrete Diffusion in Large Language and Multimodal Models: A Survey

Python 156 2 Updated Jul 7, 2025

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 2,790 208 Updated Jun 20, 2025

FastCache: Fast Caching for Diffusion Transformer Through Learnable Linear Approximation [Efficient ML Model]

Python 28 1 Updated May 30, 2025
Python 88 4 Updated May 22, 2025

Official implementation of "Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding"

Python 272 15 Updated Jul 6, 2025

A Collection of Papers on Diffusion Language Models

84 Updated Jul 4, 2025

Official PyTorch implementation for "Large Language Diffusion Models"

Python 2,498 164 Updated Jun 17, 2025

kernels, of the mega variety

Python 429 22 Updated Jun 2, 2025

Train your Agent model via our easy and efficient framework

Python 1,248 111 Updated Jul 1, 2025
Python 108 8 Updated Jun 6, 2025

[ICML2025] Sparse VideoGen: Accelerating Video Diffusion Transformers with Spatial-Temporal Sparsity

Python 359 15 Updated Jun 6, 2025

This is a repo to track the latest autoregressive visual generation papers.

364 6 Updated Jun 25, 2025

Chinese-language reinforcement learning tutorial (the "Mushroom Book" 🍄); read online at https://datawhalechina.github.io/easy-rl/

Jupyter Notebook 11,832 2,057 Updated Jun 19, 2025

[ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models

Python 2,295 118 Updated Jul 7, 2025

A sparse attention kernel supporting mixed sparse patterns

C++ 245 12 Updated Feb 13, 2025

[ECCV24] MixDQ: Memory-Efficient Few-Step Text-to-Image Diffusion Models with Metric-Decoupled Mixed Precision Quantization

Python 42 4 Updated Nov 27, 2024

[ICLR'25] ViDiT-Q: Efficient and Accurate Quantization of Diffusion Transformers for Image and Video Generation

Python 101 10 Updated Mar 21, 2025

📄 Awesome CV is a LaTeX template for your outstanding job application

TeX 24,764 5,008 Updated Jun 25, 2025

SpargeAttention: A training-free sparse attention that can accelerate any model inference.

Cuda 639 46 Updated Jun 19, 2025

MAGI-1: Autoregressive Video Generation at Scale

Python 3,358 194 Updated Jun 17, 2025

A Distributed Attention Towards Linear Scalability for Ultra-Long Context, Heterogeneous Data Training

Python 405 23 Updated Jul 7, 2025

Multi-SWE-bench: A Multilingual Benchmark for Issue Resolving

Python 202 21 Updated Jul 7, 2025
Python 60 3 Updated Apr 26, 2025