8000 bsdcfp / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View bsdcfp's full-sized avatar

Block or report bsdcfp

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory…

Python 2,418 421 Updated May 20, 2025

VGDFR: Diffuison-based Video Generation with Dynamic Frame Rate

Python 10 Updated May 16, 2025

Neighborhood Attention Extension. Bringing attention to a neighborhood near you!

Cuda 497 41 Updated Mar 18, 2025

Lets make video diffusion practical!

Python 13,330 1,142 Updated May 4, 2025

Efficient Triton Kernels for LLM Training

Python 5,034 324 Updated May 19, 2025

End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 training).

Python 351 12 Updated Feb 19, 2025

https://wavespeed.ai/ [WIP] The all in one inference optimization solution for ComfyUI, universal, flexible, and fast.

Python 1,018 42 Updated Mar 27, 2025

https://wavespeed.ai/ Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.

Python 1,254 80 Updated Mar 27, 2025

Helpful tools and examples for working with flex-attention

Python 791 45 Updated May 5, 2025

SD.Next: All-in-one WebUI for AI generative image and video creation

Python 6,299 483 Updated May 18, 2025

NVIDIA curated collection of educational resources related to general purpose GPU programming.

Jupyter Notebook 448 81 Updated May 13, 2025

🚀 Efficient implementations of state-of-the-art linear attention models in Torch and Triton

Python 2,400 165 Updated May 19, 2025

[CVPR 2025] DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention

Python 163 7 Updated Mar 1, 2025

Official PyTorch Implementation of "Optimal Stepsize for Diffusion Sampling".

Python 160 10 Updated Apr 13, 2025
Jupyter Notebook 166 9 Updated Jan 14, 2025

https://wavespeed.ai/ Context parallel attention that accelerates DiT model inference with dynamic caching

Python 277 27 Updated May 13, 2025

⚡ Flash Diffusion ⚡: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation (AAAI 2025 Oral)

Python 588 39 Updated Mar 11, 2025

[TMLR 2025] Efficient Diffusion Models: A Survey

59 3 Updated Apr 29, 2025

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 9,490 672 Updated May 14, 2025

Accelerate inference in Flux and Sana for ComfyUI.

Python 198 4 Updated Mar 13, 2025

[ICLR 2025] FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality

Python 220 10 Updated Dec 27, 2024

Official repo for CFG-Zero*

Python 552 20 Updated May 2, 2025

Wan 2.1 for the GPU Poor

Python 861 98 Updated May 18, 2025

Implementation of "EasyControl: Adding Efficient and Flexible Control for Diffusion Transformer"

Python 1,477 115 Updated Apr 14, 2025
Python 2,084 197 Updated Apr 28, 2025

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 11,404 1,098 Updated May 14, 2025

Enjoy the magic of Diffusion models!

Python 8,651 775 Updated May 19, 2025

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 10,050 877 Updated Apr 27, 2025

Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model

Python 793 30 Updated Apr 18, 2025
Next
0