yigithanyigit (Yiğithan Yiğit) / Starred · GitHub

Making Flux go brrr on GPUs.

Python 67 5 Updated Jul 4, 2025

FlashInfer: Kernel Library for LLM Serving

Cuda 3,301 363 Updated Jul 3, 2025

Radial Attention Official Implementation

Python 259 10 Updated Jul 3, 2025

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorR…

C++ 10,926 1,555 Updated Jul 4, 2025

A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")

C++ 341 61 Updated Jul 4, 2025

My annotated papers and meeting recordings for the EleutherAI ML Performance research paper reading group

Python 18 Updated May 18, 2025

A Quirky Assortment of CuTe Kernels

Python 126 4 Updated Jul 4, 2025

Mirage: Automatically Generating Fast GPU Kernels without Programming in Triton/CUDA

C++ 1,514 92 Updated Jul 2, 2025

[NeurIPS 2024] One-Step Effective Diffusion Network for Real-World Image Super-Resolution

Python 454 29 Updated Apr 20, 2025

End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 training).

Python 365 13 Updated May 29, 2025

(CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models

Python 711 32 Updated May 17, 2025

FastVideo is a unified framework for accelerated video generation.

Python 1,577 107 Updated Jul 4, 2025

From Reusing to Forecasting: Accelerating Diffusion Models with TaylorSeers

Python 216 14 Updated Jun 25, 2025

A pipeline parallel training script for diffusion models.

Python 1,220 166 Updated Jul 3, 2025

Pocket Flow: Codebase to Tutorial

Python 10,542 1,164 Updated May 23, 2025

A TTS model capable of generating ultra-realistic dialogue in one pass.

Python 17,318 1,424 Updated Jun 28, 2025

🎬 3.7× faster video generation E2E 🖼️ 1.6× faster image generation E2E ⚡ ColumnSparseAttn 9.3× vs FlashAttn‑3 💨 ColumnSparseGEMM 2.5× vs cuBLAS

Cuda 74 2 Updated Jun 26, 2025

A unified library of state-of-the-art model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment…

Python 1,027 94 Updated Jun 30, 2025

Self-contained, minimalistic implementation of diffusion models with PyTorch.

Python 1,075 137 Updated Jun 28, 2022

The ultimate training toolkit for finetuning diffusion models

Python 5,030 595 Updated Jun 29, 2025

[NeurIPS 2024] AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising

Python 202 12 Updated Feb 22, 2025

[CVPR 2024] DeepCache: Accelerating Diffusion Models for Free

Python 902 45 Updated Jun 27, 2024

Official repo for CFG-Zero*

Python 619 21 Updated May 2, 2025

Accessible large language models via k-bit quantization for PyTorch.

Python 7,192 712 Updated Jul 2, 2025

OneDiff: An out-of-the-box acceleration library for diffusion models.

Jupyter Notebook 1,904 124 Updated May 8, 2025

Applied AI experiments and examples for PyTorch

Python 278 29 Updated May 29, 2025

PyTorch bindings for CUTLASS grouped GEMM.

Cuda 130 40 Updated Jan 2, 2025

Combining TeaCache with xDiT to Accelerate Visual Generation Models

Python 26 6 Updated Apr 21, 2025

XAttention: Block Sparse Attention with Antidiagonal Scoring

Python 170 10 Updated Jun 28, 2025

https://wavespeed.ai/ Context parallel attention that accelerates DiT model inference with dynamic caching

Python 319 30 Updated May 13, 2025