Stars
FlashInfer: Kernel Library for LLM Serving
Radial Attention Official Implementation
TensorRT-LLM provides an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations for efficient inference on NVIDIA GPUs.
A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")
My annotated papers and meeting recordings for the EleutherAI ML Performance research paper reading group
Mirage: Automatically Generating Fast GPU Kernels without Programming in Triton/CUDA
[NeurIPS 2024] One-Step Effective Diffusion Network for Real-World Image Super-Resolution
End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 training).
(CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models
FastVideo is a unified framework for accelerated video generation.
From Reusing to Forecasting: Accelerating Diffusion Models with TaylorSeers
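The TaylorSeers entry above replaces naive feature reuse with forecasting. A toy sketch of that idea, assuming a first-order finite-difference Taylor extrapolation over two cached feature snapshots (names are illustrative, not the paper's implementation):

```python
# Toy sketch: forecast the next diffusion-step feature from two cached
# snapshots via a first-order Taylor expansion, instead of reusing the
# stale feature unchanged. Hypothetical helper, not the repo's API.

def taylor_forecast(f_prev, f_curr, dt=1.0):
    """First-order forecast: f(t + dt) ~= f(t) + (f(t) - f(t - 1)) * dt."""
    return [c + (c - p) * dt for p, c in zip(f_prev, f_curr)]

# Two cached feature snapshots from consecutive denoising steps:
feat_t1 = [0.10, 0.20]
feat_t2 = [0.12, 0.26]

# Extrapolate one step ahead rather than recomputing the network:
pred = taylor_forecast(feat_t1, feat_t2)
```

Higher-order variants keep more snapshots and add curvature terms; the trade-off is a little extra cache memory for a forecast that tracks fast-moving features better than plain reuse.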
A pipeline parallel training script for diffusion models.
Pocket Flow: Codebase to Tutorial
A TTS model capable of generating ultra-realistic dialogue in one pass.
🎬 3.7× faster video generation E2E 🖼️ 1.6× faster image generation E2E ⚡ ColumnSparseAttn 9.3× vs FlashAttn‑3 💨 ColumnSparseGEMM 2.5× vs cuBLAS
A unified library of state-of-the-art model optimization techniques such as quantization, pruning, distillation, and speculative decoding. It compresses deep learning models for downstream deployment.
Self-contained, minimalistic implementation of diffusion models with PyTorch.
The ultimate training toolkit for finetuning diffusion models
[NeurIPS 2024] AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising
[CVPR 2024] DeepCache: Accelerating Diffusion Models for Free
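The DeepCache entry above accelerates diffusion inference by reusing expensive intermediate features across nearby denoising steps. A minimal sketch of that caching pattern, assuming a hypothetical `deep_fn` standing in for the costly deep layers (purely illustrative, not the repo's code):

```python
# Toy sketch of cross-step feature caching (the DeepCache idea):
# recompute the expensive "deep" features only every cache_interval
# steps and reuse the cached result in between.

def run_with_cache(steps, deep_fn, cache_interval=3):
    """Call deep_fn only every cache_interval steps; reuse otherwise."""
    cache, outputs, calls = None, [], 0
    for step in range(steps):
        if step % cache_interval == 0:
            cache = deep_fn(step)   # expensive recomputation
            calls += 1
        outputs.append(cache)       # cheap reuse on the other steps
    return outputs, calls

# 10 denoising steps, but the expensive path runs only at steps 0, 3, 6, 9:
outputs, calls = run_with_cache(10, lambda t: t * 2, cache_interval=3)
```

The real method exploits the observation that a U-Net's deepest features change slowly between adjacent steps, so only the shallow layers need to run every step.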
Accessible large language models via k-bit quantization for PyTorch.
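The entry above concerns k-bit quantization. A minimal sketch of the core absmax scheme, assuming hypothetical helper names (this is the general idea, not the library's actual API):

```python
# Minimal sketch of absmax 8-bit quantization: scale floats by the
# absolute maximum so the largest value maps to the top of the signed
# integer range, store integers plus one scale factor.

def quantize_absmax(values, bits=8):
    """Map floats to signed integers in [-2^(bits-1), 2^(bits-1) - 1]."""
    qmax = 2 ** (bits - 1) - 1                      # 127 for int8
    scale = (max(abs(v) for v in values) / qmax) or 1.0
    return [round(v / scale) for v in values], scale

def dequantize_absmax(quantized, scale):
    """Recover approximate floats from integers and the stored scale."""
    return [q * scale for q in quantized]

weights = [0.12, -0.5, 0.33, 1.0]
q, s = quantize_absmax(weights)
recovered = dequantize_absmax(q, s)
```

Per-block scales (one scale per small group of weights) limit the damage a single outlier does to everyone else's precision, which is why practical k-bit schemes quantize block-wise rather than per-tensor.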
OneDiff: An out-of-the-box acceleration library for diffusion models.
Applied AI experiments and examples for PyTorch
fanshiqing / grouped_gemm
Forked from tgale96/grouped_gemm. PyTorch bindings for CUTLASS grouped GEMM.
Combining Teacache with xDiT to Accelerate Visual Generation Models
XAttention: Block Sparse Attention with Antidiagonal Scoring
Context parallel attention that accelerates DiT model inference with dynamic caching (https://wavespeed.ai/)