Starred repositories
ComfyUI extension for mixing models during sampling
Official implementations for paper: VACE: All-in-One Video Creation and Editing
woct0rdho / triton-windows
Forked from triton-lang/triton
Fork of the Triton language and compiler for Windows support and easy installation
Official code for VMix: Improving Text-to-Image Diffusion Model with Cross-Attention Mixing Control
KONAKONA666 / LTX-Video
Forked from Lightricks/LTX-Video
LTXVideo Q8
Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation" (CVPR'25 Spotlight).
Accepted as [NeurIPS 2024] Spotlight Presentation Paper
Creating a diffusion model from scratch in PyTorch to learn exactly how they work.
InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models
PyTorch implementation of Diffusion Models (https://arxiv.org/pdf/2006.11239.pdf)
At the moment this is mostly a tech demo to show how to build a web app on top of ComfyUI
Dream Interpreter inside ComfyUI
Custom ComfyUI nodes for Vision Language Models, Large Language Models, Image to Music, Text to Music, Consistent and Random Creative Prompt Generation
Stable Diffusion Regularization Images in 512px, 768px and 1024px on 1.5, 2.1 and SDXL 1.0 checkpoints
Nodes for better inpainting with ComfyUI: Fooocus inpaint model for SDXL, LaMa, MAT, and various other tools for pre-filling inpaint & outpaint areas.
Custom nodes that extend the capabilities of ComfyUI
Official implementations for paper: Anydoor: zero-shot object-level image customization
[CVPR 2024] An End-to-End Tile-Based Framework for High-Resolution Monocular Metric Depth Estimation
Generative models for conditional audio generation
10 Weeks, 20 Lessons, Data Science for All!
Official implementation of the NeurIPS 2023 paper "Photoswap: Personalized Subject Swapping in Images"
A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, …
[CVPR2024, Highlight] Official code for DragDiffusion