Stars
The official code implementation of the paper "OmniConsistency: Learning Style-Agnostic Consistency from Paired Stylization Data."
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
A general fine-tuning kit geared toward diffusion models.
Implementation of "EasyControl: Adding Efficient and Flexible Control for Diffusion Transformer"(ICCV2025)
ConceptAttention: A method for interpreting multi-modal diffusion transformers.
Just another reasonably minimal repo for class-conditional training of pixel-space diffusion transformers.
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)
LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models (LLM-grounded Diffusion: LMD, TMLR 2024)
[ICCV 2025] OminiControl: Minimal and Universal Control for Diffusion Transformer
JointDiT: Enhancing RGB-Depth Joint Modeling with Diffusion Transformers
Implementation of Add-it: Training-Free Object Insertion in Images With Pretrained Diffusion Models
Nodes for image juxtaposition for Flux in ComfyUI
Automated context-based evaluation for object detection models.
The ultimate training toolkit for finetuning diffusion models
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 training).
A detailed diagram laying out the full Flux.1 [dev] architecture as shared by Black Forest Labs at https://github.com/black-forest-labs/flux.
A collection of resources and papers on Diffusion Models
An AI-powered custom node for ComfyUI designed to enhance workflow automation and provide intelligent assistance
Basic Stable Diffusion Workflows for ComyUI using minimal custom nodes
This extension provides inference-time optimization techniques to enhance diffusion-based image generation quality through random search and zero-order optimization algorithms, along with an ensemb…
High-Resolution Image Synthesis with Latent Diffusion Models
Official codebase for the Paper “Retrieval-Augmented Diffusion Models”
A beautiful, simple, clean, and responsive Jekyll theme for academics
Github Pages template based upon HTML and Markdown for personal, portfolio-based websites.