Stars
Materials for the LLM Engineering Essentials course
Model Context Protocol Servers
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less VRAM.
SliderSpace: Decomposing the Visual Capabilities of Diffusion Models
ComfyUI Custom Nodes for "TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching". This generates high-quality 44.1kHz audio up to 30 seconds using just a text prompt.
A feature-rich command-line audio/video downloader
Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple text input.
[ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling
the first library to let you embed a developer agent in your own app!
Secure open source cloud runtime for AI apps & AI agents
Syncthing Windows Setup
[CVPR 2024] An End-to-End Tile-Based Framework for High-Resolution Monocular Metric Depth Estimation
GGUF Quantization support for native ComfyUI models
A CLI tool to aggregate your codebase into a single Markdown file for use with Claude Projects or custom ChatGPTs.
ControlNet++: All-in-one ControlNet for image generations and editing!
AuraSR: GAN-based Super-Resolution for real-world
[ECCV 2024 Oral] LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation.
A powerful tool that translates ComfyUI workflows into executable Python code.
FreeU: Free Lunch in Diffusion U-Net (CVPR2024 Oral)
Probabilistic language based on pattern matching and constraint propagation, 153 examples
State-of-the-art 2D and 3D Face Analysis Project
[AAAI 2023] Painterly image harmonization in both spatial domain and frequency domain.
[CVPR'24 Highlight] Official PyTorch implementation of CoDeF: Content Deformation Fields for Temporally Consistent Video Processing