Stars
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
Open-Sora: Democratizing Efficient Video Production for All
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
Segment Anything in High Quality [NeurIPS 2023]
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.
[ECCV2024] IDM-VTON : Improving Diffusion Models for Authentic Virtual Try-on in the Wild
A collection of awesome text-to-image generation studies.
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
Robust Video Matting in PyTorch, TensorFlow, TensorFlow.js, ONNX, CoreML!
Stable Diffusion and Flux in pure C/C++
Official implementation of Magic Clothing: Controllable Garment-Driven Image Synthesis
[AAAI 2025] Official implementation of "OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on"
pytorch implementation of openpose including Hand and Body Pose Estimation.
Inpaint anything using Segment Anything and inpainting models.
High-Resolution Image Synthesis with Latent Diffusion Models
Stable Diffusion web UI
Official PyTorch implementation for the paper High-Resolution Virtual Try-On with Misalignment and Occlusion-Handled Conditions (ECCV 2022).
[CVPR 2024] Official RT-DETR (RTDETR paddle pytorch), Real-Time DEtection TRansformer, DETRs Beat YOLOs on Real-time Object Detection. 🔥 🔥 🔥
[CVPR2024] StableVITON: Learning Semantic Correspondence with Latent Diffusion Model for Virtual Try-On
ChatLaw:A Powerful LLM Tailored for Chinese Legal. 中文法律大模型