Stars
Official PyTorch implementation of "VITON-HD: High-Resolution Virtual Try-On via Misalignment-Aware Normalization" (CVPR 2021)
Huggingface cloth segmentation using U2NET
A License-Plate detection application based on YOLO
License Plate Detection using YOLOv8
OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation
Official implementation of "OpenPifPaf: Composite Fields for Semantic Keypoint Detection and Spatio-Temporal Association" in PyTorch.
[ECCV 2022] ByteTrack: Multi-Object Tracking by Associating Every Detection Box
This repo contains code and a pre-trained model for clothes segmentation.
Code for the ICCV 2021 paper "Pixel Difference Networks for Efficient Edge Detection" (Oral).
Fashion Landmark Detection in the Wild
Code for "Diffusion Model Alignment Using Direct Preference Optimization"
A paper collection of recent diffusion models for text-image generation tasks, e.g., visual text generation, font generation, text removal, text image super resolution, text editing, handwritten ge…
Official Code for Stable Cascade
Dress Code: High-Resolution Multi-Category Virtual Try-On. ECCV 2022
This is the official repository for the paper "Multimodal Garment Designer: Human-Centric Latent Diffusion Models for Fashion Image Editing". ICCV 2023
mfrashad / ClothingGAN
Forked from harskish/ganspace
AI-Powered Clothing Design Generator
A repository to curate and summarise research papers related to fashion and e-commerce
This repository implements the Wav2Vec2 model for speech-to-text: it applies noise removal and speech recognition to transcribe the speech in a video file.
Official PyTorch implementation of ECCV 2024 Paper: ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback.
Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)
Official implementations for paper: Dynamic Typography: Bringing Text to Life via Video Diffusion Prior
Robust Speech Recognition via Large-Scale Weak Supervision
Jupyter Notebooks and code for the book Artificial Intelligence in Finance (O'Reilly) by Yves Hilpisch.
VideoChat-Flash: Hierarchical Compression for Long-Context Video Modeling