Lists (1)
Sort Name ascending (A-Z)
Starred repositories
DeepSeek-VL: Towards Real-World Vision-Language Understanding
"Effective Whole-body Pose Estimation with Two-stages Distillation" (ICCV 2023, CV4Metaverse Workshop)
[ICML 2024] MagicPose(also known as MagicDance): Realistic Human Poses and Facial Expressions Retargeting with Identity-aware Diffusion
[CVPR 2022] Thin-Plate Spline Motion Model for Image Animation.
Official PyTorch implementation of BDMM: Bidirectionally Deformable Motion Modulation For Video-based Human Pose Transfer [ICCV 2023]
Code for the paper "Jukebox: A Generative Model for Music"
A collection of papers and codes for human pose transfer
This repository contains the source code for the paper First Order Motion Model for Image Animation
VividTalk: One-Shot Audio-Driven Talking Head Generation Based on 3D Hybrid Prior
NVIDIA's Deep Imagination Team's PyTorch Library
Motion Retargeting Video Subjects
[CVPR2023] Implementation of ''Omni Aggregation Networks for Lightweight Image Super-Resolution".
Core Engine of Singing Voice Conversion & Singing Voice Clone
love112358 / spleeter
Forked from deezer/spleeterDeezer source separation library including pretrained models.
Unsupervised Speech Decomposition Via Triple Information Bottleneck
Clone a voice in 5 seconds to generate arbitrary speech in real-time
🔊 Text-Prompted Generative Audio Model
Unofficial PyTorch Code for “MBLLEN: Low-light Image/Video Enhancement Using CNNs”, BMVC 2018.
Code for “MBLLEN: Low-light Image/Video Enhancement Using CNNs”, BMVC 2018.
Low-Light Image Enhancement via Edge-Enhanced Multi-Exposure Fusion Network
C++ library based on tensorrt integration
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
A Python module to decode video frames directly, using the FFmpeg C API.