Stars
FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis
A multi-voice TTS system trained with an emphasis on quality
[CVPR 2025] HunyuanPortrait: Implicit Condition Control for Enhanced Portrait Animation
💖🧸 A container of souls of AI waifu / virtual characters to bring them into our worlds, wishing to achieve Neuro-sama's altitude, completely LLM and AI driven, capable of realtime voice chat, Minec…
A powerful framework for building realtime voice AI agents 🤖🎙️📹
Official pytorch implementation for Learning to Listen: Modeling Non-Deterministic Dyadic Facial Motion (CVPR 2022)
Real time interactive streaming digital human
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
[SIGGRAPH 2025] LAM: Large Avatar Model for One-shot Animatable Gaussian Head
Using Claude Sonnet 3.5 to forward (reverse) engineer code from VASA white paper - WIP - (this is for La Raza 🎷)
Official Pytorch Implementation of SPECTRE: Visual Speech-Aware Perceptual 3D Facial Expression Reconstruction from Videos
A complete head tracking pipeline from videos to NeRF/3DGS-ready datasets.
The automated build & install script for MPI-IS/mesh
Machine learning metrics for distributed, scalable PyTorch applications.
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
Out of time: automated lip sync in the wild
OpenFace – a state-of-the art tool intended for facial landmark detection, head pose estimation, facial action unit recognition, and eye-gaze estimation.
Dynamic Scene Representation Gaussian Splatting
A Conversational Speech Generation Model
Code and dataset for photorealistic Codec Avatars driven from audio
Summary of publicly available ressources such as code, datasets, and scientific papers for the FLAME 3D head model