Stars
Script for automatic programming with GPT-4
A real-time silent speech recognition tool.
HunyuanVideo: A Systematic Framework For Large Video Generation Model
LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models (LLM-grounded Diffusion: LMD, TMLR 2024)
Zero-Shot Speech Editing and Text-to-Speech in the Wild
An SDK/Python library for AUTOMATIC1111 to run state-of-the-art diffusion models
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
A linear estimator on top of CLIP to predict the aesthetic quality of pictures
High-Resolution Image Synthesis with Latent Diffusion Models