Lists (7)
Sort Name ascending (A-Z)
Stars
Code release for paper "Test-Time Training Done Right"
Official Code Release for [SIGGRAPH 2025] RenderFormer: Transformer-based Neural Rendering of Triangle Meshes with Global Illumination
Let Them Talk: Audio-Driven Multi-Person Conversational Video Generation
Official implementation of EPiC: Efficient Video Camera Control Learning with Precise Anchor-Video Guidance
Uni3C: Unifying Precisely 3D-Enhanced Camera and Human Motion Controls for Video Generation
Official repository for the paper "CAP4D: Creating Animatable 4D Portrait Avatars with Morphable Multi-View Diffusion Models"
“FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching” FlowAR employs a simplest scale design and is compatible with any VAE.
IDOL: Instant Photorealistic 3D Human Creation from a Single Image. An open-source project for fast, high-fidelity, and generalizable 3D human reconstruction from a single image.
A curated list of awesome resources, tools, libraries, and applications related to Flux AI technology. This repository aims to be a comprehensive collection for developers, researchers, and enthusi…
[CVPR 2022] FaceFormer: Speech-Driven 3D Facial Animation with Transformers
[ARXIV'25] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video
[CVPR 2025 Highlight] Real-time High-fidelity Gaussian Human Avatars with Position-based Interpolation of Spatially Distributed MLPs
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
[ICLR'25] SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints
🔥🔥 UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioning
Official repo for "GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation"
OmniSVG is the first family of end-to-end multimodal SVG generators that leverage pre-trained Vision-Language Models (VLMs), capable of generating complex and detailed SVGs, from simple icons to in…
Train universal codec avatars
[ICLR 2025 Oral] Official code for "LVSM: A Large View Synthesis Model with Minimal 3D Inductive Bias"
BG-Triangle: Bézier Gaussian Triangle for 3D Vectorization and Rendering
Code for: "Long-Context Autoregressive Video Modeling with Next-Frame Prediction"
SparseFlex: High-Resolution and Arbitrary-Topology 3D Shape Modeling
[Siggraph Asia 2024] Follow-Your-Emoji: This repo is the official implementation of "Follow-Your-Emoji: Fine-Controllable and Expressive Freestyle Portrait Animation"
[CVPR-2025] The official code of HunyuanPortrait: Implicit Condition Control for Enhanced Portrait Animation
real time face swap and one-click video deepfake with only a single image