Starred repositories
Robust realtime face and facial landmark tracking on CPU with Unity integration
Character Animation (AnimateAnyone, Face Reenactment)
MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation
Background Remover lets you remove the background from images and video using AI, with a simple command-line interface that is free and open source.
Rembg is a tool to remove image backgrounds
The official Python SDK for Model Context Protocol servers and clients
ECCV2022 - Real-Time Intermediate Flow Estimation for Video Frame Interpolation
Reference implementation for DPO (Direct Preference Optimization)
LLaMA: Open and Efficient Foundation Language Models
The official PyTorch implementation of the paper "MotionGPT: Finetuned LLMs are General-Purpose Motion Generators"
Official implementation of "MoMask: Generative Masked Modeling of 3D Human Motions (CVPR2024)"
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Ovis2, InternVL3, Llava, GLM4…
Finetune Qwen3, Llama 4, TTS, DeepSeek-R1 & Gemma 3 LLMs 2x faster with 70% less memory! 🦥
Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation" (CVPR'25 Spotlight).
MuseTalk: Real-Time High Quality Lip Synchronization with Latent Space Inpainting
[CVPR 2024] Code release for "InstanceDiffusion: Instance-level Control for Image Generation"
[CSUR] A Survey on Video Diffusion Models
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Heterogeneous Pre-trained Transformer (HPT) as Scalable Policy Learner.
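Large-model algorithm job interview questions (with answers): analysis of common questions and concepts. "Large-model interview questions", "algorithm job interviews", "common interview questions", "large-model algorithm interviews", "large-model application fundamentals"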
An open-source library with a powerful Contrastive Language-and-Motion (CLaM) pre-training evaluator
Person-homepage
YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]