Stars
Zero-shot voice conversion & singing voice conversion, with real-time support
Meridian cuts through news noise by scraping hundreds of sources, analyzing stories with AI, and delivering concise, personalized daily briefs.
Data validation using Python type hints
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN
No fortress, purely open ground. OpenManus is Coming.
Label Studio is a multi-type data labeling and annotation tool with standardized output format
Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are commi…
Code release for the paper "DartControl: A Diffusion-Based Autoregressive Motion Model for Real-Time Text-Driven Motion Control"
LHM: Large Animatable Human Reconstruction Model from a Single Image in Seconds
HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo
HunyuanVideo: A Systematic Framework For Large Video Generation Model
Toolkit for linearizing PDFs for LLM datasets/training
Wan: Open and Advanced Large-Scale Video Generative Models
[CVPR 2025 Highlight] X-Dyna: Expressive Dynamic Human Image Animation
SkyReels V1: The first and most advanced open-source human-centric video foundation model
Real-time pose estimation pipeline with 🤗 Transformers
Markerless kinematics with any cameras — From 2D Pose estimation to 3D OpenSim motion
Official code for "SMPLest-X: Ultimate Scaling for Expressive Human Pose and Shape Estimation"
Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
[CVPR 2025 Highlight] Video Generation Foundation Models: https://saiyan-world.github.io/goku/
Sonic is a method for "Shifting Focus to Global Audio Perception in Portrait Animation"; you can use it in ComfyUI.
A high-quality tool for converting PDF to Markdown and JSON: a one-stop, open-source, high-quality data extraction tool.
[ICLR 2025] DisPose: Disentangling Pose Guidance for Controllable Human Image Animation
The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface.
[CVPR 2025] Hallo3: Highly Dynamic and Realistic Portrait Image Animation with Video Diffusion Transformer