Starred repositories
EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything
Support iOS17 & Wallpaper: a Tool with Example of how to convert a video into a LivePhoto
Dolphin is a multilingual, multitask ASR model jointly trained by DataoceanAI and Tsinghua University.
Video, Image and GIF upscale/enlarge(Super-Resolution) and Video frame interpolation. Achieved with Waifu2x, Real-ESRGAN, Real-CUGAN, RTX Video Super Resolution VSR, SRMD, RealSR, Anime4K, RIFE, IF…
Tensorflow 2.x based implementation of EDSR, WDSR and SRGAN for single image super-resolution
Implementation of Super Resolution CNN in Keras.
mash up of Wan2.1 + Meta Sapiens + Seaweed Diffusion APT for One-Step Video Generation if you have compute - call me
Wan: Open and Advanced Large-Scale Video Generative Models
Face recognition, face liveness detection: face matching, face compare, face comparison, face identification, face anti-spoofing, face identity, facial recognition, face representation, face recons…
Repository for precision tracking from 3D point clouds
Real-time 3D face tracking and reconstruction from 2D video
[NeurIPS Workshop 2019] Official code of the paper "Probabilistic 3D Multi-Object Tracking for Autonomous Driving." First Place of the First NuScenes Tracking Challenge in the AI Driving Olympics W…
🔥3D-MOT(点云多目标检测和追踪C++) (2020 · 秋) 代码有详细注解
Official implementation of Monocular Quasi-Dense 3D Object Tracking, TPAMI 2022
(IROS 2020, ECCVW 2020) Official Python Implementation for "3D Multi-Object Tracking: A Baseline and New Evaluation Metrics"
Algorithms and Publications on 3D Object Tracking
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
A collection of projects designed to help developers quickly get started with building deployable applications using the Anthropic API
No fortress, purely open ground. OpenManus is Coming.
Get Apple Live Photo from .mp4 or .mov
[ICLR 2025 Oral] TANGO: Co-Speech Gesture Video Reenactment with Hierarchical Audio-Motion Embedding and Diffusion Interpolation
[CVPR2025 Highlight] Video Generation Foundation Models: https://saiyan-world.github.io/goku/
High performance self-hosted photo and video management solution.
CogView4, CogView3-Plus and CogView3(ECCV 2024)
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less VRAM.