Stars
[NeurIPS 2024] DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation
TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones
Code for the manim-generated scenes used in 3blue1brown videos
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data…
An efficient pure-PyTorch implementation of Kolmogorov-Arnold Network (KAN).
A basic simulator for multi-robot patrol based on time-step simulation. Easy to set new Robot, Env, patrol Algorithms.
VisionNav is an advanced navigation system for visually impaired individuals, providing real-time voice-guided assistance using OpenCV, SLAM, YOLO for object detection, and Text-to-Speech. It suppo…
C#Halcon视觉软件,2020年05月修整期间编写的工业集成软件框架,目前不从事该行业,因此开放出来交流学习。软件已作删减,仅保留视觉部分,需自行添加Halcon的DLL方可正常运行软件。希望对大家有帮助。
[ICLR 2024] Official pytorch implementation of "ControlVideo: Training-free Controllable Text-to-Video Generation"
Convert your videos to densepose and use it on MagicAnimate
A playbook for systematically maximizing the performance of deep learning models.
An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
PoolFormer: MetaFormer Is Actually What You Need for Vision (CVPR 2022 Oral)
[CVPR 2022--Oral] Restormer: Efficient Transformer for High-Resolution Image Restoration. SOTA for motion deblurring, image deraining, denoising (Gaussian/real data), and defocus deblurring.
[CVPR 2022] Thin-Plate Spline Motion Model for Image Animation.
"Retinexformer: One-stage Retinex-based Transformer for Low-light Image Enhancement" (ICCV 2023) & (NTIRE 2024 Challenge)
The code release of paper "AAAI Ultra-High-Definition Low-Light Image Enhancement: A Benchmark and Transformer-Based Method", AAAI 2023
A curated list of awesome research papers, projects, code, dataset, workshops etc. related to virtual try-on.
A collection of resources and papers on Diffusion Models
Official Repository of "Unpaired Image-to-Image Translation via Neural Schrödinger Bridge" (ICLR 2024)