Stars
A summary of diffusion-based image processing, including restoration, enhancement, coding, and quality assessment
Diffusion Model-Based Image Editing: A Survey (TPAMI 2025)
[TPAMI'23] Unifying Flow, Stereo and Depth Estimation
[CSUR] A Survey on Video Diffusion Models
An elegant \LaTeX\ résumé template. Mainland China mirror: https://gods.coding.net/p/resume/git
Official implementation of the paper “MagicDrive3D: Controllable 3D Generation for Any-View Rendering in Street Scenes”
[ICLR24] Official implementation of the paper “MagicDrive: Street View Generation with Diverse 3D Geometry Control”
[CVPR2024] Official Repository of Paper "Panacea: Panoramic and Controllable Video Generation for Autonomous Driving"
MiniSora: A community that aims to explore the implementation path and future development direction of Sora.
A collection of awesome video generation studies.
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, with automatic mixed precision (including fp8) and easy-to-configure FSDP and DeepSpeed support
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
[ECCV 2024] Official implementation of the paper "DrivingDiffusion: Layout-Guided Multi-View Driving Scenarios Video Generation with Latent Diffusion Model".
[ECCV 2022 oral] Monocular 3D Object Detection with Depth from Motion
Awesome Monocular 3D detection
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch
Vision-Centric BEV Perception: A Survey
Awesome papers about Multi-Camera 3D Object Detection and Segmentation in Bird's-Eye-View, such as DETR3D, BEVDet, BEVFormer, BEVDepth, UniAD
Paper reading notes on Deep Learning and Machine Learning
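The Chinese version of the CS Missing Semester
Interview experiences compiled by Datawhale members, covering machine learning, CV, NLP, recommendation, development, and more. Stars are welcome.
Documenting the growth path of a CV algorithm engineer and sharing tech-stack notes on computer vision and on model compression and deployment. https://harleyszhang.github.io/cv_note/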
🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.
Two-Stage Peer-Regularized Feature Recombination for Arbitrary Image Style Transfer
The author's officially unofficial PyTorch BigGAN implementation.