Stars
Official pytorch implementation of ZiRa, a method for incremental vision language object detection (IVLOD),which has been accepted by NeurIPS 2024.
Latent Diffusion Transformer for Talking Video Synthesis
收录及复现的高光谱遥感图像分类模型
Code implementation of ProtoSAM - One Shot Medical Image Segmentation with Foundationl Models
Adapting Segment Anything Model for Medical Image Segmentation
A benchmark for remote sensing image translation (IEEE GRSL 2022).
[TGRS 2025] Code for "PointSAM: Pointly-Supervised Segment Anything Model for Remote Sensing Images"
SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.
[TITS 2024] You Only Look Clusters for Tiny Object Detection in Aerial Images
CAMixerSR: Only Details Need More “Attention” (CVPR 2024)
APISR: Anime Production Inspired Real-World Anime Super-Resolution (CVPR 2024)
[CVPR 2024] Transcending the Limit of Local Window: Advanced Super-Resolution Transformer with Adaptive Token Dictionary
SAM-DiffSR: Structure-Modulated Diffusion Model for Image Super-Resolution
Collect super-resolution related papers, data, repositories
Official implementation of the CVPR 2022 paper "DETReg: Unsupervised Pretraining with Region Priors for Object Detection".
[NeurIPS 2022] Revisiting Realistic Test-Time Training: Sequential Inference and Adaptation by Anchored Clustering
[CVPR2024] GSVA: Generalized Segmentation via Multimodal Large Language Models
(CVPR 2024) Point, Segment and Count: A Generalized Framework for Object Counting
Mixed Pseudo Labels for Semi-Supervised Object Detection
[CVPR 2022] Official CoTTA Code for our paper Continual Test-Time Domain Adaptation
A large remote sensing unlabeled dataset for semi-supervised oriented object detection.
A benchmark for cross-domain few-shot object detection (ECCV24 paper: Cross-Domain Few-Shot Object Detection via Enhanced Open-Set Object Detector)
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
[CVPR 2024] Official Repository for "Efficient Test-Time Adaptation of Vision-Language Models"
This is the official code implementation for 'What, How, and When Should Object Detectors Update in Continually Changing Test Domains?' presented at CVPR 2024.
Medical SAM 2: Segment Medical Images As Video Via Segment Anything Model 2