Stars
Hydro - Next generation high performance online-judge platform - 新一代高效强大的信息学在线测评系统 (a.k.a. vj5)
A simplified pytorch version of densecap
[NeurIPS 2023] This repo contains the code for our paper Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP
Draw confusion matrix for image classification or others
[AAAI2025] ChatterBox: Multi-round Multimodal Referring and Grounding, Multimodal, Multi-round dialogues
Recent LLM-based CV and related works. Welcome to comment/contribute!
Code for our ICLR 2024 paper "PerceptionCLIP: Visual Classification by Inferring and Conditioning on Contexts"
AAAI 2024: Visual Instruction Generation and Correction
Code for our ACMMM2020 paper "Context-aware Feature Generation for Zero-shot Semantic Segmentation".
Deep Fourier Ranking Quantization for Semi-Supervised Image Retrieval -- TIP22
Neighborhood-Adaptive Structure Augmented Metric Learning -- AAAI2022 oral
MomentDiff: Generative Video Moment Retrieval from Random to Real--NeurIPS 2023
Progressive Spatio-Temporal Prototype Matching for Text-Video Retrieval --ICCV2023 Oral
A temporary webpage for our survey in AGI for computer vision
Segment Anything in High Quality [NeurIPS 2023]
ChatGPT 中文调教指南。各种场景使用指南。学习怎么让它听你的话。
MLNLP社区用来帮助大家避免论文投稿小错误的整理仓库。 Paper Writing Tips
assistant tools for attention visualization in deep learning
Paint by Example: Exemplar-based Image Editing with Diffusion Models
📄 Awesome CV is LaTeX template for your outstanding job application
This may be the simplest implement of DDPM. You can directly run Main.py to train the UNet on CIFAR-10 dataset and see the amazing process of denoising.
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
Code for CVPR23 Highlight "I2MVFormer: Large Language Model Generated Multi-View Document Supervision for Zero-Shot Image Classification" and NeurIPS2022 "I2DFormer: Learning Image to Document Atte…
Dual-Stream Knowledge-Preserving Hashing for Unsupervised Video Retrieval ECCV22