-
SAIS, UCAS
- Beijing
-
22:23
(UTC -12:00)
Highlights
- Pro
Lists (1)
Sort Name ascending (A-Z)
Stars
LP-OVOD: Open-Vocabulary Object Detection by Linear Probing (WACV 2024)
This is the official implementation for our ICLR 2025 paper "Cyclic Contrastive Knowledge Transfer for Open-Vocabulary Object Detection"
A lightweight script for real-time GPU monitoring. Automatically sends email alerts when VRAM usage drops below a specified threshold.
🚀 One-stop solution for creating your digital avatar from chat logs 💡 Fine-tune LLMs with your chat logs to capture your unique style, then bind to a chatbot to bring your digital self to life. 从聊天…
An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
[NeurIPS 2022] Official repository of paper titled "Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection".
Object-Aware Distillation Pyramid for Open-Vocabulary Object Detection
[AAAI2025] Code Release of OV-DQUO: Open-Vocabulary DETR with Denoising Text Query Training and Open-World Unknown Objects Supervision
Official code for TPAMI 2025 paper "ProtoGCD: Unified and Unbiased Prototype Learning for Generalized Category Discovery"
A list of papers that studies Novel Class Discovery
[CVPR 2022] Official code for "RegionCLIP: Region-based Language-Image Pretraining"
(TMM 2025) Official repository of paper "A Hierarchical Semantic Distillation Framework for Open-Vocabulary Object Detection"
A curated list of papers and resources related to Described Object Detection, Open-Vocabulary/Open-World Object Detection and Referring Expression Comprehension. Updated frequently and pull request…
Avalanche: an End-to-End Library for Continual Learning based on PyTorch.
A central hub for gathering and showcasing amazing projects that extend OpenMMLab with SAM and other exciting features.
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
Awesome OVD-OVS - A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future
FreeDA: Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation (CVPR 2024)
A GUI Agent application based on UI-TARS(Vision-Language Model) that allows you to control your computer using natural language.
Generation of diagrams like flowcharts or sequence diagrams from text in a similar manner as markdown
Segment Anything combined with CLIP
Code for CVPR2025 "MMRL: Multi-Modal Representation Learning for Vision-Language Models" and its extension "MMRL++: Parameter-Efficient and Interaction-Aware Representation Learning for Vision-Lang…