Stars
A list of referring video object segmentation papers
Project Page for "LISA: Reasoning Segmentation via Large Language Model"
A curated publication list on open vocabulary semantic segmentation and related area (e.g. zero-shot semantic segmentation) resources..
SERL: A Software Suite for Sample-Efficient Robotic Reinforcement Learning
[ECCV24] VISA: Reasoning Video Object Segmentation via Large Language Model
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2
Repository for Ancient Wisdom, Modern Tools: Exploring Retrieval-Augmented LLMs for Ancient Indian Philosophy (Machine Learning for Ancient Languages, ACL Workshop 2024)
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
[ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
Image Polygonal Annotation with Python (polygon, rectangle, circle, line, point and image-level flag annotation).
An efficient modular implementation of Associating Objects with Transformers for Video Object Segmentation in PyTorch
RECOMP: Improving Retrieval-Augmented LMs with Compression and Selective Augmentation.
Code and data release for the paper "Learning Object State Changes in Videos: An Open-World Perspective" (CVPR 2024)
[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting
[ICCV2023] VLPart: Going Denser with Open-Vocabulary Part Segmentation
Inpaint anything using Segment Anything and inpainting models.
Segment Anything in High Quality [NeurIPS 2023]
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.
An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) fo…
🔖 Curated list of video object segmentation (VOS) papers, datasets, and projects.
A list of video object segmentation (VOS) papers