-
LLaMA-Factory Public
Forked from hiyouga/LLaMA-FactoryUnified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Python Apache License 2.0 UpdatedMay 28, 2025 -
NeMo Public
[ECCV 2024] Finding NeMo: Negative-mined Mosaic Augmentation for Referring Image Segmentation
JavaScript UpdatedMar 6, 2025 -
-
NeMo_test Public
[ECCV 2024] Finding NeMo: Negative-mined Mosaic Augmentation for Referring Image Segmentation
Python UpdatedSep 19, 2024 -
coma Public
Forked from snuvclab/comaOfficial Repository for ECCV 2024 paper Beyond the Contact: Discovering Comprehensive Affordance for 3D Objects from Pre-trained 2D Diffusion Models
Python Other UpdatedSep 14, 2024 -
detectron2 Public
Forked from facebookresearch/detectron2Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
Python Apache License 2.0 UpdatedAug 28, 2023 -
ReLA Public
Forked from henghuiding/ReLA[CVPR2023 Highlight] GRES: Generalized Referring Expression Segmentation
Python MIT License UpdatedAug 28, 2023 -
VPD Public
Forked from wl-zhao/VPD[ICCV 2023] VPD is a framework that leverages the high-level and low-level knowledge of a pre-trained text-to-image diffusion model to downstream visual perception tasks.
Jupyter Notebook MIT License UpdatedAug 10, 2023 -
imagen-pytorch Public
Forked from lucidrains/imagen-pytorchImplementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch
Python MIT License UpdatedApr 26, 2023 -
bkms2_2023spring Public
Forked from simonjisu/bkms2_2023spring2023 Spring Big Data and Knowledge Management System 2
Jupyter Notebook UpdatedApr 24, 2023 -
PSVL Public
Forked from gistvision/PSVLCode for the paper "Zero-shot Natural Language Video Localization" (ICCV2021, Oral).
Python UpdatedApr 10, 2023 -
pix2word Public
Forked from google-research/composed_image_retrievalShell Apache License 2.0 UpdatedMar 26, 2023 -
stable-diffusion Public
Forked from CompVis/stable-diffusionA latent text-to-image diffusion model
Jupyter Notebook Other UpdatedMar 13, 2023 -
ReLoCLNet Public
Forked from 26hzhang/ReLoCLNetVideo Corpus Moment Retrieval with Contrastive Learning (SIGIR 2021)
Python MIT License UpdatedFeb 15, 2023 -
ETF_Correlation_Graph Public
Forked from jhbale11/ETF_Correlation_GraphProject for SNU GSDS BKMS1
Jupyter Notebook MIT License UpdatedDec 18, 2022 -
FashionViL Public
Forked from BrandonHanx/mmf[ECCV 2022] FashionViL: Fashion-Focused V+L Representation Learning
Python Other UpdatedNov 15, 2022 -
-
skip_thoughts Public
Forked from RetroRabbit/skip_thoughtsInstallable package for skip_thoughts forked from Tensorflow/research
Python Apache License 2.0 UpdatedMay 26, 2022 -
-
-
Vision-Language-Transformer Public
Forked from henghuiding/Vision-Language-Transformer[ICCV2021 & TPAMI2023] Vision-Language Transformer and Query Generation for Referring Segmentation
-
-
-
-
-
-
-
dcnet Public
Forked from ozmig77/dcnet[2021 AAAI] Dual Compositional Learning in Interactive Image Retrieval
Jupyter Notebook UpdatedMar 15, 2021 -
DRN Public
Forked from Alvin-Zeng/DRNDense Regression Network for Video Grounding (CVPR2020)
Python UpdatedJan 28, 2021 -
fashioniq2020_retrieval Public
Forked from zyiday/fashioniq2020_retrievalcode for Fashion IQ challenge 2020
Python Apache License 2.0 UpdatedJun 10, 2020