vicchu

Starred repositories

BoltzmannEntropy / interviews.ai

It is my belief that you, the postgraduate students and job-seekers for whom the book is primarily meant, will benefit from reading it; however, it is my hope that even the most experienced research…

4,636 304 Updated Jan 21, 2022

michuanhaohao / reid-strong-baseline

Bag of Tricks and A Strong Baseline for Deep Person Re-identification

Python 2,306 579 Updated Apr 23, 2020

pliang279 / awesome-multimodal-ml

Reading list for research topics in multimodal machine learning

6,535 883 Updated Aug 20, 2024

PaddlePaddle / PaddleClas

A treasure chest for visual classification and recognition powered by PaddlePaddle

Python 5,690 1,188 Updated Jul 1, 2025

yfyuan01 / MultiturnFashionRetrieval

SIGIR paper Conversational Fashion Image Retrieval via Multiturn Natural Language Feedback

14 2 Updated Oct 17, 2022

vikshree / QA_PersonSearchLanguageData

This repo consists of the QA dataset collected for performing person search with natural language.

4 Updated Apr 9, 2021

Paranioar / Awesome_Matching_Pretraining_Transfering

The Paper List of Large Multi-Modality Model (Perception, Generation, Unification), Parameter-Efficient Finetuning, Vision-Language Pretraining, Conventional Image-Text Matching for Preliminary Ins…

424 47 Updated Dec 15, 2024

Cuberick-Orion / CIRPLANT

Official implementation of the Composed Image Retrieval using Pretrained LANguage Transformers (CIRPLANT) | ICCV 2021 - Image Retrieval on Real-life Images with Pre-trained Vision-and-Language Models

Python 39 7 Updated Jun 26, 2024

UKPLab / MMT-Retrieval

Python 131 14 Updated Dec 10, 2022

open-mmlab / mmfashion

Open-source toolbox for visual fashion analysis based on PyTorch

Python 1,325 299 Updated May 10, 2024

yuewang-cuhk / awesome-vision-language-pretraining-papers

Recent Advances in Vision and Language PreTrained Models (VL-PTMs)

1,152 104 Updated Aug 19, 2022

Cuberick-Orion / CIRR

Official repository of ICCV 2021 - Image Retrieval on Real-life Images with Pre-trained Vision-and-Language Models

115 4 Updated May 21, 2025

Glovo / foodi-ml-dataset

Jupyter Notebook 60 6 Updated Dec 20, 2023

lancopku / IAIS

[ACL 2021] Learning Relation Alignment for Calibrated Cross-modal Retrieval

Python 31 4 Updated May 16, 2023

amzn / image-to-recipe-transformers

Code for CVPR 2021 paper: Revamping Cross-Modal Recipe Retrieval with Hierarchical Transformers and Self-supervised Learning

Python 85 24 Updated Mar 24, 2021

amzn / fashion-attribute-disentanglement

Python 42 7 Updated Oct 19, 2023

Eurus-Holmes / Awesome-Multimodal-Research

A curated list of Multimodal Related Research.

Python 1,362 150 Updated Aug 5, 2023

ict-bigdatalab / awesome-pretrained-models-for-information-retrieval

A curated list of awesome papers related to pre-trained models for information retrieval (a.k.a., pretraining for IR).

670 49 Updated Jan 7, 2024

YehLi / xmodaler

X-modaler is a versatile and high-performance codebase for cross-modal analytics (e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsens…

Python 969 106 Updated Feb 27, 2023

pzzhang / VinVL

project page for VinVL

356 25 Updated Jul 26, 2023

danieljf24 / awesome-video-text-retrieval

A curated list of deep learning resources for video-text retrieval.

625 66 Updated Oct 20, 2023

bismex / Awesome-cross-modality-person-re-identification

Awesome Cross-modality Person Re-identification

147 32 Updated Jul 14, 2022

willard-yuan / video-text-retrieval-papers

15 2 Updated Sep 16, 2021

TencentYoutuResearch / PersonReID-YouReID

A Simple, High-efficiency, Strong framework for person re-Identification.

Python 73 19 Updated Apr 19, 2021

facebookresearch / ParlAI

A framework for training and evaluating AI models on a variety of openly available dialogue datasets.

Python 10,598 2,095 Updated Nov 3, 2023

facebookresearch / mmf

A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)

Python 5,578 938 Updated Apr 24, 2025

NeverMoreLCH / Awesome-VQA

A reading list of papers about Visual Question Answering.

33 6 Updated Aug 17, 2022

jokieleung / awesome-visual-question-answering

A curated list of Visual Question Answering (VQA) (Image/Video Question Answering), Visual Question Generation, Visual Dialog, Visual Commonsense Reasoning and related areas.

666 94 Updated Jul 6, 2023

ZephyrZhuQi / ssbaseline

Simple is not Easy: A Simple Strong Baseline for TextVQA and TextCaps [AAAI 2021]

Python 57 5 Updated Apr 5, 2022

microsoft / TAP

TAP: Text-Aware Pre-training for Text-VQA and Text-Caption, CVPR 2021 (Oral)

Python 72 10 Updated May 22, 2023
