8000 IMCCretrieval (Eggroll) / Starred · GitHub

More Web Proxy on the site http://driver.im/

IMCCretrieval

Follow

Eggroll IMCCretrieval

Follow

12 followers · 6 following

Achievements

Achievements

Stars

xialeiliu / Awesome-Incremental-Learning

Awesome Incremental Learning

4,091 598 Updated Apr 28, 2025

fengyang95 / ebooks

157 65 Updated Sep 4, 2022

penghao-wu / vstar

PyTorch Implementation of "V* : Guided Visual Search as a Core Mechanism in Multimodal LLMs"

Python 622 39 Updated Jan 7, 2024

ttengwang / Awesome_Long_Form_Video_Understanding

Awesome papers & datasets specifically focused on long-term videos.

277 12 Updated Nov 15, 2024

huangb23 / VTimeLLM

[CVPR'2024 Highlight] Official PyTorch implementation of the paper "VTimeLLM: Empower LLM to Grasp Video Moments".

Python 279 12 Updated Jun 13, 2024

lntzm / MESM

The official code of Towards Balanced Alignment: Modal-Enhanced Semantic Modeling for Video Moment Retrieval (AAAI2024)

Python 29 3 Updated Mar 29, 2024

BAAI-DCAI / Bunny

A family of lightweight multimodal models.

Python 1,020 74 Updated Nov 18, 2024

yunlong10 / Awesome-LLMs-for-Video-Understanding

🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.

2,377 108 Updated Jun 4, 2025

facebookresearch / vicreg

VICReg official code base

Python 538 91 Updated Jul 6, 2023

USTC-IMCC / PaperReading

Paper Reading of IMCC groups.

18 16 Updated Apr 23, 2025

showlab / Awesome-Video-Diffusion

A curated list of recent diffusion models for video generation, editing, and various other applications.

4,502 270 Updated Jun 6, 2025

KevinLight831 / CTP

[ICCV2023] - CTP: Towards Vision-Language Continual Pretraining via Compatible Momentum Contrast and Topology Preservation

Python 35 5 Updated Oct 8, 2024

TXH-mercury / VALOR

[TPAMI2024] Codes and Models for VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset

Python 293 17 Updated Dec 25, 2024

zhoubolei / bolei_awesome_posters

CVPR and NeurIPS poster examples and templates. May we have in-person poster session soon!

1,670 150 Updated May 9, 2023

BradyFU / Awesome-Multimodal-Large-Language-Models

✨✨Latest Advances on Multimodal Large Language Models

15,498 1,003 Updated Jun 6, 2025

TXH-mercury / VAST

[NIPS2023] Code and Model for VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset

Jupyter Notebook 278 17 Updated Mar 14, 2024

GenjiB / ECLIPSE

Python 32 6 Updated Mar 10, 2023

deepspeedai / DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 38,790 4,412 Updated Jun 10, 2025

pengsida / learning_research

本人的科研经验

6,927 406 Updated Jun 4, 2025

OpenBMB / VisCPM

[ICLR'24 spotlight] Chinese and English Multimodal Large Model Series (Chat and Paint) | 基于CPM基础模型的中英双语多模态大模型系列

Python 1,058 93 Updated Jun 13, 2024

IMCCretrieval / DFRQ

Deep Fourier Ranking Quantization for Semi-Supervised Image Retrieval -- TIP22

Python 6 Updated Jul 1, 2023

OFA-Sys / ONE-PEACE

A general representation model across vision, audio, language modalities. Paper: ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities

Python 1,039 70 Updated Oct 6, 2024

IMCCretrieval / MomentDiff

MomentDiff: Generative Video Moment Retrieval from Random to Real--NeurIPS 2023

Python 79 Updated Nov 2, 2023

YoujiaZhang / SigmaCCS

[Communications Chemistry 2023] Highly accurate and large-scale collision cross section prediction with graph neural network for compound identification

Jupyter Notebook 59 2 Updated Sep 22, 2021

YexiongLin / Openvino_Yolov5_async

yolov5的openvino模型，带异步推理

Python 56 Updated Oct 11, 2020

w1oves / DTP

[ICCV 2023] Official implement of <Disentangle then Parse: Night-time Semantic Segmentation with Illumination Disentanglement>

Python 71 1 Updated Feb 26, 2024

TongkunGuan / RFN

[TCSVT2022] Industria Scene Text Detection

Python 80 6 Updated Mar 3, 2023

Sense-X / SiT

Official implementation of "Self-slimmed Vision Transformer" (ECCV2022)

Python 72 1 Updated Jul 20, 2022

ChenHsing / SVFormer

Python 86 2 Updated Apr 12, 2023

ChenHsing / SimDA

[CVPR 2024] SimDA: Simple Diffusion Adapter for Efficient Video Generation

Python 128 4 Updated May 7, 2024

0