Pinned
- AILab-CVC/SEED-Bench (Public): (CVPR2024) A benchmark for evaluating Multimodal LLMs using multiple-choice questions.
- tulerfeng/Video-R1 (Public): Video-R1: Reinforcing Video Reasoning in MLLMs [🔥 the first paper to explore R1 for video]
- AV-Odyssey/AV-Odyssey (Public): This repo contains evaluation code for the paper "AV-Odyssey: Can Your Multimodal LLMs Really Understand Audio-Visual Information?"