A*STAR
viper Public
Forked from cvlab-columbia/viper
Code for the paper "ViperGPT: Visual Inference via Python Execution for Reasoning"
Jupyter Notebook Other Updated Mar 7, 2025
VILA Public
Forked from NVlabs/VILA
VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)
Python Apache License 2.0 Updated Feb 25, 2025
MiniCPM-V Public
Forked from OpenBMB/MiniCPM-o
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
Python Apache License 2.0 Updated Oct 17, 2024
Ask-Anything Public
Forked from OpenGVLab/Ask-Anything
[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
Python MIT License Updated Oct 2, 2024
HumanMotionQA Public
Forked from markendo/HumanMotionQA
Motion Question Answering via Modular Motion Programs
Jupyter Notebook Updated Aug 12, 2024
ctr-din-pytorch Public
The Most Complete PyTorch Implementation of "Deep Interest Network for Click-Through Rate Prediction"
VideoX Public
Forked from microsoft/VideoX
VideoX: a collection of video cross-modal models
Python Other Updated Nov 18, 2022
ORViT Public
Forked from eladb3/ORViT
"Object-Region Video Transformers", Herzig et al., CVPR 2022
Python Apache License 2.0 Updated Jul 6, 2022