-
v1 Public
Don't Look Only Once: Towards Multimodal Interactive Reasoning with Selective Visual Revisitation
-
EgoSpeak Public
[NAACL 2025 Findings] EgoSpeak: Learning When to Speak for Egocentric Conversational Agents in the Wild
-
-
dot Public
Forked from 16lemoing/dotDense Optical Tracking: Connecting the Dots
Python MIT License UpdatedNov 6, 2024 -
-
ViTPose Public
Forked from ViTAE-Transformer/ViTPoseThe official repo for [NeurIPS'22] "ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation" and [Arxiv'22] "ViTPose+: Vision Transformer Foundation Model for Generic Body Pose Estim…
Python Apache License 2.0 UpdatedMay 16, 2023 -
GazeTracking Public
Forked from antoinelame/GazeTracking👀 Eye Tracking library easily implementable to your projects
Python MIT License UpdatedMay 14, 2023 -
detectron2 Public
Forked from facebookresearch/detectron2Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
Python Apache License 2.0 UpdatedApr 25, 2023 -
mrpep_fairseq Public
Forked from mrpep/fairseq(for ser-with-w2v2) Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Python MIT License UpdatedApr 22, 2021