Stars
Code for the project "MegaSaM: Accurate, Fast and Robust Structure and Motion from Casual Dynamic Videos"
🔍 Explore Egocentric Vision: research, data, challenges, real-world apps. Stay updated & contribute to our dynamic repository! Work-in-progress; join us!
Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]
Utility functions when working with Ai2-THOR. Try to do one thing once.
Course Project for CMU 16-867 Human Robot Interaction
Code and data for EMNLP 2023 paper "Grounding Visual Illusions in Language: Do Vision-Language Models Perceive Illusions Like Humans?"
GRiT: A Generative Region-to-text Transformer for Object Understanding (ECCV2024)
AGE animation official website URL release page(AGE动漫官网网址发布页)
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
✨✨Latest Advances on Multimodal Large Language Models
salaniz / pycocoevalcap
Forked from tylin/coco-captionPython 3 support for the MS COCO caption evaluation tools
Grounded Segment Anything: From Objects to Parts
Code for EMNLP 2022 Paper DANLI: Deliberative Agent for Following Natural Language Instructions
SPEAR: A Simulator for Photorealistic Embodied AI Research
Recent LLM-based CV and related works. Welcome to comment/contribute!
The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.
[CVPR 2023] vMAP: Vectorised Object Mapping for Neural Field SLAM
Learning mobile manipulation behaviors through reinforcement learning
Fine-Grained Egocentric Hand-Object Segmentation, ECCV 2022
Word/n-gram frequency lists for the Google Books Ngram Corpus (v3, all languages) with Python code
Words in the Google Books Ngram Corpus (v3, all languages) with metadata and Python code