10000 594zyc (Yichi Zhang) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View 594zyc's full-sized avatar

Highlights

  • Pro

Block or report 594zyc

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Code for the project "MegaSaM: Accurate, Fast and Robust Structure and Motion from Casual Dynamic Videos"

Python 813 35 Updated Apr 12, 2025

🔍 Explore Egocentric Vision: research, data, challenges, real-world apps. Stay updated & contribute to our dynamic repository! Work-in-progress; join us!

111 9 Updated Nov 23, 2024

Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]

Python 23,964 3,177 Updated Jun 12, 2025

MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone

Python 19,609 1,420 Updated Jun 12, 2025

Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]

Python 137 13 Updated Nov 26, 2024

Ai2-THOR in browser.

Python 6 Updated Sep 6, 2022

Utility functions when working with Ai2-THOR. Try to do one thing once.

Python 46 6 Updated May 19, 2022

Course Project for CMU 16-867 Human Robot Interaction

Python 1 1 Updated Dec 13, 2022

Code and data for EMNLP 2023 paper "Grounding Visual Illusions in Language: Do Vision-Language Models Perceive Illusions Like Humans?"

Python 13 1 Updated Jan 25, 2024
Python 37 1 Updated Mar 22, 2024

GRiT: A Generative Region-to-text Transformer for Object Understanding (ECCV2024)

Python 326 30 Updated Jan 8, 2024

AGE animation official website URL release page(AGE动漫官网网址发布页)

7,315 187 Updated May 3, 2025

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 145,500 29,317 Updated Jun 12, 2025

✨✨Latest Advances on Multimodal Large Language Models

15,519 1,006 Updated Jun 6, 2025

Python 3 support for the MS COCO caption evaluation tools

Python 322 88 Updated Aug 1, 2024

ESPER

Python 23 2 Updated Mar 29, 2024

Grounded Segment Anything: From Objects to Parts

Jupyter Notebook 411 22 Updated May 19, 2023

Code for EMNLP 2022 Paper DANLI: Deliberative Agent for Following Natural Language Instructions

Python 19 2 Updated May 1, 2025

SPEAR: A Simulator for Photorealistic Embodied AI Research

C++ 280 23 Updated Jun 5, 2025

Recent LLM-based CV and related works. Welcome to comment/contribute!

866 37 Updated Mar 8, 2025

Inference code for Llama models

Python 58,363 9,785 Updated Jan 26, 2025

A library for differentiable robotics.

Python 1,415 112 Updated Mar 28, 2025

The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.

Python 21,193 3,684 Updated Jul 4, 2024

[CVPR 2023] vMAP: Vectorised Object Mapping for Neural Field SLAM

Python 358 21 Updated Jun 16, 2023

Learning mobile manipulation behaviors through reinforcement learning

Python 59 4 Updated Apr 15, 2024

Fine-Grained Egocentric Hand-Object Segmentation, ECCV 2022

Python 112 15 Updated Feb 26, 2024

Word/n-gram frequency lists for the Google Books Ngram Corpus (v3, all languages) with Python code

Python 74 14 Updated Aug 14, 2023

Words in the Google Books Ngram Corpus (v3, all languages) with metadata and Python code

Python 5 Updated Dec 7, 2022
Jupyter Notebook 226 29 Updated Dec 18, 2023
Next
0