hx8563

hx8563

11 followers · 24 following

Beijing

Achievements

Stars

tryolabs / norfair

Lightweight Python library for adding real-time multi-object tracking to any detector.

Python 2,491 263 Updated Apr 30, 2025

colmap / colmap

COLMAP - Structure-from-Motion and Multi-View Stereo

C++ 8,737 1,658 Updated Jun 5, 2025

epic-kitchens / epic-kitchens-100-annotations

🍽️ Annotations for the public release of the EPIC-KITCHENS-100 dataset

Python 148 28 Updated Aug 1, 2022

zhumorui / AnyRetrival

Enhancing Zero-shot Image Retrieval with Vision Foundation Models

Python 3 Updated Nov 22, 2024

zhumorui / maze_cot

Python 1 Updated Apr 21, 2025

embodied-generalist / embodied-generalist

[ICML 2024] Official code repository for 3D embodied generalist agent LEO

Python 439 39 Updated Apr 20, 2025

modelscope / Nexus-Gen

Python 211 11 Updated May 27, 2025

NirAharon / BoT-SORT

BoT-SORT: Robust Associations Multi-Pedestrian Tracking

Jupyter Notebook 1,087 448 Updated Aug 8, 2024

facebookresearch / omni3d

Code release for "Omni3D A Large Benchmark and Model for 3D Object Detection in the Wild"

Python 782 73 Updated Apr 7, 2024

facebookresearch / vggt

[CVPR 2025 Best Paper Award Candidate] VGGT: Visual Geometry Grounded Transformer

Python 7,265 743 Updated Jun 3, 2025

NovaSky-AI / SkyRL

SkyRL-v0: Train Real-World Long-Horizon Agents via Reinforcement Learning

Python 379 37 Updated Jun 5, 2025

UVA-Computer-Vision-Lab / ovmono3d

Code for "Open Vocabulary Monocular 3D Object Detection"

Python 49 2 Updated Apr 28, 2025

NVlabs / RoboSpatial

Python 49 1 Updated Apr 30, 2025

VlSomers / keypoint_promptable_reidentification

[ECCV24] Keypoint Promptable Re-Identification: SOTA ReID method robust to occlusions and multi-person ambiguity

Python 134 16 Updated Feb 2, 2025

78 / xiaozhi-esp32

An MCP-based chatbot | 一个基于MCP的聊天机器人

C++ 14,589 2,785 Updated Jun 6, 2025

om-ai-lab / VLM-R1

Solve Visual Understanding with Reinforced VLMs

Python 5,080 310 Updated May 11, 2025

AnjieCheng / SpatialRGPT

[NeurIPS'24] This repository is the implementation of "SpatialRGPT: Grounded Spatial Reasoning in Vision Language Models"

Python 201 16 Updated Dec 14, 2024

remyxai / VQASynth

Compose multimodal datasets 🎹

Python 394 17 Updated Jun 1, 2025

WeichenZh / Open3DVQA

Python 16 1 Updated Jun 5, 2025

AIR-THU / DAIR-V2X

Python 516 72 Updated Feb 21, 2025

DAMO-NLP-SG / VideoLLaMA2

VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs

Python 1,172 80 Updated Jan 23, 2025

InternLM / InternLM-XComposer

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Python 2,842 174 Updated May 26, 2025

ai-forever / MoVQGAN

MoVQGAN - model for the image encoding and reconstruction

Jupyter Notebook 239 16 Updated Oct 31, 2023

scofield7419 / Video-of-Thought

Video Chain of Thought, Codes for ICML 2024 paper: "Video-of-Thought: Step-by-Step Video Reasoning from Perception to Cognition"

Python 145 7 Updated Feb 25, 2025

google-research / big_vision

Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.

Jupyter Notebook 2,918 188 Updated May 19, 2025

JiehongLin / SAM-6D

[CVPR2024] Code for "SAM-6D: Segment Anything Model Meets Zero-Shot 6D Object Pose Estimation".

Python 518 53 Updated Jul 9, 2024

NVlabs / FoundationPose

[CVPR 2024 Highlight] FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects

Python 2,136 306 Updated Mar 3, 2025

manycore-research / SpatialLM

SpatialLM: Large Language Model for Spatial Understanding

Python 3,227 250 Updated Mar 28, 2025

DAMO-NLP-SG / VideoRefer

[CVPR 2025] The code for "VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM"

Python 206 11 Updated May 12, 2025

vukasin-stanojevic / BoostTrack

Python 177 25 Updated Jun 3, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

hx8563

Achievements

Achievements

Block or report hx8563

Stars

tryolabs / norfair

colmap / colmap

epic-kitchens / epic-kitchens-100-annotations

zhumorui / AnyRetrival

zhumorui / maze_cot

embodied-generalist / embodied-generalist

modelscope / Nexus-Gen

NirAharon / BoT-SORT

facebookresearch / omni3d

facebookresearch / vggt

NovaSky-AI / SkyRL

UVA-Computer-Vision-Lab / ovmono3d

NVlabs / RoboSpatial

VlSomers / keypoint_promptable_reidentification

78 / xiaozhi-esp32

om-ai-lab / VLM-R1

AnjieCheng / SpatialRGPT

remyxai / VQASynth

WeichenZh / Open3DVQA

AIR-THU / DAIR-V2X

DAMO-NLP-SG / VideoLLaMA2

InternLM / InternLM-XComposer

ai-forever / MoVQGAN

scofield7419 / Video-of-Thought

google-research / big_vision

JiehongLin / SAM-6D

NVlabs / FoundationPose

manycore-research / SpatialLM

DAMO-NLP-SG / VideoRefer

vukasin-stanojevic / BoostTrack