8000 magicjia1 / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View magicjia1's full-sized avatar

Block or report magicjia1

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.

2,334 102 Updated May 4, 2025
C++ 15 Updated Mar 4, 2025

[CVPR 2025] The code for "VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM"

Python 205 11 Updated May 12, 2025

The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling strong reasoning abilities, self-improvment, and skill curatio…

Python 2,101 185 Updated Nov 7, 2024

[ICASSP 2025] "Enhancing Multimodal Sentiment Analysis for Missing Modality through Self-Distillation and Unified Modality Cross-Attention"

Python 19 Update 8000 d Apr 27, 2025

State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!

Jupyter Notebook 1,162 58 Updated May 28, 2025

Implementation for Describe Anything: Detailed Localized Image and Video Captioning

Python 1,118 58 Updated May 6, 2025

SpatialLM: Large Language Model for Spatial Understanding

Python 3,206 250 Updated Mar 28, 2025

🎥 Python and OpenCV-based scene cut/transition detection program & library.

Python 3,919 436 Updated May 3, 2025

TRAM: Global Trajectory and Motion of 3D Humans from in-the-wild Videos

Python 396 40 Updated May 16, 2025

[CVPR 2025 Best Paper Award Candidate] VGGT: Visual Geometry Grounded Transformer

Python 7,061 716 Updated May 22, 2025

[CVPR 2025 Highlight] Video Depth Anything: Consistent Depth Estimation for Super-Long Videos

Python 993 71 Updated Apr 25, 2025

Repository for our paper: FLD: Fourier Latent Dynamics for Structured Motion Representation and Learning, Proceedings of the 12th International Conference on Learning Representations (ICLR)

Python 292 31 Updated Jun 13, 2024

Official code for MotionBench (CVPR 2025)

Python 40 1 Updated Mar 3, 2025

[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation

Python 5,614 522 Updated Jan 22, 2025

fit smpl parameters model using 3D joints

Python 202 14 Updated Nov 3, 2023

Code release of Video2Game

JavaScript 320 22 Updated Apr 25, 2024

LeetCode 101:力扣刷题指南

9,426 1,233 Updated Dec 8, 2024

[CVPR 2024] 4D Gaussian Splatting for Real-Time Dynamic Scene Rendering

Jupyter Notebook 2,703 237 Updated Oct 27, 2024
821C

Official implementation of "ASAP: Aligning Simulation and Real-World Physics for Learning Agile Humanoid Whole-Body Skills"

Python 1,037 92 Updated Apr 29, 2025

Frontier Multimodal Foundation Models for Image and Video Understanding

Jupyter Notebook 827 58 Updated May 19, 2025

Code release for "Learning Dexterous Manipulation from Exemplar Object Trajectories and Pre-Grasps"

Python 64 12 Updated Jul 12, 2023

The Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems

Python 2,059 127 Updated May 27, 2025

Official implementation of the project HuDOR: Bridging the Human to Robot Dexterity Gap through Object-Oriented Rewards project. Website: https://object-rewards.github.io

Python 22 2 Updated Apr 10, 2025

[RSS 2024] 3D Diffusion Policy: Generalizable Visuomotor Policy Learning via Simple 3D Representations

Python 885 94 Updated May 15, 2025

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 41,522 6,898 Updated Dec 9, 2024

GRAPE: Guided-Reinforced Vision-Language-Action Preference Optimization

Python 128 6 Updated Apr 6, 2025

Code for the paper "Learning from Massive Human Videos for Universal Humanoid Pose Control"

Python 122 9 Updated Jan 14, 2025
Next
0