magicjia1

0 followers · 2 following

Lists (1)

🚀 My stack

1 repository

Stars

yunlong10 / Awesome-LLMs-for-Video-Understanding

🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.

2,334 102 Updated May 4, 2025

jingGM / GND

C++ 15 Updated Mar 4, 2025

DAMO-NLP-SG / VideoRefer

[CVPR 2025] The code for "VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM"

Python 205 11 Updated May 12, 2025

BAAI-Agents / Cradle

The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling strong reasoning abilities, self-improvement, and skill curatio…

Python 2,101 185 Updated Nov 7, 2024

WarmCongee / SDUMC

[ICASSP 2025] "Enhancing Multimodal Sentiment Analysis for Missing Modality through Self-Distillation and Unified Modality Cross-Attention"

Python 19 Updated Apr 27, 2025

facebookresearch / perception_models

State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!

Jupyter Notebook 1,162 58 Updated May 28, 2025

NVlabs / describe-anything

Implementation for Describe Anything: Detailed Localized Image and Video Captioning

Python 1,118 58 Updated May 6, 2025

yilinliu77 / UrbanScene3D

106 8 Updated Aug 9, 2022

manycore-research / SpatialLM

SpatialLM: Large Language Model for Spatial Understanding

Python 3,206 250 Updated Mar 28, 2025

Breakthrough / PySceneDetect

🎥 Python and OpenCV-based scene cut/transition detection program & library.

Python 3,919 436 Updated May 3, 2025

yufu-wang / tram

TRAM: Global Trajectory and Motion of 3D Humans from in-the-wild Videos

Python 396 40 Updated May 16, 2025

facebookresearch / vggt

[CVPR 2025 Best Paper Award Candidate] VGGT: Visual Geometry Grounded Transformer

Python 7,061 716 Updated May 22, 2025

DepthAnything / Video-Depth-Anything

[CVPR 2025 Highlight] Video Depth Anything: Consistent Depth Estimation for Super-Long Videos

Python 993 71 Updated Apr 25, 2025

mit-biomimetics / fld

Repository for our paper: FLD: Fourier Latent Dynamics for Structured Motion Representation and Learning, Proceedings of the 12th International Conference on Learning Representations (ICLR)

Python 292 31 Updated Jun 13, 2024

THUDM / MotionBench

Official code for MotionBench (CVPR 2025)

Python 40 1 Updated Mar 3, 2025

DepthAnything / Depth-Anything-V2

[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation

Python 5,614 522 Updated Jan 22, 2025

wangsen1312 / joints2smpl

Fit SMPL model parameters using 3D joints

Python 202 14 Updated Nov 3, 2023

video2game / video2game

Code release of Video2Game

JavaScript 320 22 Updated Apr 25, 2024

changgyhub / leetcode_101

LeetCode 101: A LeetCode Problem-Solving Guide

9,426 1,233 Updated Dec 8, 2024

hustvl / 4DGaussians

[CVPR 2024] 4D Gaussian Splatting for Real-Time Dynamic Scene Rendering

Jupyter Notebook 2,703 237 Updated Oct 27, 2024

LeCAR-Lab / ASAP

DAMO-NLP-SG / VideoLLaMA3

facebookresearch / TCDM

Physical-Intelligence / openpi

OpenDriveLab / AgiBot-World

irmakguzey / object-rewards

YanjieZe / 3D-Diffusion-Policy

karpathy / nanoGPT

aiming-lab / GRAPE

sihengz02 / UH-1