-
Bournemouth university
- UK
- in/shuolin-xu-b565b8268
Lists (1)
Sort Name ascending (A-Z)
Stars
VGGT-SLAM: Dense RGB SLAM Optimized on the SL(4) Manifold
[ICLR 2025 Oral] Official code for "LVSM: A Large View Synthesis Model with Minimal 3D Inductive Bias"
[CVPR 2024] Official code for EgoGen: An Egocentric Synthetic Data Generator
[ICCV 2025] The official implementation for EgoM2P: Egocentric Multimodal Multitask Pretraining.
[ICCV 2025] LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion
[CVPR 2025🔥] Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model
Nymeria: a massive collection of multimodal egocentric daily motion in the wild
Code repository for the CVPR 2025 paper "From Sparse Signal to Smooth Motion Real-Time Motion Generation with Rolling Prediction Models" and GORP dataset
projectaria_tools is an C++/Python open-source toolkit to interact with Project Aria data
Aria data tools provide the open-source toolkit in C++ and Python to interact with data from Project Aria
Foundation Models and Data for Human-Human and Human-AI interactions.
[ICCV 2025] Zero-Shot Monocular Depth Completion with Guided Diffusion
[ICCV 2025] Official implementation of the paper "DreamCube: 3D Panorama Generation via Multi-plane Synchronization".
The code releasing for https://image-dream.github.io/
Official repository for CVPR 2024 highlight paper 4D-DRESS: A 4D Dataset of Real-world Human Clothing with Semantic Annotations.
HaMeR: Reconstructing Hands in 3D with Transformers
The official implementation of "GaussianCity: Generative Gaussian Splatting for Unbounded 3D City Generation". (CVPR 2025)
Official implementation of Continuous 3D Perception Model with Persistent State
Code accompanying Ego-Exo: Transferring Visual Representations from Third-person to First-person Videos (CVPR 2021)
PyTorch code and models for V-JEPA self-supervised learning from video.
The implementation of Extreme Viewpoint 4D Video Generation
(ECCV 2024) Official repository of paper "EgoExo-Fitness: Towards Egocentric and Exocentric Full-Body Action Understanding"
Implementation of "YOLOv13: Real-Time Object Detection with Hypergraph-Enhanced Adaptive Visual Perception".
Collect video sequences with exact 6-DoF camera poses from Grand Theft Auto V
[CVPR 2025 Highlight] VideoScene: Distilling Video Diffusion Model to Generate 3D Scenes in One Step
Official implementation of MAGREF: Masked Guidance for Any-Reference Video Generation