-
Zhejiang University
- Hangzhou, China
Highlights
- Pro
Stars
[ICCV 2025] Geo4D: Leveraging Video Generators for Geometric 4D Scene Reconstruction
Official code of PatchmatchNet (CVPR 2021 Oral)
Repo for SeedVR2 & SeedVR (CVPR2025 Highlight)
Estimating Body and Hand Motion in an Ego-sensed World
SparseFlex: High-Resolution and Arbitrary-Topology 3D Shape Modeling
[ICCV 2025] A simple training-free approach adapting DUSt3R for dynamic scenes.
[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer
Code release of "Assembler: Scalable 3D Part Assembly via Anchor Point Diffusion".
[CVPR 2025 Highlight] GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control
DiffusionRenderer (Cosmos): Neural Inverse and Forward Rendering with Video Diffusion Models
Official repo for paper "Sparse Representation and Construction for High-Resolution 3D Shapes Modeling".
Efficient Part-level 3D Object Generation via Dual Volume Packing
[Arxiv 2025] ByteMorph: Benchmarking Instruction-Guided Image Editing with Non-Rigid Motions
An extensive node suite that enables ComfyUI to process 3D inputs (Mesh & UV Texture, etc) using cutting edge algorithms (3DGS, NeRF, etc.)
[CVPR 2025 Oral] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models
Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual information for complex reasoning, planning, and generation.
Official repository for the paper "Orientation Matters: Making 3D Generative Models Orientation-Aligned"
PartCrafter: Structured 3D Mesh Generation via Compositional Latent Diffusion Transformers
Official implementation of "Force Prompting: Video Generation Models Can Learn and Generalize Physics-based Control Signals"
[TPAMI 2025, NeurIPS 2024] Vidu4D: Single Generated Video to High-Fidelity 4D Reconstruction with Dynamic Gaussian Surfels
[ICCV 2025] Official code of DeepMesh: Auto-Regressive Artist-mesh Creation with Reinforcement Learning
[NeurIPS 2024] Direct3D: Scalable Image-to-3D Generation via 3D Latent Diffusion Transformer
Motion2VecSets: 4D Latent Vector Set Diffusion for Non-rigid Shape Reconstruction and Tracking