conda create -n geo4d python=3.8.5
conda activate geo4d
pip install -r requirements.txt
Install Pytorch3D
pip install "git+https://github.com/facebookresearch/pytorch3d.git"
Download model checkpoint
mkdir checkpoints
mkdir checkpoints/geo4d
gdown 10SPKkOpou2lKl9bwkgx1d6YocYkmSxQl -O ./checkpoints/geo4d/ # fine-tuned vae model
gdown 11K0ubqytun-SA5RIOgR7ejNIR8B4uois -O ./checkpoints/geo4d/ # whole model
We provide a demo video for you to try our model. Run the inference script:
bash ./scripts/infer_geo4d.sh ./data/demo/drift-turn.mp4 0
Please first refer to the evaluation_script.md to download the evaluation datasets.
Then, run the evaluation script:
bash scripts/eval_geo4d.sh [sintel|bonn|kitti|tum|davis] gpu_id
# e.g. bash scripts/eval.sh sintel 0
First, install 4d visualization tool, viser
.
pip install -e viser
You could then use the viser
to visualize the results:
python viser/visualizer.py --data path_to_results_folder --no_mask
We have used codes from other great research work, including DuST3R, MonST3R, DepthCrafter, DynamiCrafter, RayDiffusion, and MoGe. We sincerely thank the authors for their awesome work!
Here are more recent 3D/4D reconstruction projects from our team:
- Dynamic Point Maps: A Versatile Representation for Dynamic 3D Reconstruction
- Flash3D: Feed-Forward Generalisable 3D Scene Reconstruction from a Single Image
- Amodal3R: Amodal 3D Reconstruction from Occluded 2D Images
- VGGT: Visual Geometry Grounded Transformer
If you find Geo4D useful for your research and applications, please cite us using this BibTex:
@misc{Geo4D,
title={Geo4D: Leveraging Video Generators for Geometric 4D Scene Reconstruction},
author={Jiang, Zeren and Zheng, Chuanxia and Laina, Iro and Larlus, Diane and Vedaldi, Andrea},
year={2025},
archivePrefix={arXiv},
primaryClass={cs.CV}
}