PanDA

Zidong Cao¹ · Jinjing Zhu¹ · Weiming Zhang¹ · Hao Ai² · Haotian Bai¹
Hengshuang Zhao³ · Lin Wang⁴†

¹AI Thrust, HKUST (GZ) ²University of Birmingham ³HKU ⁴NTU
†corresponding author

We propose a semi-supervised learning framework to learn a panoramic Depth Anything, dubbed PanDA. PanDA first learns a teacher model by fine-tuning Depth Anything through joint training on synthetic indoor and outdoor panoramic datasets. Then, a student model is trained using large-scale unlabeled data, leveraging pseudo-labels generated by the teacher model. PanDA exhibits impressive zero-shot capability across diverse scenes.
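The two-stage recipe above (teacher trained on labeled data, student trained on the teacher's pseudo-labels) can be illustrated with a deliberately tiny sketch. Everything in it is hypothetical: the 1-D linear "model", the helper names `fit_line` and `predict`, and the data are stand-ins for illustration only, whereas PanDA itself fine-tunes Depth Anything on panoramas.

```python
# Toy illustration of teacher -> pseudo-label -> student training.
# The 1-D linear model and all data below are hypothetical stand-ins.

def fit_line(xs, ys):
    """Least-squares fit of y = a * x + b; returns (a, b)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    var_x = sum((x - mean_x) ** 2 for x in xs)
    cov_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    a = cov_xy / var_x
    return a, mean_y - a * mean_x


def predict(model, xs):
    """Apply a fitted (a, b) model to a list of inputs."""
    a, b = model
    return [a * x + b for x in xs]


# Stage 1: the teacher is fitted on a small labeled (synthetic) set.
labeled_x = [0.0, 1.0, 2.0, 3.0]
labeled_y = [1.0, 3.0, 5.0, 7.0]  # generated by y = 2x + 1
teacher = fit_line(labeled_x, labeled_y)

# Stage 2: the teacher labels a larger unlabeled set ...
unlabeled_x = [0.5 * i for i in range(20)]
pseudo_y = predict(teacher, unlabeled_x)

# ... and the student is fitted on those pseudo-labels only.
student = fit_line(unlabeled_x, pseudo_y)
```

The point of the sketch is only the data flow: ground-truth labels are used in stage 1, and the student in stage 2 never sees them.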

News

2025-03-18: Code release.
2025-02-27: PanDA is accepted by CVPR 2025.
2025-02-06: Our survey on 360 vision is accepted by IJCV. We hope the survey is helpful to you. [Link]

Self-captured Datasets

In our manuscript, some panoramas (RGB only, without depth labels) were captured by ourselves. The dataset link is here. It contains about 10,000 panoramas at 4K resolution. Note that we do not claim the dataset as a technical contribution.

Pre-trained Models

We provide three models for robust relative panoramic depth estimation (the predicted depth values lie in the range 0~1):

Model	Params	Checkpoint
PanDA-Small	24.8M	Download
PanDA-Base	97.5M	Download
PanDA-Large	335.3M	Download
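Relative depth in the range 0~1 is only meaningful within a single image. As a minimal sketch of what such a range implies, the snippet below min-max scales a raw depth map into [0, 1] and maps it to 8-bit gray levels. Min-max scaling is one common choice and is an assumption here, not a statement of how PanDA post-processes its output; the function names `normalize_depth` and `to_grayscale` are hypothetical.

```python
def normalize_depth(depth_rows):
    """Min-max scale a 2-D list of raw depth values into [0, 1]."""
    flat = [v for row in depth_rows for v in row]
    lo, hi = min(flat), max(flat)
    span = hi - lo
    if span == 0:  # constant map: avoid division by zero
        return [[0.0 for _ in row] for row in depth_rows]
    return [[(v - lo) / span for v in row] for row in depth_rows]


def to_grayscale(relative_rows):
    """Map relative depth in [0, 1] to 8-bit gray levels (0..255)."""
    return [[round(v * 255) for v in row] for row in relative_rows]


raw = [[2.0, 4.0], [6.0, 10.0]]      # made-up raw values
relative = normalize_depth(raw)      # [[0.0, 0.25], [0.5, 1.0]]
gray = to_grayscale(relative)
```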

Usage

Preparation

git clone https://github.com/caozidong/PanDA
cd PanDA
pip install -r requirements.txt

Note: We use python==3.10, pytorch==2.0.0, cuda==11.7, and cudnn==8.5.0.

Download the checkpoints listed here and put them under the checkpoints directory.

Inference for images

python run_image.py \
  --config ./config/inference/panda_<large, base, small>.yaml \
  --img-path <path> --outdir <outdir> \
  [--height <height>] [--resize] [--pred-only] \
  [--grayscale] [--save-cloud]

Options:

--config: Model config file.
--img-path: An image directory, a single image path, or a text file that lists image paths.
--outdir: Directory in which the results are saved.
--height (optional): Height of the ERP image; the width is 2 × height. The default height is 504. A larger height can give better predictions, but 1008x2016 requires more than 40GB of GPU memory.
--resize (optional): Resize the output depth map to the spatial resolution of the input ERP image.
--pred-only (optional): Save only the predicted depth map, without the raw image.
--grayscale (optional): Save the depth map in grayscale, without applying a color palette.
--save-cloud (optional): Save the result as a colored point cloud.
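The three input forms accepted by --img-path (a directory, a single image, or a text file of paths) can be resolved to one list of image paths roughly as follows. This is a sketch under stated assumptions, not the repository's code: the helper name `collect_image_paths`, the set of accepted extensions, and the rule that a `.txt` suffix marks a path list are all hypothetical.

```python
from pathlib import Path

# Assumed set of image extensions; the real script may accept others.
IMAGE_SUFFIXES = {".jpg", ".jpeg", ".png"}


def collect_image_paths(img_path):
    """Return image paths for a directory, a text file of paths, or one image."""
    path = Path(img_path)
    if path.is_dir():
        # Directory: keep only files with an image extension, in sorted order.
        return sorted(str(p) for p in path.iterdir()
                      if p.suffix.lower() in IMAGE_SUFFIXES)
    if path.suffix.lower() == ".txt":
        # Text file: one image path per line, blank lines ignored.
        lines = path.read_text().splitlines()
        return [line.strip() for line in lines if line.strip()]
    # Anything else is treated as a single image path.
    return [str(path)]
```

For example, `collect_image_paths("./erp_samples/")` would list the images in that directory, while a path to a single image is returned as a one-element list.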

For example:

python run_image.py --config ./config/inference/panda_large.yaml \
       --img-path ./erp_samples/ --pred-only
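For readers curious how an ERP depth map relates to the point cloud saved by --save-cloud, the sketch below back-projects each pixel along its viewing ray on the sphere. It is an illustration under assumptions, not the repository's implementation: the function name `erp_to_points` is hypothetical, depth is taken as distance along the ray, and a y-up, z-forward axis convention is assumed. Because the predicted depth is relative, such a cloud is only defined up to scale.

```python
import math


def erp_to_points(depth_rows):
    """Back-project an ERP depth map (list of rows) to a list of (x, y, z) points."""
    height = len(depth_rows)
    width = len(depth_rows[0])
    points = []
    for v, row in enumerate(depth_rows):
        # Latitude runs from +pi/2 at the top row to -pi/2 at the bottom row.
        lat = math.pi / 2 - (v + 0.5) / height * math.pi
        for u, d in enumerate(row):
            # Longitude runs from -pi at the left edge to +pi at the right edge.
            lon = (u + 0.5) / width * 2 * math.pi - math.pi
            x = d * math.cos(lat) * math.sin(lon)
            y = d * math.sin(lat)  # assumed convention: y points up
            z = d * math.cos(lat) * math.cos(lon)
            points.append((x, y, z))
    return points


depth = [[1.0] * 8 for _ in range(4)]  # 4 x 8 ERP map, unit depth everywhere
cloud = erp_to_points(depth)
```

With unit depth everywhere, all points lie on the unit sphere, which is a quick sanity check for the chosen convention.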

Inference for videos

python run_video.py \
  --config ./config/inference/panda_<large, base, small>.yaml \
  --video-path assets/examples_video --outdir video_depth_vis \
  [--height <height>]

About teacher model training and evaluation

Please refer to train_teacher.

About student model training and evaluation

Please refer to train_student.

About Fine-tuning to Metric Depth

Please refer to train_metric_depth.

Acknowledgement

We sincerely thank the authors of Depth Anything v1 and Depth Anything v2 for contributing such impressive models and code to our community. We also sincerely thank the authors of UniFuse for providing training and evaluation code.

