MS3D

This is the official code release for

MS3D: Leveraging Multiple Detectors for Unsupervised Domain Adaptation in 3D Object Detection [Paper] [Video]
MS3D++: Ensemble of Experts for Multi-Source Unsupervised Domain Adaptation in 3D Object Detection [Paper] (code and models will be added to this repo soon)

MS3D is an auto-labeling framework for vehicles and pedestrians that generates high-quality labels for training 3D detectors on a variety of lidars, regardless of their density. Training VoxelRCNN on the Waymo dataset with only our generated labels achieves 70.3 BEV AP for vehicle detection on the official validation set, just 3.5 BEV AP below training with human-annotated labels. Read our papers to find out more.

MS3D has the following benefits:

Robust labeling across a wide range of lidars, from low-beam to high-beam sensors.
Tailorable ensemble: the set of pre-trained detectors can be chosen (e.g. different architectures, source domains, voxel sizes, or class-specific detectors) to obtain high auto-labeling quality on any given lidar dataset.
Compatible with any 3D detector. MS3D only requires 3D bounding boxes as input, and the generated labels can replace human-annotated labels in supervised training of any 3D detector.
Preserves the real-time inference capability of detectors, since the detector architecture is not modified.
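Because only 3D bounding boxes are required as input, the output of any detector can be reduced to a plain record before auto-labeling. The following is a minimal sketch of such a record, assuming a 7-parameter box (centre, size, heading) plus a confidence score and class name. The `Box3D` class, its field names, and the `keep_confident` helper are hypothetical and are not part of the MS3D code.

```python
import math
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Box3D:
    """Hypothetical detector-agnostic box record; field names are assumptions."""
    x: float        # box centre (m)
    y: float
    z: float
    dx: float       # length along the heading direction (m)
    dy: float       # width (m)
    dz: float       # height (m)
    heading: float  # yaw around the vertical axis (rad)
    score: float    # detector confidence in [0, 1]
    label: str      # e.g. "Vehicle" or "Pedestrian"

    def bev_corners(self) -> List[Tuple[float, float]]:
        """Return the four bird's-eye-view corners of the box."""
        c, s = math.cos(self.heading), math.sin(self.heading)
        half = [(self.dx / 2, self.dy / 2), (-self.dx / 2, self.dy / 2),
                (-self.dx / 2, -self.dy / 2), (self.dx / 2, -self.dy / 2)]
        # Rotate each half-extent offset by the heading, then translate.
        return [(self.x + c * u - s * v, self.y + s * u + c * v) for u, v in half]


def keep_confident(boxes: List[Box3D], min_score: float) -> List[Box3D]:
    """Drop boxes below a confidence threshold before further processing."""
    return [b for b in boxes if b.score >= min_score]
```

Any detector whose predictions can be converted into records like this could, in principle, contribute to the ensemble.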

Our box fusion method, KBF, can also be used for detector ensembling in a supervised setting, where it can outperform Weighted Box Fusion (WBF). See our first MS3D paper for comparison results and a simple demo here.
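The difference between averaging and density-based fusion can be illustrated on a single box parameter. The following is a minimal sketch, not the repository's implementation: it assumes that KBF picks each fused box parameter at the peak of a score-weighted kernel density estimate, and compares that with a score-weighted average used here as a simplified stand-in for WBF. The function names, the Gaussian kernel, the bandwidth, and the grid search are all illustrative assumptions; consult the paper for the actual formulation. Circular quantities such as heading would need separate handling.

```python
import math
from typing import Sequence


def weighted_average(values: Sequence[float], weights: Sequence[float]) -> float:
    """Score-weighted mean of one box parameter (simplified stand-in for WBF)."""
    return sum(v * w for v, w in zip(values, weights)) / sum(weights)


def kde_peak(values: Sequence[float], weights: Sequence[float],
             bandwidth: float = 0.2, steps: int = 200) -> float:
    """Grid location with the highest score-weighted Gaussian kernel density.

    The bandwidth and grid resolution are illustrative assumptions.
    """
    lo, hi = min(values), max(values)
    if hi == lo:
        return lo
    best_x, best_density = lo, -1.0
    for i in range(steps + 1):
        x = lo + (hi - lo) * i / steps
        density = sum(w * math.exp(-0.5 * ((x - v) / bandwidth) ** 2)
                      for v, w in zip(values, weights))
        if density > best_density:
            best_x, best_density = x, density
    return best_x


# Hypothetical x-centres (m) and scores from four detectors for one object.
x_centres = [10.0, 10.1, 9.9, 12.0]
scores = [0.9, 0.8, 0.85, 0.6]

fused_mean = weighted_average(x_centres, scores)
fused_peak = kde_peak(x_centres, scores)
```

With three detectors agreeing near 10.0 m and one outlier at 12.0 m, the weighted average is pulled toward the outlier while the density peak stays with the consensus.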

Installation

Please refer to INSTALL.md for the installation of MS3D.

Usage - Auto-labeling

Please refer to DATASET_PREPARATION.md to prepare the datasets.
Please refer to GETTING_STARTED.md to learn more about how to use MS3D. We are also planning to release a guide for custom datasets; stay tuned!
Please refer to PARAMETERS.md for a guide on how to tune MS3D parameters.
Please refer to VISUALIZATION.md to learn how to use our visualization tools.

Model Zoo

For all tables below, "GT-FT" refers to fine-tuning the pre-trained detector using ground-truth labels from the target domain. Results are reported at IoU=0.7 evaluated at 40 recall levels (R40). Refer to our paper for detailed results.
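For readers unfamiliar with the R40 convention, the sketch below shows how an average precision value can be computed from a precision-recall curve. It assumes the KITTI-style definition, in which interpolated precision is sampled at the 40 recall positions 1/40, 2/40, ..., 1 and averaged. The function name and input format are illustrative assumptions, and the sketch takes the precision-recall points as given rather than performing the IoU=0.7 matching that produces them.

```python
from typing import Sequence


def average_precision_r40(recalls: Sequence[float],
                          precisions: Sequence[float]) -> float:
    """Interpolated AP (in percent) sampled at recall 1/40, 2/40, ..., 1."""
    total = 0.0
    for i in range(1, 41):
        r = i / 40
        # Interpolated precision: the best precision at any recall >= r.
        candidates = [p for rec, p in zip(recalls, precisions) if rec >= r]
        total += max(candidates) if candidates else 0.0
    return 100.0 * total / 40
```

Recall positions that the detector never reaches contribute zero precision, so a detector that misses many objects is penalized even if its confident predictions are accurate.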

Target Domain: nuScenes

Models for target-nuscenes can be downloaded here. We also provide MS3D results for fine-tuning with multi-frame detection, as is common for nuScenes models, to demonstrate that performance can be boosted further. All models below use SECOND-IoU.

Method	Source	Vehicle (BEV)	Vehicle (3D)
MS3D	Waymo	42.23	24.76
MS3D	Lyft	41.64	23.46
MS3D (10 frame)	Waymo	47.35	27.18
GT-FT	Waymo	44.39	29.46
GT-FT (10 frame)	Waymo	50.05	33.32

Target Domain: Lyft

Models for target-lyft can be downloaded here. As with nuScenes, we show multi-frame detection results for MS3D. All models below use SECOND-IoU.

Method	Source	Vehicle (BEV)	Vehicle (3D)
SN	nuScenes	63.11	39.60
SN	Waymo	71.61	56.13
ST3D	nuScenes	67.33	41.82
ST3D	Waymo	73.86	56.33
MS3D	nuScenes	75.02	59.01
MS3D	Waymo	77.05	60.17
MS3D (3 frame)	Waymo	76.89	63.12
GT-FT	Waymo	81.10	66.76
GT-FT (3 frame)	Waymo	83.58	69.44

Target Domain: Waymo

Due to the Waymo Dataset License Agreement, we do not provide links to models trained on Waymo data. You can train your own model using our provided configs.

If you want to download the models, please send me an email with your name, institute, a screenshot of the Waymo dataset registration confirmation email, and your intended usage. Please note that the Waymo Open Dataset is under a strict non-commercial license, so we cannot share the models with you if they will be used for any profit-oriented activities.

All models below use SECOND-IoU.

Method	Source	Vehicle (BEV)	Vehicle (3D)
SN	Lyft	53.39	39.22
SN	nuScenes	50.69	28.86
ST3D	Lyft	56.06	39.17
ST3D	nuScenes	55.67	28.83
MS3D	Lyft	61.25	42.88
MS3D	nuScenes	61.39	42.76
GT-FT	Lyft	66.76	52.50
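The Waymo-target numbers can be compared directly with a few lines of arithmetic. The sketch below copies the Vehicle (BEV) column from the table above and computes the gain of MS3D over ST3D for each source domain, as well as the remaining gap to ground-truth fine-tuning for the Lyft source (the only source with a GT-FT row). The dictionary layout and helper name are illustrative assumptions.

```python
# Vehicle BEV AP for target domain Waymo, copied from the table above.
bev_ap = {
    ("SN", "Lyft"): 53.39,
    ("SN", "nuScenes"): 50.69,
    ("ST3D", "Lyft"): 56.06,
    ("ST3D", "nuScenes"): 55.67,
    ("MS3D", "Lyft"): 61.25,
    ("MS3D", "nuScenes"): 61.39,
    ("GT-FT", "Lyft"): 66.76,
}


def ap_difference(method_a: str, method_b: str, source: str) -> float:
    """BEV AP of method_a minus method_b for the same source, to 2 decimals."""
    return round(bev_ap[(method_a, source)] - bev_ap[(method_b, source)], 2)


# Gain of MS3D over ST3D per source domain, and the gap left to GT-FT.
gain_over_st3d = {src: ap_difference("MS3D", "ST3D", src)
                  for src in ("Lyft", "nuScenes")}
gap_to_gt_ft = ap_difference("GT-FT", "MS3D", "Lyft")
```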

Source Models

We provide models trained on source-domain data used in our experiments.

nuScenes pre-trained models can be downloaded here.

Lyft pre-trained models can be downloaded here.

For Waymo, please send me an email if you would like to download the source-trained models we used.

Qualitative Results

License

MS3D is released under the Apache 2.0 license.

Citation

If you find this project useful in your research, please give us a star and consider citing:

@article{tsai2023ms3d,
  title={MS3D: Leveraging Multiple Detectors for Unsupervised Domain Adaptation in 3D Object Detection},
  author={Tsai, Darren and Berrio, Julie Stephany and Shan, Mao and Nebot, Eduardo and Worrall, Stewart},
  journal={arXiv preprint arXiv:2304.02431},
  year={2023}
}
