Authors: Bowen Yin, Jiaolong Cao, Xuying Zhang, Yuming Chen, Ming-Ming Cheng, Qibin Hou*
This paper is still under review. The full code, the complete ImageNeXt dataset, and the model checkpoints will be publicly released upon acceptance.
This is the official repository of 'OmniSegmentor: A Flexible Multi-Modal Learning Framework for Semantic Segmentation'. The paper provides a large-scale multi-modal dataset (ImageNeXt) and a general multi-modal pretraining and finetuning framework, which you can use to pretrain more powerful multi-modal encoders and contribute to RGBX research.
Figure 1: Visualizations of our assembled ImageNeXt dataset. Built upon ImageNet [43], a widely used large-scale RGB classification dataset, ImageNeXt provides five popular visual modalities for each sample: RGB, Depth, LiDAR, Thermal, and Event.
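Since each ImageNeXt sample carries all five modalities, a data loader naturally returns one tensor per modality. The dataset is not yet released, so the directory layout, file names, and per-modality encodings in the minimal PyTorch sketch below are purely assumptions for illustration, not the authors' actual format:

```python
# Hypothetical sketch only: ImageNeXt is not yet released, so the layout
# <root>/<modality>/<sample_id>.png and the channel choices below are
# assumptions, not the released format.
import os
from typing import Dict, Sequence

import torch
from torch.utils.data import Dataset
from torchvision import transforms
from PIL import Image

# The five modalities provided for each ImageNeXt sample.
MODALITIES = ["rgb", "depth", "lidar", "thermal", "event"]


class ImageNeXtSample(Dataset):
    """Loads all five modalities for each sample id (assumed layout)."""

    def __init__(self, root: str, sample_ids: Sequence[str]):
        self.root = root
        self.sample_ids = list(sample_ids)
        self.to_tensor = transforms.ToTensor()

    def __len__(self) -> int:
        return len(self.sample_ids)

    def __getitem__(self, idx: int) -> Dict[str, torch.Tensor]:
        sample_id = self.sample_ids[idx]
        sample = {}
        for m in MODALITIES:
            path = os.path.join(self.root, m, f"{sample_id}.png")
            # Assume non-RGB modalities are stored as single-channel images.
            img = Image.open(path).convert("RGB" if m == "rgb" else "L")
            sample[m] = self.to_tensor(img)
        return sample
```

A multi-modal encoder would then consume the returned dict, picking whichever subset of modalities a given downstream task provides.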