This dataset can be used for:
- 🧪 Developing and testing localization algorithms using real-world data collected across various seasons, times of day, and weather conditions.
- 📚 Educational and research projects on multimodal localization, machine learning, and computer vision.
- 📈 Benchmarking and comparative analysis of global localization algorithms.
- 🎯 Creating machine learning models robust to environmental changes and external conditions.
To install the package, clone the repository and install it using pip:
git clone https://github.com/OPR-Project/ITLP-Campus.git
cd ITLP-Campus
pip install .
- CPU: 6 or more physical cores
- RAM: at least 8 GB
- GPU: NVIDIA RTX 2060 or higher (to ensure adequate performance)
- Video memory: at least 4 GB
- Storage: SSD recommended for faster loading of data and models
Operating System:
- Any OS with support for Docker and CUDA >= 11.1. Ubuntu 20.04 or later is recommended.
Dependencies:
- Python >= 3.10
- CUDA Toolkit >= 11.1
- cuDNN >= 7.5
Other dependencies are listed in the pyproject.toml
file and will be installed automatically when you install the package.
You can download the dataset manually from the following links:
- Kaggle:
- Hugging Face:
Or you can use the download_dataset.py
script to download the dataset automatically. The script downloads the dataset from Hugging Face and saves it into the specified directory.
# --outdoor and --indoor are optional flags selecting which parts of the dataset to download
python download_dataset.py \
    --output_dir /path/to/dataset \
    --outdoor \
    --indoor
The dataset is distributed under the CC BY 4.0 license.
Sensor | Model | Resolution |
---|---|---|
Front cam | ZED (stereo) | 1280x720 |
Back cam | RealSense D435 | 1280x720 |
LiDAR | VLP-16 | 16x1824 |
The data are organized by tracks. Each track is about 3 km long and includes about 600 frames, with a distance of ~5 m between adjacent frames.
The structure of track data storage is as follows:
Track ##
├── back_cam
│ ├── ####.png
│ └── ####.png
├── front_cam
│ ├── ####.png
│ └── ####.png
├── masks
│ ├── back_cam
│ │ ├── ####.png
│ │ └── ####.png
│ └── front_cam
│ ├── ####.png
│ └── ####.png
├── text_descriptions
│ ├── back_cam_text.csv
│ └── front_cam_text.csv
├── text_labels
│ ├── back_cam_text_labels.csv
│ └── front_cam_text_labels.csv
├── aruco_labels
│ ├── back_cam_aruco_labels.csv
│ └── front_cam_aruco_labels.csv
├── lidar
│ ├── ####.bin
│ └── ####.bin
├── demo.mp4
├── track.csv
├── meta_info.yml
└── track_map.png
where
- `####` - file name: the timestamp of the image/scan (virtual timestamp of the moment the image/scan was taken)
- `.bin` files - LiDAR scans in binary format
- `.png` files - images and semantic masks
- `.csv` files:
  - `<cam>_text.csv` - text description of the scene for front and back camera images (image timestamp, text description of the scene)
  - `<cam>_aruco_labels.csv` - information about ArUco tags (image timestamp, tag bboxes and their IDs)
  - `<cam>_text_labels.csv` - information about only those images with text markings (image timestamp, marking bbox, text on the marking)
  - `track.csv` - timestamp mapping for all data and 6DoF robot poses
- `meta_info.yml` - meta information about the track
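The `.bin` LiDAR scans can be read as flat binary arrays. A minimal sketch, assuming the common KITTI-style layout of float32 `(x, y, z, intensity)` quadruples per point (verify against the actual files, as the layout is not stated above):

```python
import numpy as np

def read_lidar_bin(path: str) -> np.ndarray:
    """Read a LiDAR scan stored as a flat float32 binary file.

    Assumes the KITTI-style layout: N points of (x, y, z, intensity),
    i.e. 4 float32 values per point.
    """
    scan = np.fromfile(path, dtype=np.float32)
    return scan.reshape(-1, 4)

# Round-trip demo with a small synthetic scan (3 points).
points = np.array(
    [[1.0, 2.0, 0.5, 0.9],
     [3.0, -1.0, 0.2, 0.4],
     [0.0, 0.0, 1.0, 1.0]],
    dtype=np.float32,
)
points.tofile("demo_scan.bin")
scan = read_lidar_bin("demo_scan.bin")
print(scan.shape)  # (3, 4)
```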
An example of an outdoor track trajectory (track_map.png):
An example of an indoor track trajectory (track_map.png):
Track | Frames, pcs | Front cam, res | Back cam, res | LiDAR, rays | 6 DoF pose | Semantic masks | Aruco tag | OCR Text labels |
---|---|---|---|---|---|---|---|---|
2023-03-13 | 3883 | 1280x720 | 1280x720 | 16 | ☑ | 1280x720, 150 classes | ☑ | ☑ |
00_2023-10-25-night | 1233 | 1280x720 | 1280x720 | 16 | ☑ | 1280x720, 150 classes | ☑ | ☑ |
01_2023-11-09-twilight | 1310 | 1280x720 | 1280x720 | 16 | ☑ | 1280x720, 150 classes | ☑ | ☑ |
6 DoF poses are obtained using Cartographer SLAM with global localization in a pre-built map.
Semantic masks are obtained using the OneFormer model pre-trained on the ADE20K dataset.
The masks are stored as mono-channel images; each pixel stores a semantic label. Examples of semantic information are shown in the table below:
Label | Semantic class | Color, [r, g, b] |
---|---|---|
... | ... | ... |
14 | door; double door | [8, 255, 51] |
23 | sofa; couch; lounge | [11, 102, 255] |
67 | book | [255, 163, 0] |
124 | microwave; microwave oven | [255, 0, 235] |
... | ... | ... |
The semantic markup contains a total of 150 classes. A complete table of all semantic classes is given in cfg/indoor_anno_description.md. To map the label IDs to RGB colors, use the configuration file cfg/indoor_anno_config.json.
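Mapping label IDs to colors amounts to a palette lookup over the mono-channel mask. A minimal sketch using the four example classes from the table above (the full 150-class mapping lives in cfg/indoor_anno_config.json, whose exact JSON structure is not shown here):

```python
import numpy as np

# Label -> RGB mapping taken from the example rows in the table above.
PALETTE = {
    14: (8, 255, 51),     # door; double door
    23: (11, 102, 255),   # sofa; couch; lounge
    67: (255, 163, 0),    # book
    124: (255, 0, 235),   # microwave; microwave oven
}

def colorize_mask(mask: np.ndarray) -> np.ndarray:
    """Convert a mono-channel label mask (H, W) into an RGB image (H, W, 3)."""
    rgb = np.zeros((*mask.shape, 3), dtype=np.uint8)
    for label, color in PALETTE.items():
        rgb[mask == label] = color
    return rgb

# Tiny 2x2 demo mask covering the four example labels.
mask = np.array([[14, 23], [67, 124]], dtype=np.uint8)
rgb = colorize_mask(mask)
print(rgb[0, 0])  # [  8 255  51]
```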
An example of a mask over the image:
The outdoor part of this dataset was recorded on the Husky robotic platform on the university campus and consists of 9 tracks recorded at different times of day (day/twilight/night) and in different seasons (winter/spring/summer/autumn).
Track | Season | Time of day | Frames, pcs | Front cam, res | Back cam, res | LiDAR, rays | 6 DoF pose | Semantic masks | Aruco tag | OCR Text labels |
---|---|---|---|---|---|---|---|---|---|---|
00_2023-02-21 | winter | day | 620 | 1280x720 | 1280x720 | 16 | ☑ | front + back, 1280x720, 65 classes | ☑ | ☑ |
01_2023-03-15 | winter | night | 626 | 1280x720 | 1280x720 | 16 | ☑ | front + back, 1280x720, 65 classes | ☑ | ☑ |
02_2023-02-10 | winter | twilight | 609 | 1280x720 | 1280x720 | 16 | ☑ | front + back, 1280x720, 65 classes | ☑ | ☑ |
03_2023-04-11 | spring | day | 638 | 1280x720 | 1280x720 | 16 | ☑ | front + back, 1280x720, 65 classes | ☑ | ☑ |
04_2023-04-13 | spring | night | 631 | 1280x720 | 1280x720 | 16 | ☑ | front + back, 1280x720, 65 classes | ☑ | ☑ |
05_2023-08-15 | summer | day | 833 | 1280x720 | 1280x720 | 16 | ☑ | front + back, 1280x720, 65 classes | ☑ | ☑ |
06_2023-08-18 | summer | night | 831 | 1280x720 | 1280x720 | 16 | ☑ | front + back, 1280x720, 65 classes | ☑ | ☑ |
07_2023-10-04 | autumn | day | 896 | 1280x720 | 1280x720 | 16 | ☑ | front + back, 1280x720, 65 classes | ☑ | ☑ |
08_2023-10-11 | autumn | night | 895 | 1280x720 | 1280x720 | 16 | ☑ | front + back, 1280x720, 65 classes | ☑ | ☑ |
6 DoF poses are obtained using the ALeGO-LOAM localization method and refined with Interactive SLAM.
Semantic masks are obtained using the Oneformer pre-trained on the Mapillary dataset.
The masks are stored as mono-channel images; each pixel stores a semantic label. Examples of semantic information are shown in the table below:
Label | Semantic class | Color, [r, g, b] |
---|---|---|
... | ... | ... |
10 | Parking | [250, 170, 160] |
11 | Pedestrian Area | [96, 96, 96] |
12 | Rail Track | [230, 150, 140] |
13 | Road | [128, 64, 128] |
... | ... | ... |
The semantic markup contains a total of 65 classes.
An example of a mask over the image:
Check out the OpenPlaceRecognition library. It is a library for place recognition and localization that includes a collection of datasets, including ITLP-Campus. The library provides a unified API for loading datasets, training and evaluating models, and performing place recognition tasks. It is designed to be easy to use and extensible, allowing researchers and developers to quickly experiment with different models and datasets.
An implementation of a PyTorch dataset class for an ITLP-Campus track is provided in the src/itlp_campus/dataset.py file.
This class can be used to load a track's data as torch.Tensor objects.
Usage example:
from pathlib import Path

from itlp_campus.dataset import ITLPCampus  # import path assumes the src/itlp_campus package layout

track_dir = Path("/path/to/ITLP_Campus_outdoor/00_2023-02-21")
dataset = ITLPCampus(
dataset_root=track_dir, # track directory
sensors=["front_cam", "back_cam", "lidar"], # list of sensors for which you want to load data
load_semantics=True, # whether to return semantic masks for cameras
load_text_descriptions=False, # whether to return text descriptions for cameras
load_text_labels=False, # whether to return detected text labels for cameras
load_aruco_labels=False, # whether to return detected aruco labels for cameras
indoor=False, # indoor or outdoor track
)
data = dataset[0]  # returns a dictionary with the first frame of the track
Usage example:
track_dir = Path("/path/to/ITLP_Campus_indoor/2023-03-13")
dataset = ITLPCampus(
dataset_root=track_dir, # track directory
sensors=["front_cam", "back_cam", "lidar"], # list of sensors for which you want to load data
load_semantics=True, # whether to return semantic masks for cameras
load_text_descriptions=False, # whether to return text descriptions for cameras
load_text_labels=False, # whether to return detected text labels for cameras
load_aruco_labels=False, # whether to return detected aruco labels for cameras
indoor=True, # indoor or outdoor track
)
data = dataset[0]  # returns a dictionary with the first frame of the track
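Since the class is a standard PyTorch dataset, frames can be batched with torch.utils.data.DataLoader. A minimal sketch with a stand-in dataset (the real ITLPCampus needs the downloaded data on disk); the dict-of-tensors item format mirrors the examples above, but the exact keys and shapes here are assumptions:

```python
import torch
from torch.utils.data import DataLoader, Dataset

class DummyTrack(Dataset):
    """Stand-in for ITLPCampus: returns a dict of tensors per frame."""

    def __len__(self) -> int:
        return 4

    def __getitem__(self, idx: int) -> dict:
        return {
            "front_cam": torch.zeros(3, 720, 1280),  # C x H x W image tensor
            "pose": torch.zeros(7),                  # e.g. translation + quaternion
        }

# The default collate_fn stacks dict values into batched tensors.
loader = DataLoader(DummyTrack(), batch_size=2)
batch = next(iter(loader))
print(batch["front_cam"].shape)  # torch.Size([2, 3, 720, 1280])
```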
The code is under Apache 2.0 license.
The data is under CC BY 4.0 license.