Last Update: September 2022.
FeatSet+ is a dataset with visual features (color, texture, and shape) extracted from 17 open image datasets from the Literature. FeatSet+ supports different analyses involving machine learning, image analysis, and content-based image retrieval (CBIR). Moreover, the dataset can serve researchers seeking to solve problems related to the study of image features from different applications, such as emergency scenarios, medical cases, and object classification or recognition.
The FeatSet+ database schema is the following:
Each dataset in FeatSet+ follows the same schema, with (a) a metadata table containing the object identifier (OID), the filename (which is the same as the public dataset), and the set of classes (if any). The (b) set of FEM tables has the OID as a foreign key (FK), and every dimension of the feature vector is stored as a column. Additionally, (c) the Feature Equivalence Table provides the Feature Extraction Method (FEM) name, the feature ID, and the corresponding description of every visual feature provided by the FEM.
The complete description of FeatSet+ is given in the work [Cazzolato et al., 2022].
FeatSet+ is available for researchers and data scientists under the GNU General Public License. In case of publication and/or public use of the available data, as well as any resource derived from it, one should acknowledge its creators by citing the following paper and the paper from which the original images were acquired (see the references in the description of the datasets, bellow).
Extended work:
[Cazzolato et al., 2022] CAZZOLATO, M. T.; SCABORA, L. C.; ZABOT, G. F.; GUTIERREZ, M. A.; TRAINA-Jr, C.; TRAINA, A. J. M.. FeatSet+: Visual Features Extracted from Public Image Datasets. In the JIDM Special Edition on Datasets of the Journal of Information and Data Management, Brazil. Vol. 13, n. 1. DOI: 10.5753/jidm.2022.2328. 2022.
Bibtex:
@article{CazzolatoEtAl2022,
author = {Mirela T. Cazzolato and
Lucas C. Scabora and
Guilherme F. Zabot and
Marco A. Gutierrez and
Caetano Traina Jr. and
Agma J. M. Traina},
title = {FeatSet+: Visual Features Extracted from Public Image Datasets},
journal = {Journal of Information and Data Management (JIDM)},
volume = {13},
number = {1},
year = {2022},
url = {https://doi.org/10.5753/jidm.2022.2328},
doi = {10.5753/jidm.2022.2328},
}
Original work:
[Cazzolato et al., 2021] CAZZOLATO, M. T.; SCABORA, L. C.; ZABOT, G. F.; GUTIERREZ, M. A.; TRAINA-Jr, C.; TRAINA, A. J. M.. A Compilation of Visual Features Extracted from Public Image Datasets. In the Brazilian Symposium on Databases - Dataset Showcase Workshop (SBBD-DSW), Virtual Conference, Brazil. 2021. (to appear)
Bibtex:
@inproceedings{CazzolatoEtAl2021,
author = {Mirela T. Cazzolato and
Lucas C. Scabora and
Guilherme F. Zabot and
Marco A. Gutierrez and
Caetano Traina-Jr. and
Agma J. M. Traina},
title = {FeatSet: A Compilation of Visual Features Extracted from Public Image Datasets},
booktitle = {Brazilian Symposium on Databases - Dataset Showcase Workshop (SBBD-DSW), Virtual, Brazil, October 4-8, 2021. (to appear)},
pages = {1--12},
year = {2021}
}
SQL-Scripts-Link: Click here to download the SQL scripts used to load the data.
CSV-File-Link: Click here to download the CSV files with the data.
Following, we provide the description of the image datasets composing FeatSet+:
Index:
Dataset ds-BoWFire
Dataset ds-Flickr-Fire
Dataset ds-Mammoset
Dataset ds-LibraGestures
Dataset ds-Food5k
Dataset ds-Flickr-FireSmoke
Dataset ds-Covid19
Dataset ds-COIL100
Dataset ds-CUB-200-2011
Dataset ds-Letters
Dataset ds-Cars
Dataset ds-Food-11
Dataset ds-Dogs
Dataset ds-DeepLesion
Dataset ds-AwA
Dataset ds-MNIST
Dataset ds-CompCars
University of SĂŁo Paulo
Institute of Mathematics and Computer Science (USP-ICMC)
Database and Image Group (GBdI)
https://bitbucket.org/gbdi/bowfire-dataset
October 9th, 2020
Creative Commons
Images depicting fire incidents from emergency situations. The available features were extracted from the entire images of the dataset (train files). Every image has a label in L={fire, not fire}.
For further information, please refer to the original source.
@inproceedings{dsBowFire2015,
author = {Daniel Y. T. Chino and Letricia P. S. Avalhais and Jos{\'{e}} F. Rodrigues-Jr. and Agma J. M. Traina},
title = {BoWFire: Detection of Fire in Still Images by Integrating Pixel Color and Texture Analysis},
booktitle = {28th {SIBGRAPI} Conference on Graphics, Patterns and Images, {SIBGRAPI} 2015, Salvador, Bahia, Brazil, August 26-29, 2015},
pages = {95--102},
publisher = {{IEEE} Computer Society},
year = {2015},
url = {https://doi.org/10.1109/SIBGRAPI.2015.19},
doi = {10.1109/SIBGRAPI.2015.19},
}
University of SĂŁo Paulo
Institute of Mathematics and Computer Science (USP-ICMC)
Database and Image Group (GBdI)
https://github.com/mtcazzolato/dsw2017
October 9th, 2020
Creative Commons
Images acquired from Flickr, using tags related to fire to filter the information. Every image has a label in L={flame, not flame}.
For further information, please refer to the original source.
@inproceedings{dsFlikrFire2015,
author = {Marcos Vinicius Naves Bedo and Gustavo Blanco and Willian D. Oliveira and Mirela T. Cazzolato and Alceu Ferraz Costa and Jos{\'{e}} Fernando Rodrigues-Jr. and Agma J. M. Traina and Caetano {Traina Jr.}},
editor = {Slimane Hammoudi and Leszek A. Maciaszek and Ernest Teniente},
title = {Techniques for Effective and Efficient Fire Detection from Social Media Images},
booktitle = {{ICEIS} 2015 - Proceedings of the 17th International Conference on Enterprise Information Systems, Volume 1, Barcelona, Spain, 27-30 April, 2015},
pages = {34--45},
publisher = {SciTePress},
year = {2015},
doi = {10.5220/0005341500340045},
}
University of SĂŁo Paulo
Institute of Mathematics and Computer Science (USP-ICMC)
Database and Image Group (GBdI)
https://bitbucket.org/gbdi/mammoset