The project consists of a data analysis based on data mining tools and performed using Python. The given original dataset consists of 2 .csv files, tweets.csv and users.csv, from which we extracted the data for the project. The work is broken down in four phases:
- Task 1 which includes the two sub-tasks of data understanding and data preparation
- Task 2 which is the clustering of our processed data
- Task 3 comprising a predictive analysis on the data through classification models
- Task 4 in which time series were developed and analysed based off the data by days.
Developed by: Veronica Pistolesi, Francesca Poli
Academic year: 2022/2023
Master degree: Computer Science, Artificial Intelligence curriculum