ImmoEliza Real Estate Project

Part 2: Data Analysis

Description

The ImmoEliza Real Estate Project focuses on analyzing and preparing real estate data to build a predictive machine-learning model for property valuation in Belgium. This phase involves:

Data Cleaning: Removing duplicates, handling missing values, and correcting errors.
Data Preprocessing and Analysis: Exploring correlations, identifying key variables, and gaining insights through visualization.
Data Interpretation: Summarizing findings to make strategic decisions for real estate investments. The analysis highlights property trends, influential factors on pricing, and regional comparisons across Belgium, Wallonia, and Flanders. The results can be found in the presentation of findings in /presentation/immo-eliza_analysis.pptx.

The dataset raw_data.csv stems from phase 1 of the ImmoEliza project. The repository for the data scraping can be found under https://github.com/BeatrizJover/Immo-Eliza-project.

Installation

Clone the repository

git clone https://github.com/Alkszo/immo_eliza_analysis.git

Navigate to the repository directory:
```
cd <filepath>/immo_eliza_analysis
```
Install required dependencies:
```
pip install -r requirements.txt
```
Ensure Python 3.8+ is installed. Libraries used include pandas, matplotlib, seaborn, and numpy.
Check Input and Output Directory
- Ensure that raw_data.csv and cleaned-data.csv

Project File Structure

    immo_eliza_analysis/
    ├── graphs/
    ├── notebooks/
    │   ├── Alek-notebook.ipynb
    │   ├── Celina-analysis.ipynb
    │   ├── Celina-cleaning.ipynb
    │   ├── Miriam-analysis.ipynb
    │   └── Miriam-cleaning-preprocessing.ipynb
    ├── presentation/
    │   └── immo-eliza_analysis.pptx
    ├── README.md
    ├── cleaned-data.csv
    ├── data-cleaning.ipynb
    ├── data-analysis.ipynb
    ├── raw_data.csv
    └── requirements.txt

Usage

Run the Notebooks
- data_cleaning.ipynb: Cleans the dataset by removing duplicates and handling missing values and outputs the file cleaned_data.csv.
- data_analysis.ipynb: Explores variables, performs data preprocessing and visualizes data trends, and generates graphs showing data distribution and correlation, based on which it identifies key variables and draws insights.
Visualizations

The resulting visualizations can be found in data_analysis.ipynb as well as in the /graphs folder.

Key charts and graphs include:
- Plots visualizing outliers
- Bar charts for pricing across regions and per range of living area in sqm
- Bar charts for the most and least expensive municipalities in Belgium, Wallonia and Flanders
- Histogram of property sizes
- Feature correlation heatmaps
How to Use
- Launch the notebooks in JupyterLab or your preferred environment.
- Execute cells sequentially and review visualizations and findings.

Sample Visualizations

Contributor Notebooks

The folder /notebooks contains one notebook each, showcasing of each contributor. These notebooks are only meant to demonstrate the data exploration, analysis and experimentation process.

For the final results, please refer to the file data-analysis.ipynb.

Contributors

Aleksander Szostakowski
Celina Bolanos
Miriam Stoehr

Timeline

Challenge Duration: 3 Days

Day 1: Initial dataset review and cleaning
Day 2: Preprocessing and visualization creation
Day 3: Analysis interpretation and documentation

Personal Situation

The challenge is part of the BeCode Data Science and AI Bootcamp. This phase of the project builds essential skills in data cleaning, preprocessing, and visualization, providing foundational insights to inform later machine learning tasks as well as to develop effective storytelling skills though data.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

ImmoEliza Real Estate Project

Description

Installation

Project File Structure

Usage

Sample Visualizations

Contributor Notebooks

Contributors

Timeline

Personal Situation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 4

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 79 Commits
graphs		graphs
notebooks		notebooks
presentation		presentation
README.md		README.md
cleaned-data.csv		cleaned-data.csv
data-cleaning.ipynb		data-cleaning.ipynb
data_analysis.ipynb		data_analysis.ipynb
feedback.md		feedback.md
raw_data.csv		raw_data.csv
requirements.txt		requirements.txt

Alkszo/immo_eliza_analysis

Folders and files

Latest commit

History

Repository files navigation

ImmoEliza Real Estate Project

Description

Installation

Project File Structure

Usage

Sample Visualizations

Contributor Notebooks

Contributors

Timeline

Personal Situation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 4

Uh oh!

Languages

Packages