8000 GitHub - althaf-07/test
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

althaf-07/test

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

2 Commits
ย 
ย 

Repository files navigation


๐Ÿ“Œ Project Overview

  • Author Name: Althaf Muhammad
  • Author Contact: Email and Github
  • Dataset Source: Kaggle - Housing Price Prediction Data
  • Project Name: House Price Prediction
  • Objective: Build a Machine Learning model to predict house prices.
  • Training Strategy: Batch Training
  • Type of ML: Supervised Learning
  • Type of Problem: Regression
  • Evaluation Metrics: RMSE, MAE, and $R^2$ Score
  • Python Version: 3.12.10
  • Python Dependency Manager: uv (v0.7.2)
  • Hardware: No hardware limitations. This is a very lightweight and simple project that is runnable on any modern machine, including local setups, Google Colab, and Kaggle.
  • OS: This project was initially developed on a Debian Devcontainer machine. I tried to maintain this project OS independent. But there is no guarantee that it is.
  • Project Configuration and Dependencies: Listed in pyproject.toml

Project Structure

house-price-pred-ml/                     # Root project directory
โ”œโ”€โ”€ .devcontainer/                       # Dev Container directory
โ”‚   โ”œโ”€โ”€ devcontainer.env                 # Dev Container environment variables file passed to Docker
โ”‚   โ”œโ”€โ”€ devcontainer.json                # Dev Container configuration file
โ”‚   โ”œโ”€โ”€ Dockerfile                       # Dockerfile used to build the Dev Container
โ”‚   โ””โ”€โ”€ postCreateCommand.sh             # Shell script run after Dev Container build completes
โ”œโ”€โ”€ .git/                                # Git version control metadata directory
โ”œโ”€โ”€ .venv/                               # Local Python virtual environment directory
โ”œโ”€โ”€ data/                                # Directory used to store all dataset-related files
โ”‚   โ”œโ”€โ”€ interim/                         # Directory for intermediate datasets processed from raw data
โ”‚   โ”‚   โ””โ”€โ”€ interim.csv                  # CSV file containing intermediate processed data
โ”‚   โ”œโ”€โ”€ processed/                       # Directory for storing train-test split datasets
โ”‚   โ”‚   โ”œโ”€โ”€ test.csv                     # CSV file containing the test split of the dataset
โ”‚   โ”‚   โ””โ”€โ”€ train.csv                    # CSV file containing the train split of the dataset
โ”‚   โ””โ”€โ”€ raw/                             # Directory for unprocessed raw data files
โ”‚       โ””โ”€โ”€ raw.csv                      # CSV file containing raw dataset
โ”œโ”€โ”€ logs/                                # Directory for storing log files generated during execution
โ”œโ”€โ”€ notebooks/                           # Directory used for storing Jupyter notebooks for experimentation
โ”‚   โ””โ”€โ”€ 1-althaf07-experimentation.ipynb # Notebook for initial experimentation and data analysis
โ”œโ”€โ”€ reports/                             # Directory for project reports and documentation
โ”‚   โ”œโ”€โ”€ figures/                         # Directory for saving generated figures and plots
โ”‚   โ”‚   โ””โ”€โ”€ univariate/                  # Directory for univariate analysis figures
โ”‚   โ”‚       โ”œโ”€โ”€ bathrooms.png
โ”‚   โ”‚       โ”œโ”€โ”€ bedrooms.png
โ”‚   โ”‚       โ”œโ”€โ”€ neighborhood.png
โ”‚   โ”‚       โ”œโ”€โ”€ numc_describe.md
โ”‚   โ”‚       โ”œโ”€โ”€ price.png
โ”‚   โ”‚       โ”œโ”€โ”€ square_feet.png
โ”‚   โ”‚       โ””โ”€โ”€ year_built.png
โ”‚   โ”œโ”€โ”€ environment.md                   # Markdown file documenting the environment setup
โ”‚   โ””โ”€โ”€ experiment_document.md           # Markdown file detailing experiment results and analysis
โ”œโ”€โ”€ src/                                 # Source code root directory
โ”‚   โ””โ”€โ”€ house_price_pred_ml/             # Main package containing all project modules
โ”‚       โ”œโ”€โ”€ plot/                        # Plotting utilities module
โ”‚       โ”‚   โ””โ”€โ”€ univariate/              # Module for univariate plot functions
โ”‚       โ”‚       โ”œโ”€โ”€ cat_numd.py          # Plots for categorical vs numerical data distributions
โ”‚       โ”‚       โ””โ”€โ”€ numc.py              # Plots for numerical data distributions
โ”‚       โ”œโ”€โ”€ __init__.py                  # Initializes the house_price_pred_ml Python package
โ”‚       โ”œโ”€โ”€ api.py                       # Defines API endpoints for model serving
โ”‚       โ”œโ”€โ”€ auto_gen_table.py            # Automatically generates data summary tables
โ”‚       โ”œโ”€โ”€ config.py                    # Handles configuration loading logic
โ”‚       โ”œโ”€โ”€ config.yaml                  # YAML configuration file for model parameters and settings
โ”‚       โ”œโ”€โ”€ evaluate.py                  # Evaluates model performance metrics
โ”‚       โ”œโ”€โ”€ predict.py                   # Generates predictions using the trained model
โ”‚       โ”œโ”€โ”€ process_data.py              # Processes raw data into cleaned, usable format
โ”‚       โ”œโ”€โ”€ split_data.py                # Splits dataset into training and testing sets
โ”‚       โ”œโ”€โ”€ train.py                     # Trains the machine learning model
โ”‚       โ”œโ”€โ”€ tree.py                      # Implements tree-based model utilities
โ”‚       โ””โ”€โ”€ utils.py                     # Utility functions for data processing and evaluation
โ”œโ”€โ”€ tmp/                                 # Temporary directory for intermediate or disposable files
โ”œโ”€โ”€ .gitignore                           # Specifies files and directories ignored by Git
โ”œโ”€โ”€ .pre-commit-config.yaml              # Configuration for pre-commit hooks to enforce code standards
โ”œโ”€โ”€ .python-version                      # Specifies the Python version used in the project
โ”œโ”€โ”€ Dockerfile                           # Dockerfile for building the project container for deployment
โ”œโ”€โ”€ pyproject.toml                       # Configuration file for Python dependencies and build system
โ”œโ”€โ”€ README.md                            # Main project documentation and usage instructions
โ””โ”€โ”€ uv.lock                              # Lock file for managing UV-based dependencies

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published
0