Skip to main content

A python package for connecting with database.

Project description

HousingPricePredictionMLOps

Problem Statement

You are hired by a company, Home Realty Co Ltd. You are provided with a dataset containing details about various houses and their selling prices. The company is looking to optimize its profit by distinguishing between higher and lower priced houses. Your task is to help the company predict the price of a house based on the given attributes, so they can better assess the value of each property and make informed decisions to maximize their profit. Additionally, identify and provide the top 5 attributes that are the most important in determining the house prices.

Data Dictionary

  • Area - The total area of the house in square feet.
  • Bedrooms - The number of bedrooms in the house.
  • Bathrooms - The number of bathrooms in the house.
  • Stories - The number of stories (levels) in the house.
  • Mainroad - Indicates whether the house is located on a main road (Yes or No).
  • Guestroom - Indicates whether the house has a guestroom (Yes or No).
  • Basement - Indicates whether the house has a basement (Yes or No).
  • Hotwaterheating - Indicates whether the house has hot water heating (Yes or No).
  • Airconditioning - Indicates whether the house has air conditioning (Yes or No).
  • Parking - The number of parking spaces available with the house.
  • Prefarea - Indicates whether the house is in a preferred area (Yes or No).
  • Furnishingstatus - The furnishing status of the house, with options: Not Furnished, Semi-Furnished, Furnished.
  • Price - The selling price of the house.

Create project template hierarchy

python template.py

Setup development environment

bash init_setup.sh

Acivate environment

source activate ./env

Install project as local package

pip install -r requirement.txt

Pipelines

Training Pipeline

  • Data Ingestion (fetched data from source)
  • Data Transformation (Feature Engineering, Data Preprocessing)
  • Model Builing (Create a model using the processed data)

MLFlow

python src/HousingPricePrediction/pipelines/training_pipeline.py
mlflow ui

Command to train the pipeline

python src\HousingPricePrediction\pipelines\training_pipeline.py

Prediction Pipeline

  • Two types of prediction pipeline
  • Single record prediction
  • Batch prediction

Explainer Dashboard

  • Feature Importance
  • Regression Stats
  • Individual Predictions
  • What if?
  • Feature Dependence
python dashboard.py

Flask App

python app.py

Streamlit App

streamlit run streamlit_app.py

Data version control (DVC)

dvc init
dvc add notebooks/data/Housing.csv
git add .
git commit -m "Add data"
git push
git log
git checkout <commit ID>
dvc checkout

Deployment of DockerImage

docker build -t housing_price_prediction .
docker run -p 8000:5000 housing_price_prediction
docker login
docker tag housing_price_prediction asangkumarsingh/unique_housing_price_prediction
docker push asangkumarsingh/unique_housing_price_prediction

Docker hub repo:

Dataset Link

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

uniquehousingpricepredictionmlops-0.0.1.tar.gz (9.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

File details

Details for the file uniquehousingpricepredictionmlops-0.0.1.tar.gz.

File metadata

File hashes

Hashes for uniquehousingpricepredictionmlops-0.0.1.tar.gz
Algorithm Hash digest
SHA256 9d962fbae4028a5e9a7e0c995c63f17b4e09e8adde6b7ddbfb195208872932ed
MD5 e1ebba0209da18e70afb23d346596b32
BLAKE2b-256 b9be92fc740bb1928e5f73853741624e11731da2aa580e736fb70fdb51d6310e

See more details on using hashes here.

File details

Details for the file uniqueHousingPricePredictionMLOps-0.0.1-py3-none-any.whl.

File metadata

File hashes

Hashes for uniqueHousingPricePredictionMLOps-0.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 dade52a624e6c586403a154f9e771029cc6ea289a30742e6ef141100b0ceaca5
MD5 3cbbf27af3bd64543b926cc3519c5816
BLAKE2b-256 43186b316e8c0594b04247c7fc5701d3ca87fd7c2a93b5e03178993617f052ab

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page