Skip to main content

A python package for connecting with database.

Project description

Housing_Price_Prediction

Problem Statement

You are hired by a company, Home Realty Co Ltd. You are provided with a dataset containing details about various houses and their selling prices. The company is looking to optimize its profit by distinguishing between higher and lower priced houses. Your task is to help the company predict the price of a house based on the given attributes, so they can better assess the value of each property and make informed decisions to maximize their profit. Additionally, identify and provide the top 5 attributes that are the most important in determining the house prices.

Data Dictionary

  • Area - The total area of the house in square feet.
  • Bedrooms - The number of bedrooms in the house.
  • Bathrooms - The number of bathrooms in the house.
  • Stories - The number of stories (levels) in the house.
  • Mainroad - Indicates whether the house is located on a main road (Yes or No).
  • Guestroom - Indicates whether the house has a guestroom (Yes or No).
  • Basement - Indicates whether the house has a basement (Yes or No).
  • Hotwaterheating - Indicates whether the house has hot water heating (Yes or No).
  • Airconditioning - Indicates whether the house has air conditioning (Yes or No).
  • Parking - The number of parking spaces available with the house.
  • Prefarea - Indicates whether the house is in a preferred area (Yes or No).
  • Furnishingstatus - The furnishing status of the house, with options: Not Furnished, Semi-Furnished, Furnished.
  • Price - The selling price of the house.

Create project template hierarchy

python template.py

Setup development environment

bash init_setup.sh

Acivate environment

source activate ./env

Install project as local package

pip install -r requirement.txt

Pipelines

Training Pipeline

  • Data Ingestion (fetched data from source)
  • Data Transformation (Feature Engineering, Data Preprocessing)
  • Model Builing (Create a model using the processed data)

MLFlow

python src/HousingPricePrediction/pipelines/training_pipeline.py
mlflow ui

Command to train the pipeline

python src\HousingPricePrediction\pipelines\training_pipeline.py

Prediction Pipeline

  • Two types of prediction pipeline
  • Single record prediction
  • Batch prediction

Explainer Dashboard

  • Feature Importance
  • Regression Stats
  • Individual Predictions
  • What if?
  • Feature Dependence
python dashboard.py

Flask App

python app.py

Streamlit App

streamlit run streamlit_app.py

Data version control (DVC)

dvc init
dvc add notebooks/data/Housing.csv
git add .
git commit -m "Add data"
git push
git log
git checkout <commit ID>
dvc checkout

Deployment of DockerImage

docker build -t housing-price-prediction .
docker run -p 5000:5000 housing-price-prediction
docker push asangkumarsingh/projecthousingpriceprediction

Docker hub repo:

Dataset Link

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

uniquehousepriceprediction-4.0.1.tar.gz (10.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

uniqueHousePricePrediction-4.0.1-py3-none-any.whl (12.7 kB view details)

Uploaded Python 3

File details

Details for the file uniquehousepriceprediction-4.0.1.tar.gz.

File metadata

File hashes

Hashes for uniquehousepriceprediction-4.0.1.tar.gz
Algorithm Hash digest
SHA256 5dfe8b8c1eb5f11240d4f3981d6100063789f0dae5eec9e870fa74968ad87308
MD5 d872157d944f9801893591a6a86da570
BLAKE2b-256 3872cea77deb1c9b622e33eee31160f43a6bb167315d4d726492d64d5a32938c

See more details on using hashes here.

File details

Details for the file uniqueHousePricePrediction-4.0.1-py3-none-any.whl.

File metadata

File hashes

Hashes for uniqueHousePricePrediction-4.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 6f36208fc84f1844598dc00df2b2b50b5a12fe439b78a8b1b88094dee104b25f
MD5 894e52b29d74bc08628c38fbcc959e6c
BLAKE2b-256 4a52787d1775a6ec4fc61269b2f2dea9cca7af9c7c8ff34ac77fd65000f00631

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page