california's housing price prediction ML model
Project description
Median housing value prediction
The housing data can be downloaded from https://raw.githubusercontent.com/ageron/handson-ml/master/. The script has codes to download the data. We have modelled the median house value on given housing data.
The following techniques have been used:
- Linear regression
- Decision Tree
- Random Forest with RandomizedSearchCV
- Random Forest with GridSearchCV
Steps performed
- We prepare and clean the data. We check and impute for missing values.
- Features are generated and the variables are checked for correlation.
- Multiple sampling techinuqies are evaluated. The data set is split into train and test.
- All the above said modelling techniques are tried and evaluated.
- Mean squared error, Root mean squaerd error, Mean absolute error metrics are used to evaluate the model
To excute the script's
There are three scripts need to run for evaluating the model
$ python ingest_data.py -p raw
you can run this script with specifying where you want to place the downloaded data and also with default arguments
$ python train.py -x housing_prepared.csv -y housing_labes.csv
you can run this script with specifying dependent and independent variables and also with no argument passed
$ python score.py -m final_model.pkl -d test_set.csv
you can run this script with specifying which ML model want to use and with what dataset to score metrics
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file california_house_price_predictor_7606-1.0.0.tar.gz
.
File metadata
- Download URL: california_house_price_predictor_7606-1.0.0.tar.gz
- Upload date:
- Size: 7.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.9.12
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | ddba66f66349bf949731fab2cd9dc633ab18619db688c8f7935bc9335fbc1ac2 |
|
MD5 | 4331e5322892813d99c4f8208e050750 |
|
BLAKE2b-256 | feffd94a360a1ba5b63819c684a8c445f71fc57aa371353f313ff622c4dcd0b6 |
Provenance
File details
Details for the file california_house_price_predictor_7606-1.0.0-py3-none-any.whl
.
File metadata
- Download URL: california_house_price_predictor_7606-1.0.0-py3-none-any.whl
- Upload date:
- Size: 8.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.9.12
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 363522aa91439597d461489682f01c056fc97bd027c396233aff79018ada924e |
|
MD5 | a0ee9b574eea207239156b6d1792e616 |
|
BLAKE2b-256 | 5a51726d945fffddf994686a1464fc4ba96493a35731791bf87ad53c32f9946c |