A package to train machine learning models on housing dataset
Project description
Median housing value prediction
The housing data can be downloaded from https://raw.githubusercontent.com/ageron/handson-ml/master/. The script has codes to download the data. We have modelled the median house value on given housing data.
The following techniques have been used:
- Linear regression
- Decision Tree
- Random Forest
Steps performed
- We prepare and clean the data. We check and impute for missing values.
- Features are generated and the variables are checked for correlation.
- Multiple sampling techinuqies are evaluated. The data set is split into train and test.
- All the above said modelling techniques are tried and evaluated. The final metric used to evaluate is mean squared error.
Setup for development
Create conda environment
conda env create -f env.yml
conda activate <env_name>
Perform test
Tox have been configured with pytest to automate testing in virtualenv.
tox
Test a specific test file:
tox -- -k <file_name>
Usage
Install package
Option 1. From github:
git clone https://github.com/TejaML/mle-training.git
cd mle-training
pip install .
Option 2. From PyPi
pip install housing-prediction
Test installation:
To test whether the package is successfully installed or not, start python session, and try to import housing. If it's imported successfully, then installation is complete
python
>>> import housing
It will install all the dependencies and the housing package
Run scripts
There are two ways to run the scripts, as single command line tool and as python scripts.
-
As command line tool
housing
-
As python scripts
python -m housing.ingest_data
python -m housing.train
python -m housing.score
You can also access pass arguments, to find all available arguments:
housing --help
python -m housing.ingest_data --help
python -m housing.train
python -m housing.score
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for housing-prediction-0.4-0.4.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | 29b86e3a6b40b4cd7c6fe7074f36b305e09e8288587c26c87f1ba18d39a26044 |
|
MD5 | b88b08d2dc83d405affeb5a89800e684 |
|
BLAKE2b-256 | 72145b3b74bed560acfac4b63bc6511378cd305bf2b4b78f36bcca47c7235f8e |
Hashes for housing_prediction_0.4-0.4-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | e78e3d652211600d38600d0f4b73b4a92f64fe705b0a6638d79d91dc8ac67564 |
|
MD5 | 7dcc70d6b465aa923b613cbd85f5ed93 |
|
BLAKE2b-256 | b818fc53180c1ea78e3ba8956c63b786d8b65a053f36801500c6d25fce8e7d4c |