A package to train machine learning models on a housing dataset

Project description

Median housing value prediction

The housing data can be downloaded from https://raw.githubusercontent.com/ageron/handson-ml/master/. The scripts include code to download the data. The package models the median house value from this housing data.

The following techniques have been used:

  • Linear regression
  • Decision Tree
  • Random Forest

Steps performed

  • The data is prepared and cleaned. Missing values are checked and imputed.
  • Features are generated and the variables are checked for correlation.
  • Multiple sampling techniques are evaluated. The data set is split into train and test sets.
  • All of the modelling techniques listed above are tried and evaluated. The final evaluation metric is mean squared error.
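The imputation, splitting, and scoring steps can be illustrated with a minimal sketch. It uses only the Python standard library and toy numbers. It is not the package's own code, which may well rely on other libraries. The function names, the median imputation strategy, the plain random split, and the mean baseline are all assumptions made for illustration.

```python
import random
import statistics


def impute_median(values):
    """Replace None entries with the median of the observed values."""
    observed = [v for v in values if v is not None]
    median = statistics.median(observed)
    return [median if v is None else v for v in values]


def train_test_split(rows, test_ratio=0.2, seed=42):
    """Shuffle a copy of rows and split it into (train, test)."""
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)
    n_test = int(len(shuffled) * test_ratio)
    return shuffled[n_test:], shuffled[:n_test]


def mean_squared_error(y_true, y_pred):
    """Average of the squared differences between targets and predictions."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


# Toy numeric column with gaps (hypothetical values, not real housing data).
column = [3.0, None, 5.0, 4.0, None, 8.0]
filled = impute_median(column)

# Simple random split; the package evaluates several sampling techniques.
train, test = train_test_split(filled, test_ratio=0.5)

# Score a trivial baseline that always predicts the training mean.
baseline = statistics.mean(train)
mse = mean_squared_error(test, [baseline] * len(test))
print("baseline MSE:", mse)
```

A real model would replace the mean baseline, but the metric is computed the same way: lower mean squared error on the test set indicates a better fit.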

Setup for development

Create conda environment

conda env create -f env.yml
conda activate <env_name>

Perform test

Tox has been configured with pytest to automate testing in a virtualenv.

tox

Test a specific test file:

tox -- -k <file_name>

Usage

Install package

Option 1. From GitHub:

git clone https://github.com/TejaML/mle-training.git
cd mle-training
pip install .

Option 2. From PyPI:

pip install housing-prediction

Test installation:

To check that the package was installed successfully, start a Python session and try to import housing. If the import succeeds, the installation is complete.

python
>>> import housing

Either install option installs the housing package together with all of its dependencies.
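The same check can be scripted, for example in a setup or CI step. The sketch below uses the standard library's importlib.util.find_spec to look up a top-level module without importing it. The helper name is_installed is hypothetical and is not part of the housing package.

```python
import importlib.util


def is_installed(module_name):
    """Return True if a top-level module can be located on the import path."""
    return importlib.util.find_spec(module_name) is not None


# Reports whether the housing package is visible to this interpreter.
print("housing installed:", is_installed("housing"))
```

Because the module is located rather than imported, this confirms only that the package is present; an actual import, as shown above, additionally confirms that it loads without errors.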

Run scripts

There are two ways to run the scripts: as a single command line tool or as individual Python modules.

  • As a command line tool

housing

  • As Python modules

python -m housing.ingest_data
python -m housing.train
python -m housing.score
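The three module invocations can also be chained from Python. This is a minimal sketch, assuming the stages are meant to run in the order listed above (ingest, train, score) and that each module exits with a non-zero status on failure. The names STAGES, build_command, and run_pipeline are hypothetical helpers, not part of the housing package.

```python
import subprocess
import sys

# Stage order follows the order in which the modules are listed (assumption).
STAGES = ["housing.ingest_data", "housing.train", "housing.score"]


def build_command(module, extra_args=()):
    """Build the 'python -m <module>' command for the current interpreter."""
    return [sys.executable, "-m", module, *extra_args]


def run_pipeline(stages=STAGES, runner=subprocess.run):
    """Run each stage in order; check=True stops at the first failing stage."""
    for module in stages:
        runner(build_command(module), check=True)
```

Calling run_pipeline() in an environment where the package is installed runs the three stages one after another. The runner parameter exists only so the sequence can be exercised without launching real processes.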

You can also pass arguments to the scripts. To list all available arguments:

housing --help
python -m housing.ingest_data --help
python -m housing.train --help
python -m housing.score --help

Project details
This version

0.4

Download files

Source Distribution

housing-prediction-0.4-0.4.tar.gz (23.8 kB)

Built Distribution

housing_prediction_0.4-0.4-py3-none-any.whl (26.9 kB, Python 3)