Handling missing values in Python

Project description

HANDLING MISSING DATA

PROJECT 3, UCS633 - Data Analysis and Visualization
Nikhil Gupta  
COE17
Roll number: 101703371

The output is a dataset with no missing values, written to a new CSV file whose name is provided by the user.

Installation

pip install hmvpack_NG

Note that the package name uses an underscore, not a hyphen. If installation fails, or the package is not found after installing, try installing with sudo.

Recommended: try it out in a virtual environment first.

To use via command line

The package contains two functions, i.e. two ways of handling missing data. The first two arguments (input file and output file) are the same for both.

  1. Deleting the row with missing values.

HMVcli infile.csv outfile.csv D

  2. Replacing the missing values with the mean of the values of that particular feature.

HMVcli infile.csv outfile.csv R
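
The two strategies above can be illustrated with a minimal, standard-library sketch. This is not the package's actual implementation: the helper names (`is_missing`, `delete_rows`, `replace_with_mean`) and the set of tokens treated as missing are assumptions made for illustration, and every column is assumed to be numeric.

```python
import csv
import io

# Assumption: these tokens (case-insensitive) mark a missing value.
MISSING = {"", "na", "nan", "null"}


def is_missing(cell):
    """Return True if the cell text is one of the assumed missing tokens."""
    return cell.strip().lower() in MISSING


def delete_rows(rows):
    """Strategy D: keep only the rows that have no missing cell."""
    return [row for row in rows if not any(is_missing(c) for c in row)]


def replace_with_mean(rows):
    """Strategy R: fill each missing cell with the mean of its column."""
    filled = [list(row) for row in rows]
    for col in range(len(rows[0])):
        present = [float(r[col]) for r in rows if not is_missing(r[col])]
        if not present:
            continue  # nothing to average in a fully empty column
        mean = sum(present) / len(present)
        for r in filled:
            if is_missing(r[col]):
                r[col] = str(mean)
    return filled


# Small in-memory CSV: a header plus three rows, two with a gap.
sample = "a,b\n1,2\n,4\n5,\n"
header, *rows = list(csv.reader(io.StringIO(sample)))

deleted = delete_rows(rows)
replaced = replace_with_mean(rows)
print(deleted)
print(replaced)
```

Deletion shrinks the dataset, while replacement keeps every row at the cost of inserting estimated values, so the choice depends on how much data can be spared.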

To use in a .py script

For Deletion function ->

from hmvlib.models import delete_record
delete_record('infile.csv', 'outfile.csv')

For Replacement function ->

from hmvlib.models import replace_record
replace_record('infile.csv', 'outfile.csv')
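
After running either function, it can be useful to confirm that the output really has no missing values. The following is a small sketch of such a check. `count_missing` is a hypothetical helper, not part of the package, and the tokens it treats as missing are an assumption.

```python
import csv
import io


def count_missing(lines, missing=("", "na", "nan", "null")):
    """Count data cells whose text is one of the assumed missing tokens."""
    reader = csv.reader(lines)
    next(reader, None)  # skip the header row
    return sum(
        cell.strip().lower() in missing
        for row in reader
        for cell in row
    )


# In-memory stand-ins for an input file and a cleaned output file.
before = count_missing(io.StringIO("a,b\n1,\n,4\n"))
after = count_missing(io.StringIO("a,b\n1,4.0\n1.0,4\n"))
print(before, after)
```

With real files, an open file object can be passed instead, for example `count_missing(open('outfile.csv', newline=''))`.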

Feel free to email me with any issues or suggestions.

Download files

Source Distribution

hmvpack-NG-0.0.3.tar.gz (2.2 kB)

Uploaded Source

Built Distribution

hmvpack_NG-0.0.3-py2-none-any.whl (2.9 kB)

Uploaded Python 2