Handling missing values in python
Project description
HANDLING MISSING DATA
PROJECT 3, UCS633 - Data Analysis and Visualization
Nikhil Gupta
COE17
Roll number: 101703371
Output is the dataset which contains no missing values and this dataset is streamed to a new csv file whose name is provided by the user.
Installation
pip install HMVpack_NG
Note the name has an underscore not a hyphen. If installation gives error or package is not found after installing, install as sudo.
Recommended - test it out in a virtual environment.
To use via command line
The package contains two functions i.e there are two ways of handling missing data. First two arguments are same for accessing both functions.
- Deleting the row with missing values.
HMVcli infile.csv outfile.csv D
- Replacing the missing values by mean of the values of that particular feature
HMVcli infile.csv outfile.csv R
To use in .py script
For Deletion function ->
from hmvlib.models import delete_record
delete_record('infile.csv', 'oufile.csv')
For Replacement function ->
from hmvlib.models import replace_record
replace_record('infile.csv', 'oufile.csv')
Can email me for any issues or suggestions
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for hmvpack_NG-0.0.1-py2-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 11df3549a9b8079d563757db606ddda62f7e86ec64908f82580908fe3a6a8348 |
|
MD5 | 2749385035671ad933d391aae743dde3 |
|
BLAKE2b-256 | f745a7ef3584031a75a0ba62b7c6344461b364278b5c9f0cb8bb126175014399 |