Skip to main content

Outlier removal using pandas

Project description

OUTLIER REMOVAL

PROJECT 2, UCS633 - Data Analysis and Visualization
Nikhil Gupta  
COE17
Roll number: 101703371

Output is the number of rows removed from the input dataset. The remaining rows of the dataset are streamed to a new csv file whose name is required as an input.

The no of rows removed: 5

Note the outlier removal is performed using IQR method.

Installation

pip install outlierpack_NG

Note the name has an underscore not a hyphen. If installation gives error or package is not found after installing, install as sudo.

Recommended - test it out in a virtual environment.

To use via command line

outcli myData.csv newData.csv

First argument after outcli is the input csv filename from which the dataset is extracted. The second argument is for storing the final dataset after processing.

To use in .py script

from outlib.models import r_outliers
r_outliers('myData.csv', 'newData.csv')

Can email me for any issues or suggestions

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Files for outlierpack-NG, version 0.0.7
Filename, size File type Python version Upload date Hashes
Filename, size outlierpack_NG-0.0.7-py2-none-any.whl (2.7 kB) File type Wheel Python version py2 Upload date Hashes View
Filename, size outlierpack-NG-0.0.7.tar.gz (2.0 kB) File type Source Python version None Upload date Hashes View

Supported by

Pingdom Pingdom Monitoring Google Google Object Storage and Download Analytics Sentry Sentry Error logging AWS AWS Cloud computing DataDog DataDog Monitoring Fastly Fastly CDN DigiCert DigiCert EV certificate StatusPage StatusPage Status page