Skip to main content

Outlier Removal Using Z-score

Project description

Library for removing outliers from pandas dataframe

PROJECT 2, UCS633 - Data Analysis and Visualization
Navkiran Singh  
COE17
Roll number: 101703365

Takes two inputs - filename of input csv, intended filename of output csv. Third optional argument is threshold, by default it's 1.5. Output is the number of rows removed from the input dataset. Resulting csv is saved as output.csv.

Installation

pip install outliers_navkiran

Recommended - test in a virtual environment.

Use via command line

outliers_navkiran_cli in.csv out.csv

When providing custom threshold:

outliers_navkiran_cli in.csv out.csv 1.5

First argument after outcli is the input csv filename from which the dataset is extracted. The second argument is for storing the final dataset after processing.

Use in .py script

from outliers_navkiran import remove_outliers
remove_outliers('input.csv', 'output.csv',1.5)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

outliers_navkiran-1.0.1.tar.gz (2.3 kB view hashes)

Uploaded Source

Built Distribution

outliers_navkiran-1.0.1-py3-none-any.whl (3.6 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page