Removing outliers from a pandas dataframe

OUTLIER REMOVAL

PROJECT 2, UCS633 - Data Analysis and Visualization
Nikhil Gupta  
COE17
Roll number: 101703371

The program prints the number of rows removed from the input dataset. The remaining rows are written to a new CSV file, whose name is a required input. Example output:

The no of rows removed: 5

Note: outliers are removed using the interquartile range (IQR) method.
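The package's source is not shown here, so the following is only a minimal sketch of how an IQR filter on a pandas dataframe is commonly written, not necessarily how outlierpack_NG implements it. The function name `iqr_filter`, the multiplier `k=1.5`, and the choice to drop a row when any numeric column falls outside its fences are all assumptions made for illustration.

```python
import pandas as pd


def iqr_filter(df, k=1.5):
    """Return (kept_rows, rows_removed) using the IQR rule on numeric columns.

    Hypothetical helper for illustration; k=1.5 is the conventional multiplier.
    """
    numeric = df.select_dtypes(include="number")
    q1 = numeric.quantile(0.25)
    q3 = numeric.quantile(0.75)
    iqr = q3 - q1
    lower = q1 - k * iqr
    upper = q3 + k * iqr
    # Keep a row only if every numeric value lies within its column's fences.
    # Rows with missing numeric values fail the comparison and are dropped too.
    inside = ((numeric >= lower) & (numeric <= upper)).all(axis=1)
    kept = df[inside]
    return kept, len(df) - len(kept)


if __name__ == "__main__":
    data = pd.DataFrame({"value": [10, 12, 11, 13, 12, 11, 95]})
    cleaned, removed = iqr_filter(data)
    print("rows removed:", removed)
```

Non-numeric columns are ignored when computing the fences but are carried through unchanged in the kept rows.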

Installation

pip install outlierpack_NG

Note that the package name uses an underscore, not a hyphen. If installation fails, or the package cannot be found after installing, try installing with sudo.

Recommended: try it out in a virtual environment first.

To use via command line

outcli myData.csv newData.csv

The first argument after outcli is the input CSV file from which the dataset is read. The second argument is the CSV file in which the processed dataset is stored.

To use in a .py script

from outlib.models import r_outliers
r_outliers('myData.csv', 'newData.csv')
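For readers who want to see the CSV-in, CSV-out flow end to end without installing the package, here is a hedged stand-in. `remove_outliers_csv` is a hypothetical function, not the package's `r_outliers`; it assumes the IQR rule with a 1.5 multiplier is applied to every numeric column, and that the return value is the number of rows removed.

```python
import pandas as pd


def remove_outliers_csv(src, dst, k=1.5):
    """Hypothetical stand-in: filter src with the IQR rule, write dst, return rows removed."""
    df = pd.read_csv(src)
    numeric = df.select_dtypes(include="number")
    q1, q3 = numeric.quantile(0.25), numeric.quantile(0.75)
    spread = k * (q3 - q1)
    # True for rows whose numeric values all lie inside [Q1 - k*IQR, Q3 + k*IQR].
    within = ((numeric >= q1 - spread) & (numeric <= q3 + spread)).all(axis=1)
    # Write only the surviving rows, without the dataframe index column.
    df[within].to_csv(dst, index=False)
    return int((~within).sum())
```

Calling `remove_outliers_csv('myData.csv', 'newData.csv')` mirrors the argument order used above: input file first, output file second.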

Email me with any issues or suggestions.

Download files

Source Distribution

outlierpack-NG-0.1.0.tar.gz (2.1 kB)

Uploaded Source

Built Distribution

outlierpack_NG-0.1.0-py2-none-any.whl (2.9 kB)

Uploaded Python 2

File details

Details for the file outlierpack-NG-0.1.0.tar.gz.

File metadata

  • Download URL: outlierpack-NG-0.1.0.tar.gz
  • Upload date:
  • Size: 2.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.15.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/44.0.0 requests-toolbelt/0.9.1 tqdm/4.41.1 CPython/2.7.17

File hashes

Hashes for outlierpack-NG-0.1.0.tar.gz
Algorithm Hash digest
SHA256 fe90d04745a2f2b527d31ee7e38e26866658c3f700dcb2345278a0c7f0e9fc5e
MD5 889ff8f2f0e2cec6e26c9b6a76b6e1a8
BLAKE2b-256 72c63c7291bf85a5a9117c604ca4c2cd53546a95a23ddb3ca65f35931b8c17db

File details

Details for the file outlierpack_NG-0.1.0-py2-none-any.whl.

File metadata

  • Download URL: outlierpack_NG-0.1.0-py2-none-any.whl
  • Upload date:
  • Size: 2.9 kB
  • Tags: Python 2
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.15.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/44.0.0 requests-toolbelt/0.9.1 tqdm/4.41.1 CPython/2.7.17

File hashes

Hashes for outlierpack_NG-0.1.0-py2-none-any.whl
Algorithm Hash digest
SHA256 08b32b203bf2891cd36a1aae09e891b45995b9a877a4a70d1b0d8ad9bb44f434
MD5 150c802be7b50df62646de81d567cbd8
BLAKE2b-256 12b593a2791b30693a9a74dbfbbdae10b6b38429490bd1ffeec5c85b50f72047
