Removing outliers from a pandas dataframe
Project description
OUTLIER REMOVAL
PROJECT 2, UCS633 - Data Analysis and Visualization
Nikhil Gupta
COE17
Roll number: 101703371
The program prints the number of rows removed from the input dataset. The remaining rows are written to a new CSV file whose name you supply as an argument. Example output:
The no of rows removed: 5
Note: outliers are removed using the interquartile range (IQR) method.
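For readers unfamiliar with the IQR method, here is a minimal sketch of how such a filter might look in pandas. It is not taken from this package's source. The function name remove_outliers_iqr, the multiplier k=1.5, and the rule that a row is dropped when any numeric column falls outside its fences are all assumptions made for illustration.

```python
import pandas as pd


def remove_outliers_iqr(df, k=1.5):
    """Illustrative only: return (filtered_df, rows_removed) using IQR fences.

    Assumptions (not confirmed by the package): k = 1.5, only numeric
    columns are checked, and a row is dropped if ANY numeric value is
    outside [Q1 - k*IQR, Q3 + k*IQR] for its column.
    """
    numeric = df.select_dtypes(include="number")
    q1 = numeric.quantile(0.25)
    q3 = numeric.quantile(0.75)
    iqr = q3 - q1
    lower = q1 - k * iqr
    upper = q3 + k * iqr
    # Missing values are treated as "inside" so they do not trigger removal.
    inside = ((numeric >= lower) & (numeric <= upper)) | numeric.isna()
    keep = inside.all(axis=1)
    return df[keep], int((~keep).sum())


# Small made-up dataset: 95 is far outside the rest of column "x".
df = pd.DataFrame(
    {
        "x": [10, 12, 11, 13, 12, 95],
        "y": [1.0, 1.2, 0.9, 1.1, 1.0, 1.05],
    }
)
cleaned, removed = remove_outliers_iqr(df)
print("The no of rows removed:", removed)
```

With this sample data, the fences for x work out to 9.0 and 15.0, so only the row containing 95 is dropped.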
Installation
pip install outlierpack_NG
Note that the name has an underscore, not a hyphen. If installation fails, or the package is not found after installing, try installing with sudo.
Recommended: test it out in a virtual environment.
To use via command line
outcli myData.csv newData.csv
The first argument after outcli is the input CSV file from which the dataset is read. The second argument is the file in which the processed dataset is stored.
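To make the argument order concrete, here is a rough sketch of how a two-argument command like this could be parsed with argparse. It is a hypothetical illustration, not the package's actual entry point. The names build_parser, input_csv, and output_csv are invented for this example.

```python
import argparse


def build_parser():
    """Hypothetical parser mirroring the documented order: input first, output second."""
    parser = argparse.ArgumentParser(
        prog="outcli",
        description="Remove outliers from a CSV file (illustrative sketch).",
    )
    parser.add_argument("input_csv", help="CSV file the dataset is read from")
    parser.add_argument("output_csv", help="CSV file the remaining rows are written to")
    return parser


# Parse the same arguments shown in the command above.
args = build_parser().parse_args(["myData.csv", "newData.csv"])
print(args.input_csv, "->", args.output_csv)
```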
To use in a .py script
from outlib.models import r_outliers
r_outliers('myData.csv', 'newData.csv')
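The call above takes an input path and an output path. As a sketch of what a CSV-to-CSV function of that shape might do, the example below reads a file, filters it, writes the remaining rows, and returns the removed count. The function r_outliers_sketch is a hypothetical stand-in, not the package's r_outliers. The 1.5 multiplier and the numeric-columns-only rule are assumptions.

```python
import os
import tempfile

import pandas as pd


def r_outliers_sketch(in_path, out_path, k=1.5):
    """Hypothetical stand-in (not the package's code): CSV in, filtered CSV out."""
    df = pd.read_csv(in_path)
    num = df.select_dtypes(include="number")
    q1, q3 = num.quantile(0.25), num.quantile(0.75)
    spread = k * (q3 - q1)
    # Keep a row only if every numeric value is within its column's fences.
    within = (num.ge(q1 - spread) & num.le(q3 + spread)) | num.isna()
    keep = within.all(axis=1)
    df[keep].to_csv(out_path, index=False)
    return int((~keep).sum())


# Demo on a throwaway file so nothing on disk is overwritten.
with tempfile.TemporaryDirectory() as tmp:
    src = os.path.join(tmp, "myData.csv")
    dst = os.path.join(tmp, "newData.csv")
    sample = pd.DataFrame(
        {
            "id": ["a", "b", "c", "d", "e", "f"],
            "value": [10, 12, 11, 13, 12, 95],
        }
    )
    sample.to_csv(src, index=False)
    removed = r_outliers_sketch(src, dst)
    remaining = pd.read_csv(dst)

print("The no of rows removed:", removed)
```

Writing with index=False keeps the output file's columns identical to the input's, which is usually what is wanted when the result is fed to another tool.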
Email me with any issues or suggestions.
File details
Details for the file outlierpack-NG-0.1.0.tar.gz.
File metadata
- Download URL: outlierpack-NG-0.1.0.tar.gz
- Upload date:
- Size: 2.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.15.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/44.0.0 requests-toolbelt/0.9.1 tqdm/4.41.1 CPython/2.7.17
File hashes
Algorithm | Hash digest
---|---
SHA256 | fe90d04745a2f2b527d31ee7e38e26866658c3f700dcb2345278a0c7f0e9fc5e
MD5 | 889ff8f2f0e2cec6e26c9b6a76b6e1a8
BLAKE2b-256 | 72c63c7291bf85a5a9117c604ca4c2cd53546a95a23ddb3ca65f35931b8c17db
File details
Details for the file outlierpack_NG-0.1.0-py2-none-any.whl.
File metadata
- Download URL: outlierpack_NG-0.1.0-py2-none-any.whl
- Upload date:
- Size: 2.9 kB
- Tags: Python 2
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.15.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/44.0.0 requests-toolbelt/0.9.1 tqdm/4.41.1 CPython/2.7.17
File hashes
Algorithm | Hash digest
---|---
SHA256 | 08b32b203bf2891cd36a1aae09e891b45995b9a877a4a70d1b0d8ad9bb44f434
MD5 | 150c802be7b50df62646de81d567cbd8
BLAKE2b-256 | 12b593a2791b30693a9a74dbfbbdae10b6b38429490bd1ffeec5c85b50f72047