A python package for removing outliers from a dataset using InterQuartile Range (IQR)
Project description
Outlier Removal Using InterQuartile Range
Project 2 : UCS633 Submitted By: Kshitiz Varshney 101703295 *** pypi: https://pypi.org/project/outlierRemoval-kvarshney-101703295/ ***
InterQuartile Range (IQR) Description Any set of data can be described by its five-number summary.
These five numbers, which give you the information you need to find patterns and outliers, consist of: The minimum value of the dataset.
The first quartile Q1, which represents a quarter(25%) of the way through the list of all data.
The median of the data set, which represents the midpoint(50%) of the whole list of data.
The third quartile Q3, which represents three-quarters(75%) of the way through the list of all data.
The maximum value of the data set.
These five values helps in determining exceptional(outliers) elements present in our dataset. ## Calculation of IQR IQR = Q3 – Q1
MIN = Q1 - (1.5IQR)
MAX = Q3 + (1.5IQR)
## Installation Use the package manager pip to install outlierRemoval-kvarshney-101703295. bash pip install outlierRemoval-kvarshney-101703295
## How to use this package: outlierRemoval-kvarshney-101703295 can be run as shown below: ### In Command Prompt >> outlierRemoval dataset.csv
## Sample dataset Marks | Students :------------: | :-------------: 3 | Student 1 57 | Student 2 65 | Student 3 98 | Student 4 43 | Student 5 44 | Student 6 54 | Student 7 1 | Student 8
## Output Dataset after Removal Marks | Students :------------: | :-------------: 57 | Student2 65 | Student3 98 | Student4 43 | Student5 44 | Student6 54 | Student7
It is clearly visible that the rows containing Student1 and Student8 have been removed due to them being Outliers. ## License MIT
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for outlierRemoval-kvarshney-101703295-1.0.2.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | 8ae2785e02f3f7b1f8eee6c26e6b268a3fa08c561cf6b0659e9930cc20ce34af |
|
MD5 | cb889afcfd48d14b322d3e11f48747f8 |
|
BLAKE2b-256 | bc820f180b001833b6ff281455ec8b41a1eda8ce57b40547642bfdc1e3d5d38f |
Hashes for outlierRemoval_kvarshney_101703295-1.0.2-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 26964eb4cef624714fa58370ccb7beb4955fb6cb5e90deb8a95fd8cb27a0c4e2 |
|
MD5 | c5a4f8a23cbc7a43376e38d414b55b97 |
|
BLAKE2b-256 | 9c63f60b8fe8173387b68e50dccfe4b99111aa3445e8852b05954504068d6adc |