A python package for removing outliers from a dataset using InterQuartile Range (IQR)
Project description
Outlier Removal Using InterQuartile Range
Project 2 : UCS633
Submitted By: Kshitiz Varshney 101703295
pypi: https://pypi.org/project/outlierRemoval-kvarshney-101703295/
InterQuartile Range (IQR) Description
Any set of data can be described by its five-number summary. These five numbers, which give you the information you need to find patterns and outliers, consist of:
The minimum value of the dataset.
The first quartile Q1, which represents a quarter(25%) of the way through the list of all data.
The median of the data set, which represents the midpoint(50%) of the whole list of data.
The third quartile Q3, which represents three-quarters(75%) of the way through the list of all data.
The maximum or highest value of the data set.
These five values helps us to determine outliers present in our dataset.
Calculation of IQR
IQR = Q3 – Q1
MIN = Q1 - (1.5IQR)
MAX = Q3 + (1.5IQR)
Installation
Use the package manager pip to install outlierRemoval-kvarshney-101703295.
pip install outlierRemoval-kvarshney-101703295
How to use this package:
outlierRemoval-kvarshney-101703295 can be run as shown below:
In Command Prompt
>> outlierRemoval dataset.csv
Sample dataset
| Marks | Students |
|---|---|
| 3 | Student1 |
| 57 | Student2 |
| 65 | Student3 |
| 98 | Student4 |
| 43 | Student5 |
| 44 | Student6 |
| 54 | Student7 |
| 99 | Student8 |
Output Dataset after Removal
| Marks | Students |
|---|---|
| 57 | Student2 |
| 65 | Student3 |
| 98 | Student4 |
| 43 | Student5 |
| 44 | Student6 |
| 54 | Student7 |
It is clearly visible that the rows containing Student1 andStudent8 have been removed due to them being Outliers.
License
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file outlierRemoval-kvarshney-101703295-1.0.3.tar.gz.
File metadata
- Download URL: outlierRemoval-kvarshney-101703295-1.0.3.tar.gz
- Upload date:
- Size: 4.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/45.0.0 requests-toolbelt/0.9.1 tqdm/4.41.1 CPython/3.8.1
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
94405a40a17bf94b249fd9e168de3f67f27b112c439db4e7f10a4ce9e395dcd6
|
|
| MD5 |
c8670c54c1814fea138de96a66ee1aa3
|
|
| BLAKE2b-256 |
41ef309a63bf63357e916368d62dc295858e0b05bf7efe3bc7552bd39a891fa3
|
File details
Details for the file outlierRemoval_kvarshney_101703295-1.0.3-py3-none-any.whl.
File metadata
- Download URL: outlierRemoval_kvarshney_101703295-1.0.3-py3-none-any.whl
- Upload date:
- Size: 4.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/45.0.0 requests-toolbelt/0.9.1 tqdm/4.41.1 CPython/3.8.1
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
20ae57face9e3c9039f20831052c3c3469d7a57902341679417615dfa68c2297
|
|
| MD5 |
d8d9895504d0d185e024aa3eeddf475b
|
|
| BLAKE2b-256 |
6f4e11902ac280ca4c6ec412eb8d3ea0f054a284bf30cc8055c9f506b05ef76a
|