A Python pip package to remove outliers from a dataset
Outlier row removal using the interquartile range
Project 2 : UCS633 DATA ANALYSIS AND VISUALIZATION
Submitted by: Yash Saxena 101703627
pypi: https://pypi.org/project/outlier-removal-yash-saxena
git: https://github.com/yashsaxena972/outlier-removal
Interquartile range (IQR) description
Any dataset can be described by its five-number summary. These five numbers are, in ascending order:
The minimum, the lowest value in the dataset.
The first quartile Q1, the value one quarter of the way through the sorted data.
The median, the midpoint of the sorted data.
The third quartile Q3, the value three quarters of the way through the sorted data.
The maximum, the highest value in the dataset.
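As a rough illustration, the five-number summary can be computed with Python's standard library. The sketch below uses the Marks column from the sample dataset shown later in this description. The `five_number_summary` helper is a hypothetical name, not part of this package, and the `inclusive` quantile method is an assumption: other quartile conventions can give slightly different Q1 and Q3 values.

```python
import statistics


def five_number_summary(values):
    """Return (minimum, Q1, median, Q3, maximum) for a list of numbers.

    Hypothetical helper for illustration; not part of the package's API.
    """
    # method="inclusive" interpolates linearly between the sorted data points.
    # This is an assumed convention; other quartile methods exist.
    q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
    return min(values), q1, median, q3, max(values)


marks = [3, 57, 65, 98, 43, 44, 54, 99, 1]
print(five_number_summary(marks))  # (1, 43.0, 54.0, 65.0, 99)
```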
Calculation of acceptable data
IQR = Q3 - Q1
lower = Q1 - (1.5 * IQR)
upper = Q3 + (1.5 * IQR)
Values between lower and upper are considered acceptable. All other values are treated as outliers, and the rows containing them are removed.
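The calculation above can be sketched in a few lines of Python. This is a minimal illustration, not the package's actual source: `iqr_bounds` and `remove_outliers` are hypothetical names, the data values are made up, the quartiles use the standard library's `inclusive` method, and values exactly equal to a bound are assumed to be acceptable.

```python
import statistics


def iqr_bounds(values, k=1.5):
    """Return (lower, upper) using lower = Q1 - k*IQR and upper = Q3 + k*IQR."""
    # Assumed quartile convention: linear interpolation ("inclusive").
    q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


def remove_outliers(values, k=1.5):
    """Keep only the values that lie within the IQR bounds."""
    lower, upper = iqr_bounds(values, k)
    # Assumption: values equal to a bound count as acceptable.
    return [v for v in values if lower <= v <= upper]


data = [10, 12, 13, 15, 16, 18, 100]  # hypothetical values
print(iqr_bounds(data))       # (5.75, 23.75)
print(remove_outliers(data))  # [10, 12, 13, 15, 16, 18]
```

Here Q1 = 12.5 and Q3 = 17, so IQR = 4.5 and only the value 100 falls outside the bounds.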
Installation
Use the pip package manager to install the package.
pip install outlier-removal-yash-saxena
How to use this package:
Run outlier-removal-yash-saxena as shown below:
In Command Prompt
>> outliers <dataset.csv>
Sample dataset
| Marks | Students |
|---|---|
| 3 | S1 |
| 57 | S2 |
| 65 | S3 |
| 98 | S4 |
| 43 | S5 |
| 44 | S6 |
| 54 | S7 |
| 99 | S8 |
| 1 | S9 |
Output dataset after removal
| Marks | Students |
|---|---|
| 57 | S2 |
| 65 | S3 |
| 98 | S4 |
| 43 | S5 |
| 44 | S6 |
| 54 | S7 |
Rows S1, S8, and S9 (marks 3, 99, and 1) have been removed from the dataset.
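The result above can be reproduced with a short standard-library sketch. It is not the package's implementation, and it rests on several assumptions: the CSV is comma-separated with a header row, only the Marks column is checked, quartiles use the `inclusive` method, and values equal to a bound are kept. The name `remove_outlier_rows` is hypothetical. Under these assumptions Q1 = 43 and Q3 = 65, so IQR = 22, lower = 10, and upper = 98.

```python
import csv
import io
import statistics

# The sample dataset from above, written as CSV text (assumed file layout).
SAMPLE_CSV = """Marks,Students
3,S1
57,S2
65,S3
98,S4
43,S5
44,S6
54,S7
99,S8
1,S9
"""


def remove_outlier_rows(rows, column):
    """Drop rows whose value in `column` lies outside the IQR bounds."""
    values = [float(row[column]) for row in rows]
    # Assumed quartile convention: linear interpolation ("inclusive").
    q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    lower, upper = q1 - 1.5 * iqr, q3 + 1.5 * iqr
    # Assumption: values equal to a bound are kept (98 equals upper here).
    return [row for row in rows if lower <= float(row[column]) <= upper]


rows = list(csv.DictReader(io.StringIO(SAMPLE_CSV)))
kept = remove_outlier_rows(rows, "Marks")
print([row["Students"] for row in kept])  # ['S2', 'S3', 'S4', 'S5', 'S6', 'S7']
```

Note that S4 survives only because its mark of 98 sits exactly on the upper bound. A different quartile convention or a strict comparison could change which rows are removed.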