A Python pip package to remove outliers from the dataset

Outlier row removal using the interquartile range

Project 2: UCS633 Data Analysis and Visualization

Submitted by: Yash Saxena, 101703627


pypi: https://pypi.org/project/outlier-removal-yash-saxena
git: https://github.com/yashsaxena972/outlier-removal


Interquartile range (IQR) description

Any dataset can be described by its five-number summary. In ascending order, these five numbers are:

The minimum, or lowest value of the dataset.
The first quartile Q1, the value a quarter of the way through the sorted data.
The median, the value at the midpoint of the sorted data.
The third quartile Q3, the value three-quarters of the way through the sorted data.
The maximum, or highest value of the dataset.
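
The five-number summary can be sketched with Python's standard library. This is an illustration only, not the package's own code: the function name five_number_summary is hypothetical, and the quartiles are computed with the "inclusive" (linear interpolation) method of statistics.quantiles, which is one of several common quartile conventions and may differ from the one the package uses. The marks are taken from the sample dataset shown later in this README.

```python
from statistics import quantiles


def five_number_summary(values):
    """Return (minimum, Q1, median, Q3, maximum) for at least two values."""
    data = sorted(values)
    # n=4 asks for quartiles; the three cut points are Q1, the median and Q3.
    q1, median, q3 = quantiles(data, n=4, method="inclusive")
    return data[0], q1, median, q3, data[-1]


marks = [3, 57, 65, 98, 43, 44, 54, 99, 1]
print(five_number_summary(marks))  # (1, 43.0, 54.0, 65.0, 99)
```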

Calculation of acceptable data

IQR = Q3 - Q1
lower = Q1 - (1.5 * IQR)
upper = Q3 + (1.5 * IQR)

Values that lie between lower and upper are accepted. All other values are treated as outliers, and the rows containing them are removed.
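
The calculation above might look like the following in Python. This is a minimal sketch under stated assumptions, not the package's implementation: iqr_bounds is a hypothetical name, the multiplier is exposed as a parameter k for illustration, and the quartiles come from statistics.quantiles with method="inclusive" (linear interpolation), since the quartile method used by the package is not stated here.

```python
from statistics import quantiles


def iqr_bounds(values, k=1.5):
    """Return the (lower, upper) limits of the acceptable range."""
    # quantiles() sorts its input; the middle cut point (median) is unused.
    q1, _, q3 = quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


# Marks from the sample dataset in this README.
marks = [3, 57, 65, 98, 43, 44, 54, 99, 1]
lower, upper = iqr_bounds(marks)
print(lower, upper)  # 10.0 98.0
```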

Installation

Use the package manager pip to install outlier-removal-yash-saxena.

pip install outlier-removal-yash-saxena

How to use this package:

outlier-removal-yash-saxena is run from the command line, with the dataset file as its argument:

In Command Prompt

>> outliers <dataset.csv>

Sample dataset

Marks Students
3 S1
57 S2
65 S3
98 S4
43 S5
44 S6
54 S7
99 S8
1 S9

Output dataset after removal

Marks Students
57 S2
65 S3
98 S4
43 S5
44 S6
54 S7

Rows S1 (3 marks), S8 (99 marks) and S9 (1 mark) have been removed from the dataset.
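
The result above can be reproduced with a short standard-library sketch. It is not the package's implementation: the function name remove_outlier_rows, the comma-separated layout of the sample file, and the choice of the Marks column are assumptions made for illustration, as is the linear-interpolation quartile method. Under those assumptions Q1 = 43 and Q3 = 65, so IQR = 22 and the acceptable range runs from 10 to 98.

```python
import csv
import io
from statistics import quantiles

# The sample dataset, assumed here to be stored as comma-separated text.
SAMPLE_CSV = """Marks,Students
3,S1
57,S2
65,S3
98,S4
43,S5
44,S6
54,S7
99,S8
1,S9
"""


def remove_outlier_rows(rows, column, k=1.5):
    """Keep only the rows whose numeric `column` lies within the IQR limits."""
    values = [float(row[column]) for row in rows]
    q1, _, q3 = quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    lower, upper = q1 - k * iqr, q3 + k * iqr
    # Values exactly on a limit are kept (an assumption of this sketch).
    return [row for row in rows if lower <= float(row[column]) <= upper]


rows = list(csv.DictReader(io.StringIO(SAMPLE_CSV)))
kept = remove_outlier_rows(rows, "Marks")
print([row["Students"] for row in kept])  # ['S2', 'S3', 'S4', 'S5', 'S6', 'S7']
```

Keeping values that sit exactly on a limit is what retains S4 (98 marks) in this sketch.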

License

MIT

