A Python package for removing outliers from a dataset using the InterQuartile Range (IQR)

Outlier Removal Using InterQuartile Range

Project 2: UCS633

Submitted By: Abhishek Sharma - 101703027

InterQuartile Range (IQR) Description

Any set of data can be described by its five-number summary. These five numbers, which give you the information you need to find patterns and outliers, consist of:

The minimum or lowest value of the dataset.
The first quartile Q1, the value below which one quarter of the sorted data falls.
The median of the data set, the midpoint of the sorted data.
The third quartile Q3, the value below which three quarters of the sorted data fall.
The maximum or highest value of the data set.

Together, these five numbers describe the center and spread of the data far more readily than scanning every value at once.
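As a rough illustration, the five-number summary can be computed with Python's standard library. This is a minimal sketch, not the package's own code: the function name `five_number_summary` is hypothetical, and the "inclusive" quartile method is an assumption, since other quartile conventions can give slightly different values for Q1 and Q3.

```python
import statistics


def five_number_summary(values):
    """Return (minimum, Q1, median, Q3, maximum) for a list of numbers.

    Hypothetical helper for illustration only.
    """
    ordered = sorted(values)
    # Assumption: the "inclusive" method (linear interpolation over the
    # sorted data). Other quartile conventions may give different Q1/Q3.
    q1, median, q3 = statistics.quantiles(ordered, n=4, method="inclusive")
    return ordered[0], q1, median, q3, ordered[-1]


# Marks taken from the sample dataset on this page.
marks = [3, 57, 65, 98, 43, 44, 54, 99, 1]
print(five_number_summary(marks))  # (1, 43.0, 54.0, 65.0, 99)
```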

Calculation of IQR

IQR = Q3 - Q1
MIN = Q1 - (1.5 * IQR)
MAX = Q3 + (1.5 * IQR)

Any value below MIN or above MAX is treated as an outlier.
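The calculation can be sketched in a few lines of Python. The helper name `iqr_bounds` and its parameter `k` are hypothetical, and the quartile method ("inclusive") is an assumption; the package itself may compute Q1 and Q3 differently.

```python
import statistics


def iqr_bounds(values, k=1.5):
    """Return (MIN, MAX), the limits beyond which a value counts as an outlier.

    Hypothetical helper for illustration only.
    """
    # Assumption: quartiles use the "inclusive" (linear interpolation) method.
    q1, _median, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


# Marks taken from the sample dataset on this page.
marks = [3, 57, 65, 98, 43, 44, 54, 99, 1]
print(iqr_bounds(marks))  # (10.0, 98.0)
```

With these marks, Q1 is 43 and Q3 is 65 under the assumed method, so IQR is 22 and the limits are 10 and 98.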

Sample dataset

Marks Students
3 Student1
57 Student2
65 Student3
98 Student4
43 Student5
44 Student6
54 Student7
99 Student8
1 Student9

Output Dataset after Removal

Marks Students
57 Student2
65 Student3
98 Student4
43 Student5
44 Student6
54 Student7

The rows for Student1, Student8 and Student9 have been removed because their marks (3, 99 and 1) are outliers.
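The removal step can be reproduced with a short standard-library sketch. It is not the package's implementation or API: the function name `remove_outliers` is hypothetical, and the "inclusive" quartile method is an assumption chosen because it yields the output shown above for this sample.

```python
import statistics


def remove_outliers(rows, k=1.5):
    """Keep only (mark, student) rows whose mark lies within [MIN, MAX].

    Hypothetical helper for illustration only.
    """
    marks = [mark for mark, _student in rows]
    # Assumption: quartiles use the "inclusive" (linear interpolation) method.
    q1, _median, q3 = statistics.quantiles(marks, n=4, method="inclusive")
    iqr = q3 - q1
    lower, upper = q1 - k * iqr, q3 + k * iqr
    # Values exactly on a limit are kept, so a mark equal to MAX survives.
    return [row for row in rows if lower <= row[0] <= upper]


sample = [
    (3, "Student1"), (57, "Student2"), (65, "Student3"),
    (98, "Student4"), (43, "Student5"), (44, "Student6"),
    (54, "Student7"), (99, "Student8"), (1, "Student9"),
]
for mark, student in remove_outliers(sample):
    print(mark, student)
```

Under these assumptions the limits are 10 and 98, so Student4's mark of 98 sits exactly on the upper limit and is kept.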

License

MIT
