Skip to main content

A python package for removing outliers from a dataset using InterQuartile Range (IQR)

Project description

Outlier Removal Using InterQuartile Range

Project 2 : UCS633

Submitted By: Kshitiz Varshney 101703295


pypi: https://pypi.org/project/outlierRemoval-kvarshney-101703295/


InterQuartile Range (IQR) Description

Any set of data can be described by its five-number summary. These five numbers, which give you the information you need to find patterns and outliers, consist of:

The minimum value of the dataset.
The first quartile Q1, which represents a quarter(25%) of the way through the list of all data.
The median of the data set, which represents the midpoint(50%) of the whole list of data.
The third quartile Q3, which represents three-quarters(75%) of the way through the list of all data.
The maximum or highest value of the data set.

These five values helps us to determine outliers present in our dataset.

Calculation of IQR

IQR = Q3 – Q1
MIN = Q1 - (1.5IQR)
MAX = Q3 + (1.5
IQR)

Installation

Use the package manager pip to install outlierRemoval-kvarshney-101703295.

pip install outlierRemoval-kvarshney-101703295

How to use this package:

outlierRemoval-kvarshney-101703295 can be run as shown below:

In Command Prompt

>> outlierRemoval dataset.csv

Sample dataset

Marks Students
3 Student1
57 Student2
65 Student3
98 Student4
43 Student5
44 Student6
54 Student7
99 Student8

Output Dataset after Removal

Marks Students
57 Student2
65 Student3
98 Student4
43 Student5
44 Student6
54 Student7

It is clearly visible that the rows containing Student1 andStudent8 have been removed due to them being Outliers.

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

outlierRemoval-kvarshney-101703295-1.0.3.tar.gz (4.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

File details

Details for the file outlierRemoval-kvarshney-101703295-1.0.3.tar.gz.

File metadata

  • Download URL: outlierRemoval-kvarshney-101703295-1.0.3.tar.gz
  • Upload date:
  • Size: 4.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/45.0.0 requests-toolbelt/0.9.1 tqdm/4.41.1 CPython/3.8.1

File hashes

Hashes for outlierRemoval-kvarshney-101703295-1.0.3.tar.gz
Algorithm Hash digest
SHA256 94405a40a17bf94b249fd9e168de3f67f27b112c439db4e7f10a4ce9e395dcd6
MD5 c8670c54c1814fea138de96a66ee1aa3
BLAKE2b-256 41ef309a63bf63357e916368d62dc295858e0b05bf7efe3bc7552bd39a891fa3

See more details on using hashes here.

File details

Details for the file outlierRemoval_kvarshney_101703295-1.0.3-py3-none-any.whl.

File metadata

  • Download URL: outlierRemoval_kvarshney_101703295-1.0.3-py3-none-any.whl
  • Upload date:
  • Size: 4.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/45.0.0 requests-toolbelt/0.9.1 tqdm/4.41.1 CPython/3.8.1

File hashes

Hashes for outlierRemoval_kvarshney_101703295-1.0.3-py3-none-any.whl
Algorithm Hash digest
SHA256 20ae57face9e3c9039f20831052c3c3469d7a57902341679417615dfa68c2297
MD5 d8d9895504d0d185e024aa3eeddf475b
BLAKE2b-256 6f4e11902ac280ca4c6ec412eb8d3ea0f054a284bf30cc8055c9f506b05ef76a

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page