Skip to main content

A python package for removing outliers from a dataset using InterQuartile Range (IQR)

Project description

Outlier Removal Using InterQuartile Range

Project 2 : UCS633

Submitted By: Kunal Jindal 101703299


pypi: https://pypi.org/project/outlierRemoval-kjindal-101703299/


InterQuartile Range (IQR) Description

Any set of data can be described by its five-number summary. These five numbers, which give you the information you need to find patterns and outliers, consist of:

The minimum or lowest value of the dataset.
The first quartile Q1, which represents a quarter of the way through the list of all data.
The median of the data set, which represents the midpoint of the whole list of data.
The third quartile Q3, which represents three-quarters of the way through the list of all data.
The maximum or highest value of the data set.

These five numbers tell a person more about their data than looking at the numbers all at once could, or at least make this much easier.

Calculation of IQR

IQR = Q3 – Q1
MIN = Q1 - (1.5IQR)
MAX = Q3 + (1.5
IQR)

Installation

Use the package manager pip to install outlierRemoval-kjindal-101703299.

pip install outlierRemoval-kjindal-101703299

How to use this package:

outlierRemoval-kjindal-101703299 can be run as shown below:

In Command Prompt

>> outlierRemoval dataset.csv

Sample dataset

Marks Students
3 Student1
57 Student2
65 Student3
98 Student4
43 Student5
44 Student6
54 Student7
99 Student8
1 Student9

Output Dataset after Removal

Marks Students
57 Student2
65 Student3
98 Student4
43 Student5
44 Student6
54 Student7

It is clearly visible that the rows containing Student1, Student8 and Student9 have been removed due to them being Outliers.

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

Built Distribution

File details

Details for the file outlierRemoval-kjindal-101703299-1.0.2.tar.gz.

File metadata

  • Download URL: outlierRemoval-kjindal-101703299-1.0.2.tar.gz
  • Upload date:
  • Size: 3.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/45.0.0 requests-toolbelt/0.9.1 tqdm/4.41.1 CPython/3.8.1

File hashes

Hashes for outlierRemoval-kjindal-101703299-1.0.2.tar.gz
Algorithm Hash digest
SHA256 49fb2c039aad5b082861cca9229e4ce44dc478cf9b4a7e9fe091a36dc2bdbe77
MD5 9ca0ca2a7dd0b027b5bf9e009302902c
BLAKE2b-256 e0c6ad5e9ed48b471edc74e26cf709906b2dc65f76090672ba565688b2b5a7cf

See more details on using hashes here.

File details

Details for the file outlierRemoval_kjindal_101703299-1.0.2-py3-none-any.whl.

File metadata

  • Download URL: outlierRemoval_kjindal_101703299-1.0.2-py3-none-any.whl
  • Upload date:
  • Size: 4.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/45.0.0 requests-toolbelt/0.9.1 tqdm/4.41.1 CPython/3.8.1

File hashes

Hashes for outlierRemoval_kjindal_101703299-1.0.2-py3-none-any.whl
Algorithm Hash digest
SHA256 2ab6166eee1f444e40020580f3b584ad562b6275d4c1a26d668528608acbadc9
MD5 1adf4492a862fe946e5d018ab194add0
BLAKE2b-256 0be399947a693debb455e959c89c6f211ae5e942fe6fa5d3a9cf1e5f3bbd87e3

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page