Skip to main content

A Python library for detecting outliers at the end of the distribution

Project description

End of Distribution

end_of_distribution is a Python library designed for statistical outlier detection, focusing on identifying values at the "end of the distribution" in datasets. It provides methods for detecting outliers using both Z-score and Interquartile Range (IQR), commonly used techniques in data preprocessing and analysis.

Features

  • Z-Score Outliers: Identify outliers based on their distance from the mean.
  • IQR Outliers: Detects outliers based on the interquartile range, robust against extreme values.

Installation

After cloning the repository:

git clone https://github.com/adityayadav0111/end_of_distribution.git
cd end_of_distribution

To install directly if available on PyPI:

pip install end_of_distribution

Usage

Import the functions from end_of_distribution to detect outliers in your data.

from lib.end_of_distribution import z_score_outliers, iqr_outliers

# Example data
data = [1, 2, 3, 4, 100, 5]

# Detect outliers with Z-score
z_outliers = z_score_outliers(data, threshold=3)
print("Z-score Outliers:", z_outliers)

# Detect outliers with IQR
iqr_outliers = iqr_outliers(data)
print("IQR Outliers:", iqr_outliers)

API Reference

  • z_score_outliers(data, threshold=3)

    • Parameters:
      • data (list): List of numerical values to check for outliers.
      • threshold (float): Z-score threshold for identifying outliers. Default is 3.
    • Returns: A list with True/False for outliers in the dataset.
  • iqr_outliers(data)

    • Parameters:
      • data (list): List of numerical values to check for outliers.
    • Returns: A list with True/False indicating outliers.

Contributing

We welcome contributions! Here’s how to get started:

  1. Fork the repository and clone it locally.
  2. Create a new branch for your contribution.
  3. Make your changes and commit.
  4. Push your changes and open a pull request.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

end_of_distribution-0.1.0.tar.gz (3.4 kB view details)

Uploaded Source

Built Distribution

end_of_distribution-0.1.0-py3-none-any.whl (3.9 kB view details)

Uploaded Python 3

File details

Details for the file end_of_distribution-0.1.0.tar.gz.

File metadata

  • Download URL: end_of_distribution-0.1.0.tar.gz
  • Upload date:
  • Size: 3.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.8.10

File hashes

Hashes for end_of_distribution-0.1.0.tar.gz
Algorithm Hash digest
SHA256 824d300eebee3d0c937df76be14e22654526467688cf6f09d602759fcc4c6188
MD5 8ab3a9be32e4d7b2a2af063451479f91
BLAKE2b-256 115227cb6c83d118eb0ce1e593310ee78ab39e6b05316fae59a13811c963c7ba

See more details on using hashes here.

File details

Details for the file end_of_distribution-0.1.0-py3-none-any.whl.

File metadata

File hashes

Hashes for end_of_distribution-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 86740ac00f7b163e7f886f9f8039bac8511a4b06981b7355a4a2ded83538c285
MD5 3d5f5812beaeb6b3afd5af41ac859a29
BLAKE2b-256 30b3e3b2b528140b6bd986e9fe90fef06f2e1470e9714e9ffd513e349e568b2c

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page