Skip to main content

Statistical post-hoc analysis and outlier detection algorithms

Project description

scikit-posthocs is a Python package that provides post hoc tests for pairwise multiple comparisons that are usually performed in statistical data analysis to assess the differences between group levels if a statistically significant result of ANOVA test has been obtained.

scikit-posthocs is tightly integrated with Pandas DataFrames and NumPy arrays to ensure fast computations and convenient data import and storage.

This package will be useful for statisticians, data analysts, and researchers who use Python in their work.

Background

Python statistical ecosystem comprises multiple packages. However, it still has numerous gaps and is surpassed by R packages and capabilities.

SciPy (version 1.2.0) offers Student, Wilcoxon, and Mann-Whitney tests that are not adapted to multiple pairwise comparisons. Statsmodels (version 0.9.0) features TukeyHSD test that needs some extra actions to be fluently integrated into a data analysis pipeline. Statsmodels also has good helper methods: allpairtest (adapts an external function such as scipy.stats.ttest_ind to multiple pairwise comparisons) and multipletests (adjusts p values to minimize type I and II errors). PMCMRplus is a very good R package that has no rivals in Python as it offers more than 40 various tests (including post hoc tests) for factorial and block design data. PMCMRplus was an inspiration and a reference for scikit-posthocs.

scikit-posthocs attempts to improve Python statistical capabilities by offering a lot of parametric and nonparametric post hoc tests along with outliers detection and basic plotting methods.

Features

  • Parametric pairwise multiple comparisons tests:

    • Scheffe test.

    • Student T test.

    • Tamhane T2 test.

    • TukeyHSD test.

  • Non-parametric tests for factorial design:

    • Conover test.

    • Dunn test.

    • Dwass, Steel, Critchlow, and Fligner test.

    • Mann-Whitney test.

    • Nashimoto and Wright (NPM) test.

    • Nemenyi test.

    • van Waerden test.

    • Wilcoxon test.

  • Non-parametric tests for block design:

    • Conover test.

    • Durbin and Conover test.

    • Miller test.

    • Nemenyi test.

    • Quade test.

    • Siegel test.

  • Other tests:

    • Anderson-Darling test.

    • Mack-Wolfe test.

    • Hayter (OSRT) test.

  • Outliers detection tests:

    • Simple test based on interquartile range (IQR).

    • Grubbs test.

    • Tietjen-Moore test.

    • Generalized Extreme Studentized Deviate test (ESD test).

  • Plotting functionality (e.g. significance plots).

All post hoc tests are capable of p adjustments for multiple pairwise comparisons.

Dependencies

Compatibility

Package is compatible with Python 2 and Python 3.

Install

You can install the package using pip :

$ pip install scikit-posthocs

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

scikit_posthocs-0.10.0.tar.gz (37.2 kB view details)

Uploaded Source

Built Distribution

scikit_posthocs-0.10.0-py3-none-any.whl (33.1 kB view details)

Uploaded Python 3

File details

Details for the file scikit_posthocs-0.10.0.tar.gz.

File metadata

  • Download URL: scikit_posthocs-0.10.0.tar.gz
  • Upload date:
  • Size: 37.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.12.7

File hashes

Hashes for scikit_posthocs-0.10.0.tar.gz
Algorithm Hash digest
SHA256 b189055b19dcfd33ab5d2ed94825d870e1ae57499653bef016fd24ad455a51bf
MD5 6fdf36e8747b28c9db1a340d27ffb51b
BLAKE2b-256 ab17dddbf0d576d89cfe28f23654a3aec3645ab079e8394191b3d1b50f5aec14

See more details on using hashes here.

File details

Details for the file scikit_posthocs-0.10.0-py3-none-any.whl.

File metadata

File hashes

Hashes for scikit_posthocs-0.10.0-py3-none-any.whl
Algorithm Hash digest
SHA256 0fa1d4640cf2d1d5a602ed7be3848a4ce54f426f179a7df9ae7a5787f94455c1
MD5 924d317ae16e32d6a8df5230f8f8385a
BLAKE2b-256 ee946fcc320ed6e8025d493aaf92295986d6d4bee2071ebda466bc39cd13d903

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page