Skip to main content

A simple hypothesis testing package

Project description

Data Analysis

Why did I write this code?

I'm often handed a dataset and asked to do the same thing - determine if the experimental group is different than the control group, and generate some descriptive statistics along the way. As an alternative to writing all operations manually for each project I've created a Python module using Object Oriented Programming (OOP). This eliminates the need to search for old projects, copy and paste long lines of code and cross my fingers that it works.

What libraries does this code use?

This code uses a Scipy, a widely accepted statistical package, to perform hypothesis testing.

I also use Pandas, an extremely popular data analysis and manipulation library for organizing the data and providing descriptive statistics.

For data visualization I use the Seaborn library (built on top of matplotlib)

Hypothesis Testing

My main task is often to determine if the results are statistically significant. For this we need to implement a hypthesis test, however there are many to choose from. The decision tree of this code looks something like this:

  1. Is the data Normally Distributed and do the experimental and control groups have Equal Variance? Student's t-test
  2. Is the data Normally Distributed but do the experimental and control groups have Unequal Variance? Welches' t-test
  3. Is the data not Normally Distributed? Mann-Whitney U test

Where to Start

See This Notebook for a demo on the Breast Cancer Wisconsin (Diagnostic) Dataset found Here

autohyp

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

autohyp-0.0.2.tar.gz (3.7 kB view details)

Uploaded Source

Built Distribution

autohyp-0.0.2-py3-none-any.whl (4.0 kB view details)

Uploaded Python 3

File details

Details for the file autohyp-0.0.2.tar.gz.

File metadata

  • Download URL: autohyp-0.0.2.tar.gz
  • Upload date:
  • Size: 3.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/32.0 requests/2.27.1 requests-toolbelt/0.9.1 urllib3/1.26.8 tqdm/4.62.3 importlib-metadata/4.11.1 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.9.10

File hashes

Hashes for autohyp-0.0.2.tar.gz
Algorithm Hash digest
SHA256 3e6627e1ba71c4e51f145e1ceccd302dc79cf442351b568641203d3eda98d1e3
MD5 824375ecb06c3ad56fd6ac1bd85b2c79
BLAKE2b-256 ee06f7ecda85809e3b0c32cf443f280ea84a1593ea2d0906385e22e2dcd77236

See more details on using hashes here.

File details

Details for the file autohyp-0.0.2-py3-none-any.whl.

File metadata

  • Download URL: autohyp-0.0.2-py3-none-any.whl
  • Upload date:
  • Size: 4.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/32.0 requests/2.27.1 requests-toolbelt/0.9.1 urllib3/1.26.8 tqdm/4.62.3 importlib-metadata/4.11.1 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.9.10

File hashes

Hashes for autohyp-0.0.2-py3-none-any.whl
Algorithm Hash digest
SHA256 e3b7cf5ea96557ec26c6fdf69ceb1a4e55a89977c3d4ec9d43d6b61cc3a899c7
MD5 ebf59d19585ce61529e392d5fad786b7
BLAKE2b-256 f645b807e9173e49bf4b1899470f0d03084cf31adf35e06f9b62f62b6984abfe

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page