Skip to main content

Coefficient of Variation (CV) and Coefficient of Quartile Variation (CQV) with Confidence Intervals (CI)

Project description

pycvcqv

PyPI Python Version Build status coverage report Downloads "Buy Me A Coffee" static analysis dependencies vulnerabilities maintainability complexity lint report docstring Code style: black Security: bandit Pre-commit License

Find homogeneity with confidence.

Python port of cvcqv

Introduction

pycvcqv provides versatile functions to quantify homogeneity with confidence intervals. It offers a variety of well-established methods from the literature (Kelley, McKay, Miller, Vangel, Mahmoudvand-Hassani, Equal-Tailed, Shortest-Length, Normal Approximation, Bonett, and the Abu-Shawiesh-Akyuz-Kibria adjusted-degrees-of-freedom, large-sample, and augmented-large-sample CIs) and bootstrap resampling techniques (Normal, Basic, Percentile, BCa) for constructing confidence intervals on the Coefficient of Variation (cv) and the Coefficient of Quartile Variation (cqv).

Coefficient of Variation

cv is a measure of relative dispersion representing the degree of variability relative to the mean (Albatineh et al., 2014). Since cv is unitless, it is useful for comparing variables that have different units. It is also a measure of homogeneity (Albatineh et al., 2014).

Coefficient of Quartile Variation

cqv is a measure of relative dispersion based on the interquartile range (IQR). Since cqv is unitless, it is also useful for comparing variables that have different units. It is also a measure of homogeneity (Bonett, 2006; Altunkaynak, 2018).

Install

pip install pycvcqv

Usage

import pandas as pd
from pycvcqv import coefficient_of_variation, cqv

coefficient_of_variation(
    data=[
        0.2, 0.5, 1.1, 1.4, 1.8, 2.3, 2.5, 2.7, 3.5, 4.4,
        4.6, 5.4, 5.4, 5.7, 5.8, 5.9, 6.0, 6.6, 7.1, 7.9
    ],
    multiplier=100,
    ndigits=2
)
# {'cv': 57.77, 'lower': 41.43, 'upper': 98.38}
cqv(
    data=[0.2, 0.5, 1.1, 1.4, 1.8, 2.3, 2.5, 2.7, 3.5, 4.4, 4.6, 5.4, 5.4],
    multiplier=100,
)
# 51.7241
data = pd.DataFrame(
    {
        "col-1": pd.Series([0.2, 0.5, 1.1, 1.4, 1.8, 2.3, 2.5, 2.7, 3.5]),
        "col-2": pd.Series([5.4, 5.4, 5.7, 5.8, 5.9, 6.0, 6.6, 7.1, 7.9]),
    }
)
coefficient_of_variation(data=data, num_threads=3)
#   columns      cv      lower      upper
# 0   col-1  0.6076     0.3770     1.6667
# 1   col-2  0.1359     0.0913     0.2651
cqv(data=data, num_threads=-1)
#   columns      cqv
# 0   col-1  0.3889
# 1   col-2  0.0732

Confidence-interval methods for cv

coefficient_of_variation accepts a method argument that selects the confidence-interval estimator.

from pycvcqv import coefficient_of_variation

x = [
    0.2, 0.5, 1.1, 1.4, 1.8, 2.3, 2.5, 2.7, 3.5, 4.4,
    4.6, 5.4, 5.4, 5.7, 5.8, 5.9, 6.0, 6.6, 7.1, 7.9,
]

for method in (
    "kelley", "mckay", "miller", "vangel",
    "mahmoudvand_hassani", "equal_tailed",
    "shortest_length", "normal_approximation",
    "aak_adj", "aak_ls", "aak_als",
    "norm", "basic", "perc", "bca",
):
    print(method, coefficient_of_variation(
        data=x,
        method=method,
        multiplier=100,
        ndigits=3,
        num_replicates=10000,
        random_state=42,
    ))

Output (95% CI, multiplier=100, ndigits=3, bootstrap methods use num_replicates=10000, random_state=42):

method est lower upper description
kelley 57.774 41.303 97.950 cv with Kelley 95% CI
mckay 57.774 41.441 108.483 cv with McKay 95% CI
miller 57.774 34.053 81.495 cv with Miller 95% CI
vangel 57.774 40.955 103.931 cv with Vangel 95% CI
mahmoudvand_hassani 57.774 43.476 82.857 cv with Mahmoudvand-Hassani 95% CI
equal_tailed 57.774 43.937 84.383 cv with Equal-Tailed 95% CI
shortest_length 57.774 42.015 81.013 cv with Shortest-Length 95% CI
normal_approximation 57.774 44.533 85.272 cv with Normal Approximation 95% CI
aak_adj 57.774 48.029 72.516 cv with Abu-Shawiesh-Akyuz-Kibria Adjusted-DoF 95% CI
aak_ls 57.774 46.310 72.075 cv with Abu-Shawiesh-Akyuz-Kibria Large-Sample 95% CI
aak_als 57.774 45.839 75.092 cv with Abu-Shawiesh-Akyuz-Kibria Augmented-LS 95% CI
norm 57.774 38.850 78.379 cv with Normal Approximation Bootstrap 95% CI
basic 57.774 37.716 77.166 cv with Basic Bootstrap 95% CI
perc 57.774 38.382 77.832 cv with Bootstrap Percentile 95% CI
bca 57.774 41.556 83.032 cv with Adjusted Bootstrap Percentile (BCa) 95% CI

Confidence-interval methods for cqv

cqv accepts a method argument that selects the confidence-interval estimator. When method is omitted only the point estimate is returned (the legacy behavior).

from pycvcqv import cqv

x = [
    0.2, 0.5, 1.1, 1.4, 1.8, 2.3, 2.5, 2.7, 3.5, 4.4,
    4.6, 5.4, 5.4, 5.7, 5.8, 5.9, 6.0, 6.6, 7.1, 7.9,
]

for method in ("bonett", "norm", "basic", "perc", "bca"):
    print(method, cqv(
        data=x,
        method=method,
        multiplier=100,
        ndigits=3,
        num_replicates=10000,
        random_state=42,
    ))

Output (95% CI, multiplier=100, ndigits=3, bootstrap methods use num_replicates=10000, random_state=42):

method est lower upper description
bonett 45.625 24.785 77.329 cqv with Bonett 95% CI
norm 45.625 19.937 70.403 cqv with Normal Approximation Bootstrap 95% CI
basic 45.625 21.081 73.923 cqv with Basic Bootstrap 95% CI
perc 45.625 17.327 70.169 cqv with Bootstrap Percentile 95% CI
bca 45.625 22.006 76.331 cqv with Adjusted Bootstrap Percentile (BCa) 95% CI

Credits

🚀 Your next Python package needs a bleeding-edge project structure. This project was generated with python-package-template

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pycvcqv-0.7.0.tar.gz (45.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pycvcqv-0.7.0-py3-none-any.whl (78.6 kB view details)

Uploaded Python 3

File details

Details for the file pycvcqv-0.7.0.tar.gz.

File metadata

  • Download URL: pycvcqv-0.7.0.tar.gz
  • Upload date:
  • Size: 45.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.2

File hashes

Hashes for pycvcqv-0.7.0.tar.gz
Algorithm Hash digest
SHA256 b2efed246490987ec681344ae3da27f51bea601ae6404c37c5b5b2180b1be990
MD5 dc038d601ad7d3776f9e009518061815
BLAKE2b-256 1917419d16b9fbca2860cf259259eb2c03385fc7edc87470325ef996a5ced90d

See more details on using hashes here.

File details

Details for the file pycvcqv-0.7.0-py3-none-any.whl.

File metadata

  • Download URL: pycvcqv-0.7.0-py3-none-any.whl
  • Upload date:
  • Size: 78.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.2

File hashes

Hashes for pycvcqv-0.7.0-py3-none-any.whl
Algorithm Hash digest
SHA256 55b3dab302594b658c0dacd53a1dd4997a52e4abc8b4b87e7d3288284d7196d4
MD5 25ba882569284238e85f3fef7c9fe521
BLAKE2b-256 a60ea2ad30a046ffff695d9b071753ba0fea9be57158a90b21236602c7b52bf5

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page