Skip to main content

Python implementation of Checkerboard Copula Regression-based Visualization and Association Measure

Project description

ccrvam

Python implementation of Checkerboard Copula Regression-based Visualization and Association Measure (CCRVAM)

PyPI version build Documentation Status Coverage Status Built with PyPi Template PyPI Downloads

Overview

CCRVAM is a Python package for analyzing associations between categorical variables using Checkerboard Copula Regression. It provides tools for:

  • Construction of CCRVAM objects from three forms of categorical data (case form, frequency form, table form)
  • Calculation of marginal distributions and CDFs of categorical variables
  • Computation of Checkerboard Copula Regression (CCR), its Prediction and Visualization
  • Implementation of Checkerboard Copula Regression Association Measure (CCRAM) and Scaled CCRAM (SCCRAM)
  • Bootstrap functionality for CCR-based prediction, CCRAM and SCCRAM
  • Permutation testing functionality for CCRAM & SCCRAM
  • Vectorized implementations for improved performance
  • Rigorous Edge-case Handling & Unit Testing with Pytest

Authors

  • Dhyey Mavani
  • Daeyoung Kim
  • Shu-Min Liao

Installation

Usage in Jupyter Notebook:

This package (ccrvam) is hosted on PyPi, so for installation add the following line at the top of your Jupyter notebook!

%pip install --upgrade ccrvam

Now, you should be all set to use it in a Jupyter Notebook!

Note: You might need to restart your kernel after installation in order to use the package. This is because of the way Jupyter handles package installations.

Usage in a Software Project Through Terminal (Virtual Environment):

Alternatively, if you would like to use it in a project, we recommend you to have a virtual environment for your use of this package, then follow the following workflow:

  1. First, create and activate a virtual environment (Python 3.10+ recommended):
# Create virtual environment
$ python -m venv ccrvam-env

# Activate virtual environment (Mac/Linux)
$ source ccrvam-env/bin/activate

# Verify you're in the virtual environment
$ which python
  1. Install package
$ pip install ccrvam
  1. To deactivate the virtual environment, when done:
$ deactivate

Documentation Structure

Visit Read the Docs for the full documentation, including overviews and several examples.

Examples

For detailed examples in Jupyter Notebooks and beyond (organized by functionality) please refer to our GitHub repository's examples folder.

Project Structure

This diagram illustrates the main components of the ccrvam package. Understanding this structure can help package users and developers navigate the codebase, locate specific functionalities (like data processing in utils.py or the core CCRVAM logic in genccrvam.py), and understand how different modules interact. The examples/ and tests/ directories provide practical usage scenarios and verification tests, respectively.

graph TD
    A[ccrvam] --> B[checkerboard]
    B --> B1[genccrvam.py<br/>GenericCCRVAM]
    B --> B2[genstatsim.py<br/>Bootstrap/Permutation]
    B --> B3[utils.py<br/>DataProcessor]

    A --> F[docs/]
    B1 --> F
    B2 --> F
    B3 --> F

    A --> D[examples/]
    A --> E[tests/]
    E --> E1[test_genccrvam.py]
    E --> E2[test_genstatsim.py]
    E --> E3[test_utils.py]

User Workflow Overview

This flowchart outlines the typical steps involved when using the ccrvam package, from loading data to performing analyses like calculating CCRAM or generating predictions, and utilizing bootstrap/permutation functionalities. It provides a high-level view of how different functions connect and the expected data flow through the package.

flowchart TD
    A[Raw Data / File Input] --> B[DataProcessor.load_data]
    B --> C[Contingency Table]

    C --> D[GenericCCRVAM.from_contingency_table]
    C --> E[GenericCCRVAM.from_cases]

    D --> F[GenericCCRVAM Instance]
    E --> F

    F --> G[calculate_CCRAM]
    F --> H[get_predictions_ccr]
    F --> I[plot_ccr_predictions]
    F --> J[calculate_ccs]
    F --> K[get_prediction_under_indep]

    C --> L[bootstrap_ccram]
    C --> M[permutation_test_ccram]
    C --> N[bootstrap_predict_ccr_summary]

    L --> O[CustomBootstrapResult]
    M --> P[CustomPermutationResult]
    N --> Q[Predictions Summary]

    Q --> R[plot_prediction_heatmap]
    Q --> S[save_predictions]

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Citation Guide

When using this package in your research, please follow these citation guidelines:

Software Citation

For citing the software package itself, use:

@software{ccrvam2025,
  author = {Mavani, Dhyey and Kim, Daeyoung and Liao, Shu-Min},
  title = {ccrvam: Checkerboard Copula Regression-based Visualization and Association Measure},
  year = {2025},
  publisher = {GitHub},
  url = {https://github.com/DhyeyMavani2003/ccrvam}
}

Method Citations

For citing the underlying methodology, please cite both papers:

@article{wei2021exploratory,
  title={On exploratory analytic method for multi-way contingency tables with an ordinal response variable and categorical explanatory variables},
  author={Wei, Zheng and Kim, Daeyoung},
  journal={Journal of Multivariate Analysis},
  volume={186},
  pages={104793},
  year={2021},
  publisher={Elsevier},
  doi={10.1016/j.jmva.2021.104793}
}

@incollection{liao2024visualization,
  title={Visualization of Dependence in Multidimensional Contingency Tables with an Ordinal Dependent Variable via Copula Regression},
  author={Liao, Shu-Min and Wang, Li and Kim, Daeyoung},
  booktitle={Dependent Data in Social Sciences Research: Forms, Issues, and Methods of Analysis},
  edition={Second},
  editor={Stemmler, Mark and Wiedermann, Wolfgang and Huang, Francis L.},
  publisher={Springer New York LLC},
  pages={517--538},
  year={2024}
}

Citation in Text

When referring to the methodology in your text, you can use:

  • For the original methodology: "Wei and Kim (2021) proposed the Checkerboard Copula Regression..."
  • For the visualization extension: "Liao et al. (2024) extended the methodology to include visualization capabilities..."
1. Zheng Wei and Daeyoung Kim (2021). On exploratory analytic method for multi-way contingency tables with an ordinal response variable and categorical explanatory variables. *Journal of Multivariate Analysis*, 186, 104793. https://doi.org/10.1016/j.jmva.2021.104793

2. Shu-Min Liao, Li Wang, Daeyoung Kim (2024). Visualization of Dependence in Multidimensional Contingency Tables with an Ordinal Dependent Variable via Copula Regression. In *Dependent Data in Social Sciences Research: Forms, Issues, and Methods of Analysis*, Second edition, Mark Stemmler, Wolfgang Wiedermann, and Francis L. Huang, eds. Springer New York LLC, pp. 517-538

Version Information

When citing specific versions of the software, please include the version number and DOI (if available) in your citation.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ccrvam-1.2.2.tar.gz (74.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

ccrvam-1.2.2-py3-none-any.whl (71.8 kB view details)

Uploaded Python 3

File details

Details for the file ccrvam-1.2.2.tar.gz.

File metadata

  • Download URL: ccrvam-1.2.2.tar.gz
  • Upload date:
  • Size: 74.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for ccrvam-1.2.2.tar.gz
Algorithm Hash digest
SHA256 3d9a2749cb4064494cdad74c2a6dd3526bf42271edb1d005066dce99cf1fc3fd
MD5 cc78eb598001e2b76fe1adb2a3de421a
BLAKE2b-256 2aae93c806afd271805293c4ea5bf83e416ebf0d5335b0d3fc504d5e14adfbe1

See more details on using hashes here.

Provenance

The following attestation bundles were made for ccrvam-1.2.2.tar.gz:

Publisher: publish.yml on DhyeyMavani2003/ccrvam

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file ccrvam-1.2.2-py3-none-any.whl.

File metadata

  • Download URL: ccrvam-1.2.2-py3-none-any.whl
  • Upload date:
  • Size: 71.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for ccrvam-1.2.2-py3-none-any.whl
Algorithm Hash digest
SHA256 a2d2cdb1fd01b054e1aadffefde5d640a42e108a76c7be0bd2effd03128cafeb
MD5 67250e4fb32aa3a81447155581880a04
BLAKE2b-256 2d8cd9c5598b11256516adfb5dfe035ff5026853482e8c671a0ab514b162150e

See more details on using hashes here.

Provenance

The following attestation bundles were made for ccrvam-1.2.2-py3-none-any.whl:

Publisher: publish.yml on DhyeyMavani2003/ccrvam

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page