Skip to main content

Python implementation of Checkerboard Copula Regression-based Visualization and Association Measure

Project description

ccrvam

Python implementation of Checkerboard Copula Regression-based Visualization and Association Measure (CCRVAM)

PyPI version build Documentation Status Coverage Status Built with PyPi Template PyPI Downloads

Overview

CCRVAM is a Python package for analyzing associations between categorical variables using Checkerboard Copula Regression. It provides tools for:

  • Construction of CCRVAM objects from three forms of categorical data (case form, frequency form, table form)
  • Calculation of marginal distributions and CDFs of categorical variables
  • Computation of Checkerboard Copula Regression (CCR), its Prediction and Visualization
  • Implementation of Checkerboard Copula Regression Association Measure (CCRAM) and Scaled CCRAM (SCCRAM)
  • Bootstrap functionality for CCR-based prediction, CCRAM and SCCRAM
  • Permutation testing functionality for CCRAM & SCCRAM
  • Vectorized implementations for improved performance
  • Rigorous Edge-case Handling & Unit Testing with Pytest

Authors

  • Dhyey Mavani
  • Daeyoung Kim
  • Shu-Min Liao

Installation

Usage in Jupyter Notebook:

This package (ccrvam) is hosted on PyPi, so for installation add the following line at the top of your Jupyter notebook!

%pip install --upgrade ccrvam

Now, you should be all set to use it in a Jupyter Notebook!

Note: You might need to restart your kernel after installation in order to use the package. This is because of the way Jupyter handles package installations.

Usage in a Software Project Through Terminal (Virtual Environment):

Alternatively, if you would like to use it in a project, we recommend you to have a virtual environment for your use of this package, then follow the following workflow:

  1. First, create and activate a virtual environment (Python 3.10+ recommended):
# Create virtual environment
$ python -m venv ccrvam-env

# Activate virtual environment (Mac/Linux)
$ source ccrvam-env/bin/activate

# Verify you're in the virtual environment
$ which python
  1. Install package
$ pip install ccrvam
  1. To deactivate the virtual environment, when done:
$ deactivate

Documentation Structure

Visit Read the Docs for the full documentation, including overviews and several examples.

Examples

For detailed examples in Jupyter Notebooks and beyond (organized by functionality) please refer to our GitHub repository's examples folder.

Project Structure

This diagram illustrates the main components of the ccrvam package. Understanding this structure can help package users and developers navigate the codebase, locate specific functionalities (like data processing in utils.py or the core CCRVAM logic in genccrvam.py), and understand how different modules interact. The examples/ and tests/ directories provide practical usage scenarios and verification tests, respectively.

graph TD
    A[ccrvam] --> B[checkerboard]
    B --> B1[genccrvam.py<br/>GenericCCRVAM]
    B --> B2[genstatsim.py<br/>Bootstrap/Permutation]
    B --> B3[utils.py<br/>DataProcessor]

    A --> F[docs/]
    B1 --> F
    B2 --> F
    B3 --> F

    A --> D[examples/]
    A --> E[tests/]
    E --> E1[test_genccrvam.py]
    E --> E2[test_genstatsim.py]
    E --> E3[test_utils.py]

User Workflow Overview

This flowchart outlines the typical steps involved when using the ccrvam package, from loading data to performing analyses like calculating CCRAM or generating predictions, and utilizing bootstrap/permutation functionalities. It provides a high-level view of how different functions connect and the expected data flow through the package.

flowchart TD
    A[Raw Data / File Input] --> B[DataProcessor.load_data]
    B --> C[Contingency Table]

    C --> D[GenericCCRVAM.from_contingency_table]
    C --> E[GenericCCRVAM.from_cases]

    D --> F[GenericCCRVAM Instance]
    E --> F

    F --> G[calculate_CCRAM]
    F --> H[get_predictions_ccr]
    F --> I[plot_ccr_predictions]
    F --> J[calculate_ccs]
    F --> K[get_prediction_under_indep]

    C --> L[bootstrap_ccram]
    C --> M[permutation_test_ccram]
    C --> N[bootstrap_predict_ccr_summary]

    L --> O[CustomBootstrapResult]
    M --> P[CustomPermutationResult]
    N --> Q[Predictions Summary]

    Q --> R[plot_prediction_heatmap]
    Q --> S[save_predictions]

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Citation Guide

When using this package in your research, please follow these citation guidelines:

Software Citation

For citing the software package itself, use:

@software{ccrvam2025,
  author = {Mavani, Dhyey and Kim, Daeyoung and Liao, Shu-Min},
  title = {ccrvam: Checkerboard Copula Regression-based Visualization and Association Measure},
  year = {2025},
  publisher = {GitHub},
  url = {https://github.com/DhyeyMavani2003/ccrvam}
}

Method Citations

For citing the underlying methodology, please cite both papers:

@article{wei2021exploratory,
  title={On exploratory analytic method for multi-way contingency tables with an ordinal response variable and categorical explanatory variables},
  author={Wei, Zheng and Kim, Daeyoung},
  journal={Journal of Multivariate Analysis},
  volume={186},
  pages={104793},
  year={2021},
  publisher={Elsevier},
  doi={10.1016/j.jmva.2021.104793}
}

@incollection{liao2024visualization,
  title={Visualization of Dependence in Multidimensional Contingency Tables with an Ordinal Dependent Variable via Copula Regression},
  author={Liao, Shu-Min and Wang, Li and Kim, Daeyoung},
  booktitle={Dependent Data in Social Sciences Research: Forms, Issues, and Methods of Analysis},
  edition={Second},
  editor={Stemmler, Mark and Wiedermann, Wolfgang and Huang, Francis L.},
  publisher={Springer New York LLC},
  pages={517--538},
  year={2024}
}

Citation in Text

When referring to the methodology in your text, you can use:

  • For the original methodology: "Wei and Kim (2021) proposed the Checkerboard Copula Regression..."
  • For the visualization extension: "Liao et al. (2024) extended the methodology to include visualization capabilities..."
1. Zheng Wei and Daeyoung Kim (2021). On exploratory analytic method for multi-way contingency tables with an ordinal response variable and categorical explanatory variables. *Journal of Multivariate Analysis*, 186, 104793. https://doi.org/10.1016/j.jmva.2021.104793

2. Shu-Min Liao, Li Wang, Daeyoung Kim (2024). Visualization of Dependence in Multidimensional Contingency Tables with an Ordinal Dependent Variable via Copula Regression. In *Dependent Data in Social Sciences Research: Forms, Issues, and Methods of Analysis*, Second edition, Mark Stemmler, Wolfgang Wiedermann, and Francis L. Huang, eds. Springer New York LLC, pp. 517-538

Version Information

When citing specific versions of the software, please include the version number and DOI (if available) in your citation.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ccrvam-1.2.3.tar.gz (75.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

ccrvam-1.2.3-py3-none-any.whl (72.3 kB view details)

Uploaded Python 3

File details

Details for the file ccrvam-1.2.3.tar.gz.

File metadata

  • Download URL: ccrvam-1.2.3.tar.gz
  • Upload date:
  • Size: 75.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for ccrvam-1.2.3.tar.gz
Algorithm Hash digest
SHA256 2f8277d36d9bc3352a1dfea2aadad6e334bfb3b9104970f58ce00d5fdacf4f68
MD5 4551b18ea9e41df8f77e615b96629a40
BLAKE2b-256 017acf1835e00aa4f6191f0db57f2525b278f7eb473d25584da59287eb9173a0

See more details on using hashes here.

Provenance

The following attestation bundles were made for ccrvam-1.2.3.tar.gz:

Publisher: publish.yml on DhyeyMavani2003/ccrvam

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file ccrvam-1.2.3-py3-none-any.whl.

File metadata

  • Download URL: ccrvam-1.2.3-py3-none-any.whl
  • Upload date:
  • Size: 72.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for ccrvam-1.2.3-py3-none-any.whl
Algorithm Hash digest
SHA256 bc2d147f18c6d3d20c365beb521ebcf9798032de6ab024889fa43171f05359db
MD5 3c36a09cbc044650b52e88a140e1b12b
BLAKE2b-256 fb5cb96efe7e2cf406401d7fba4ef3949179f9043452ddfe3cf45c1498a557b5

See more details on using hashes here.

Provenance

The following attestation bundles were made for ccrvam-1.2.3-py3-none-any.whl:

Publisher: publish.yml on DhyeyMavani2003/ccrvam

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page