Skip to main content

equiflow is a package to generate equity-focused cohort selection flow diagrams.

Project description

equiflow

Under construction!

equiflow is a package designed to generate "Equity-focused Cohort Selection Flow Diagrams". We hope to facilitate research, increase its reproducibility, and improve the transparency of the process of cohort curation in machine learning studies.

Vision

equiflow will provide tabular and visual representations of inclusion and exclusion criteria applied to a clinical dataset. Each patient exclusion step can depict the cohort composition across demographics and outcomes, to interrogate potential sampling selection bias.

This package is designed to enhance the transparency and reproducibility of research in the medical machine learning field. It complements other tools like tableone, which is used for generating summary statistics for patient populations.

Citation

The concept was first introuced in our position paper.

Ellen JG, Matos J, Viola M, et al. Participant flow diagrams for health equity in AI. J Biomed Inform. 2024;152:104631. https://doi.org/10.1016/j.jbi.2024.104631

Motivation

Selection bias can arise through many aspects of a study, including recruitment, inclusion/exclusion criteria, input-level exclusion and outcome-level exclusion, and often reflects the underrepresentation of populations historically disadvantaged in medical research. The effects of selection bias can be further amplified when non-representative samples are used in artificial intelligence (AI) and machine learning (ML) applications to construct clinical algorithms. Building on the “Data Cards” initiative for transparency in AI research, we advocate for the addition of a participant flow diagram for AI studies detailing relevant sociodemographic and/or clinical characteristics of excluded participants across study phases, with the goal of identifying potential algorithmic biases before their clinical implementation. We include both a model for this flow diagram as well as a brief case study explaining how it could be implemented in practice. Through standardized reporting of participant flow diagrams, we aim to better identify potential inequities embedded in AI applications, facilitating more reliable and equitable clinical algorithms.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

equiflow-0.1.0a2.tar.gz (6.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

equiflow-0.1.0a2-py3-none-any.whl (5.0 kB view details)

Uploaded Python 3

File details

Details for the file equiflow-0.1.0a2.tar.gz.

File metadata

  • Download URL: equiflow-0.1.0a2.tar.gz
  • Upload date:
  • Size: 6.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.0 CPython/3.12.2

File hashes

Hashes for equiflow-0.1.0a2.tar.gz
Algorithm Hash digest
SHA256 5a8dfbd95ea87ad6ff53b25114a66ba8a3f317bac7c7a76a1174be02cb634523
MD5 4a97229ef97b8cc8e0e793237d2bcf82
BLAKE2b-256 9b228adfe9a1d32cb834ec16a1b0b24d32351072d003e62d19878416949cd3b6

See more details on using hashes here.

File details

Details for the file equiflow-0.1.0a2-py3-none-any.whl.

File metadata

  • Download URL: equiflow-0.1.0a2-py3-none-any.whl
  • Upload date:
  • Size: 5.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.0 CPython/3.12.2

File hashes

Hashes for equiflow-0.1.0a2-py3-none-any.whl
Algorithm Hash digest
SHA256 deb7e59f4e04720a163817b029224ae91074ee6f7dbeae741550a523c5ad5ce7
MD5 54854ed4f7924bb7deba4aa08a4996f8
BLAKE2b-256 cff04a296ff76bb890fd957e77b51107924c95116c71d56bf58a3af7328e2c5e

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page