Serialisation utilities for OMOP CDM cohorts and CQI builder

Project description

OA Cohorts – Reporting & Cohort Execution Engine

This package provides the core machinery for defining, executing, and inspecting cohort-based reports over OMOP-style clinical data. It’s designed to support building real-world evidence reports from composable clinical rules, measures, cohorts, and indicators, with both programmatic APIs and lightweight HTML rendering for debugging and exploration.

The framework implemented here supports configuration-driven clinical quality indicators over OMOP-harmonised data, with explicit support for disease and treatment episodes, temporality, and combinatorial logic. Measures can be defined in terms of diagnoses, treatments, procedures, observations, measurements, and demographics, and composed into clinically interpretable cohorts and indicators.

This enables the same indicator definitions to support bulk benchmarking, trend analysis over time, and patient-level drill-down, without rewriting query logic for each use case. In practice, this provides a bridge between formal indicator specifications and the operational reality of multidisciplinary care.

At a high level, the system lets you:

Define query rules (exact, hierarchical, scalar thresholds, phenotypes, etc.)
Combine rules into subqueries
Build measures from subqueries (including composite measures with AND/OR/EXCEPT logic)
Group measures into dash cohorts and cohort definitions
Define indicators (numerator/denominator pairs)
Assemble everything into a report
Execute the report against a database session and materialise results as in-memory member sets
Inspect SQL, executability, and structure via HTML renderers (handy in notebooks)

This is intentionally object-centric: once a report is executed, downstream payloads are assembled from the resolved cohort and indicator member sets, with report-level demography fetched only for the in-scope cohort person_ids.

Diagnosis target semantics

The diagnosis layer now distinguishes between two common clinical questions:

dx_any: any diagnosis row attached to the current disease episode
dx_primary: diagnosis rows restricted to the patient's primary diagnosis episode anchor

In practice, dx_primary is the safer target when a cohort or indicator should be defined by the diagnosis present at the start of the primary episode of care, rather than by later progression or metastatic episodes that may carry related diagnosis coding.

That distinction matters most for report definitions that need to answer questions like "was this a lung primary cohort?" rather than the broader question "did this episode ever carry a lung-related diagnosis code?"

What’s here (roughly)

Report / ReportCohortMap: Top-level report definition, linking cohorts and indicators.
DashCohort / DashCohortDef: User-facing cohort groupings backed by executable measures.
Measure / MeasureSQLCompiler / MeasureExecutor: The core executable units. Measures compile to SQL, execute against a session, and materialise member sets with dating and episode context.
Indicator: Numerator/denominator semantics over measures, including optional indicator-level relative date windows anchored to report cohort membership.
QueryRule (+ subclasses): The rule DSL: exact matches, hierarchies, exclusions, scalar thresholds, phenotypes, substring matches, etc.
HTMLRenderable mixins: Lightweight visualisation of structure, SQL previews, and executability for debugging and exploration.

Execution model

report.execute(session)
report.assert_executed()

rows = report.members(executor)    # all cohort members
indicators = report.indicators     # output rows are built per denominator member within the report cohort

Indicator-relative date windows

Indicators can optionally define dynamic numerator and denominator date windows using:

numerator_max_days_prior
numerator_max_days_post
denominator_max_days_prior
denominator_max_days_post

These windows are evaluated relative to the report cohort membership date, not globally on the reusable measure definition. This keeps measures portable while allowing the same measure to participate in different indicators with different timing requirements.

Execution semantics:

measures still execute broadly and materialise their full MeasureMember sets
indicator row assembly then narrows numerator and denominator rows relative to the in-scope report cohort membership date
when the denominator is the full report cohort (measure_id = 0), filtering is still evaluated per cohort membership row so different in-scope episodes for the same person can qualify differently
if a window is configured and either the anchor date or candidate member date is missing, that candidate does not satisfy the dated comparison

Status

This is a working internal engine under active development. APIs may shift.

Docker

The repo includes a lightweight CLI container under docker/docker-compose.yaml that joins the external cava-network.

Resources

oa-cohorts uses two separate database resources:

dashboard_db — where oa-cohorts stores its report, indicator, and measure configuration. This is the database the CLI commands (import-config, report-summary, etc.) read from and write to. oa-cohorts owns and provisions this resource via omop-config configure oa_cohorts.
cdm_db — the OMOP CDM database used for cohort execution. Typically shared with omop_alchemy or omop_constructs; not provisioned by oa-cohorts directly.

Both resources can share the same physical database server, or be separate databases in production. To share a server but use different schemas, declare two [resources.*] entries that both point to the same [databases.*] entry:

[databases.mydb]
dialect = "postgresql+psycopg2"
host = "localhost"
database_name = "mydb"

[resources.dashboard_db]
database = "mydb"
cdm_schema = "public"        # schema where oa-cohorts stores its config tables

[resources.cdm_db]
database = "mydb"
cdm_schema = "omop"          # OMOP CDM schema
vocab_schema = "omop"
results_schema = "results"

Note: [resource_aliases] maps a semantic name to an existing resource and inherits its full config (including schema). It is not the right tool for "same server, different schema" — use two separate resource entries as above.

The CLI resolves dashboard_db by default (or the oa_cohorts.default_resource configured during setup). --database-url remains available as a per-command override, and ENGINE can be set as a local fallback when no stack config file is present.

Example:

cd docker
docker compose up -d oa-cohorts
docker compose exec oa-cohorts oa-cohorts --help
docker compose exec oa-cohorts oa-cohorts import-config /app/dash_config

When using stack configuration, ensure the dashboard_db host is reachable on cava-network, for example postgresql+psycopg2://user:password@postgres:5432/dbname. For one-off local overrides, pass --database-url or set ENGINE before invoking the command.

Project details

Release history Release notifications | RSS feed

This version

0.8.3

Jun 22, 2026

0.8.2

Jun 22, 2026

0.8.1

Jun 19, 2026

0.8.0

Jun 19, 2026

0.7.6

Jun 17, 2026

0.7.5

May 31, 2026

0.7.4

May 31, 2026

0.7.3

May 31, 2026

0.7.2

May 29, 2026

0.7.1

May 27, 2026

0.7.0

May 20, 2026

0.6.1

May 20, 2026

0.6.0

May 18, 2026

0.5.5

May 14, 2026

0.5.4

May 12, 2026

0.5.3

May 12, 2026

0.5.2

May 11, 2026

0.5.1

May 7, 2026

0.5.0

May 7, 2026

0.4.2

May 6, 2026

0.4.1

May 4, 2026

0.4.0

Apr 1, 2026

0.3.22

Mar 6, 2026

0.3.21

Mar 5, 2026

0.3.20

Mar 5, 2026

0.3.19

Mar 5, 2026

0.3.18

Mar 5, 2026

0.3.16

Mar 5, 2026

0.3.15

Mar 5, 2026

0.3.14

Mar 5, 2026

0.3.13

Mar 5, 2026

0.3.12

Mar 5, 2026

0.3.11

Mar 4, 2026

0.3.10

Mar 4, 2026

0.3.9

Mar 4, 2026

0.3.8

Mar 4, 2026

0.3.7

Mar 4, 2026

0.3.6

Mar 4, 2026

0.3.5

Mar 4, 2026

0.3.4

Mar 4, 2026

0.3.3

Mar 4, 2026

0.3.1

Mar 4, 2026

0.3.0

Mar 4, 2026

0.2.8

Mar 4, 2026

0.2.7

Mar 4, 2026

0.2.6

Feb 26, 2026

0.2.5

Feb 26, 2026

0.2.4

Feb 26, 2026

0.2.3

Feb 26, 2026

0.2.2

Feb 26, 2026

0.2.1

Feb 26, 2026

0.2.0

Feb 26, 2026

0.1.4

Feb 26, 2026

0.1.3

Feb 26, 2026

0.1.2

Feb 25, 2026

0.1.1

Feb 25, 2026

0.1.0

Feb 25, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

oa_cohorts-0.8.3.tar.gz (53.1 kB view details)

Uploaded Jun 22, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

oa_cohorts-0.8.3-py3-none-any.whl (70.3 kB view details)

Uploaded Jun 22, 2026 Python 3

File details

Details for the file oa_cohorts-0.8.3.tar.gz.

File metadata

Download URL: oa_cohorts-0.8.3.tar.gz
Upload date: Jun 22, 2026
Size: 53.1 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for oa_cohorts-0.8.3.tar.gz
Algorithm	Hash digest
SHA256	`d48552f210cc89130cc32b1edf228b6d4d83bfe64a95b29982fcb06eb733ff33`
MD5	`f4f3daad438d9fb021c77a21afb40584`
BLAKE2b-256	`0f61bcdf084946ee84faeedc9332b01f71453a4fd1f8e78fda79836ed05f502e`

See more details on using hashes here.

Provenance

The following attestation bundles were made for oa_cohorts-0.8.3.tar.gz:

Publisher: pypi.yml on AustralianCancerDataNetwork/oa-cohorts

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: oa_cohorts-0.8.3.tar.gz
- Subject digest: d48552f210cc89130cc32b1edf228b6d4d83bfe64a95b29982fcb06eb733ff33
- Sigstore transparency entry: 1907884765
- Sigstore integration time: Jun 22, 2026
Source repository:
- Permalink: AustralianCancerDataNetwork/oa-cohorts@2a56e0e278c8de164097db6d0be9ae40886ab32d
- Branch / Tag: refs/tags/0.8.3
- Owner: https://github.com/AustralianCancerDataNetwork
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: pypi.yml@2a56e0e278c8de164097db6d0be9ae40886ab32d
- Trigger Event: release

File details

Details for the file oa_cohorts-0.8.3-py3-none-any.whl.

File metadata

Download URL: oa_cohorts-0.8.3-py3-none-any.whl
Upload date: Jun 22, 2026
Size: 70.3 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for oa_cohorts-0.8.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`fc981e7b32e01a655ca555c93c1be69716abf4ba3b4d5b2dae17a20aa1c33c6f`
MD5	`1a928f9d12db7f16e8995a305952edc2`
BLAKE2b-256	`3cd27253dc99ec11f4be7992b6600d1b019e6f234194d46721902a259471f107`

See more details on using hashes here.

Provenance

The following attestation bundles were made for oa_cohorts-0.8.3-py3-none-any.whl:

Publisher: pypi.yml on AustralianCancerDataNetwork/oa-cohorts

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: oa_cohorts-0.8.3-py3-none-any.whl
- Subject digest: fc981e7b32e01a655ca555c93c1be69716abf4ba3b4d5b2dae17a20aa1c33c6f
- Sigstore transparency entry: 1907884852
- Sigstore integration time: Jun 22, 2026
Source repository:
- Permalink: AustralianCancerDataNetwork/oa-cohorts@2a56e0e278c8de164097db6d0be9ae40886ab32d
- Branch / Tag: refs/tags/0.8.3
- Owner: https://github.com/AustralianCancerDataNetwork
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: pypi.yml@2a56e0e278c8de164097db6d0be9ae40886ab32d
- Trigger Event: release

oa-cohorts 0.8.3

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

OA Cohorts – Reporting & Cohort Execution Engine

Diagnosis target semantics

What’s here (roughly)

Execution model

Indicator-relative date windows

Status

Docker

Resources

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance