Skip to main content

FHIR for Data Science — unified package for FHIRPath, CQL, and SQL-on-FHIR analytics

Project description

FHIR4DS

FHIR for Data Science - A unified suite of high-performance tools for working with FHIR healthcare data in analytical environments.

Overview

FHIR4DS provides high-performance tools for querying and transforming FHIR data. Built on DuckDB for efficient analytical processing, it offers native support for FHIRPath expressions, CQL clinical quality measures, and SQL-on-FHIR v2 ViewDefinitions.

Packages (Unified Namespace)

The project is now organized under the fhir4ds namespace for a clean, professional API.

Feature Import Path Description
FHIRPath fhir4ds.fhirpath Core FHIRPath expression evaluator
FHIRPath (DuckDB) fhir4ds.fhirpath.duckdb DuckDB UDFs and C++ extension for FHIRPath
CQL fhir4ds.cql CQL parser and SQL translator
CQL (DuckDB) fhir4ds.cql.duckdb DuckDB UDFs and translations for CQL
ViewDefinition fhir4ds.viewdef SQL-on-FHIR v2 ViewDefinition to SQL generator
DQM fhir4ds.dqm Digital Quality Measure orchestrator & audit engine

Non-Python Components

  • C++ Extensions: Located in extensions/fhirpath and extensions/cql. These provide native high-performance DuckDB extensions.
  • Web Demos: Located in web/wasm-demo and web/website.

Quick Start

Installation

pip install fhir4ds-v2

# With optional measure evaluation dependencies (numpy, pandas, etc.)
pip install "fhir4ds-v2[measures]"

Unified Connection

The easiest way to get started is the create_connection helper, which returns a DuckDB connection with all FHIRPath and CQL UDFs pre-registered.

import fhir4ds

con = fhir4ds.create_connection()

# Load FHIR data
con.execute("CREATE TABLE resources AS SELECT * FROM 'data/*.parquet'")

FHIRPath Queries

result = con.execute("""
    SELECT
        fhirpath_text(resource, 'Patient.id') AS patient_id,
        fhirpath_text(resource, 'Patient.name.given[0]') AS given_name
    FROM resources
    WHERE resourceType = 'Patient'
""").fetchdf()

CQL Measure Evaluation

result = fhir4ds.evaluate_measure(
    library_path="./CMS165.cql",
    conn=con,
    output_columns={
        "ip": "Initial Population",
        "denom": "Denominator",
        "numer": "Numerator",
    }
)
print(result.df())

SQL-on-FHIR ViewDefinitions

import json

view_definition = {
    "resource": "Patient",
    "select": [{"column": [
        {"path": "id", "name": "patient_id"},
        {"path": "gender", "name": "gender"},
    ]}]
}

sql = fhir4ds.generate_view_sql(json.dumps(view_definition))
patients_flat = con.execute(sql).df()

Standalone FHIRPath

from fhir4ds.fhirpath import evaluate

patient = {
    "resourceType": "Patient",
    "id": "123",
    "name": [{"given": ["John"], "family": "Doe"}]
}

result = evaluate(patient, "Patient.name.given")
# result: ["John"]

Test Coverage & Compliance

The unified engine is rigorously tested against official specifications.

Component Unit Tests Official Compliance
fhir4ds.fhirpath ✅ Passing 934/935 FHIRPath R4 (99.9%)
fhir4ds.cql ✅ Passing 3044/3044 CQL Parsing (100%)
fhir4ds.viewdef ✅ Passing 384/384 ViewDefinition v2 (100%)
fhir4ds.dqm ✅ Passing N/A

Architecture

The project follows a Feature-First hierarchy:

  • fhir4ds/: Unified Python package root.
    • fhirpath/: Core FHIRPath engine.
      • duckdb/: DuckDB-specific FHIRPath integration.
    • cql/: Core CQL-to-SQL translator.
      • duckdb/: DuckDB-specific CQL integration.
    • viewdef/: SQL-on-FHIR v2 implementation.
    • dqm/: Digital Quality Measure orchestration.
  • extensions/: High-performance C++ source for DuckDB extensions.
  • web/: Documentation website and interactive WASM demos.

Development

Setup

# Clone the repository
git clone https://github.com/fhir4ds/fhir4ds.git
cd fhir4ds

# Install in editable mode with development dependencies
pip install -e ".[dev]"

Running Tests

# Run all Python tests
pytest fhir4ds/

# Run specific subpackage tests
pytest fhir4ds/fhirpath/tests/unit/
pytest fhir4ds/dqm/tests/

# Run official spec conformance suites
python3 conformance/scripts/run_all.py

Benchmarks

Performance benchmarking tools and results are located in the benchmarks/ directory. See benchmarks/AGENTS.md for details.

License

This project is dual-licensed:

  1. Open Source: GNU Affero General Public License v3 (AGPL-3.0). See LICENSE for details.
  2. Commercial: A proprietary license for enterprise use, embedding in closed-source products, and high-performance C++ extensions.

For commercial licensing inquiries, please contact fhir4ds@gmail.com.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

fhir4ds_v2-0.0.2.tar.gz (18.1 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

fhir4ds_v2-0.0.2-py3-none-any.whl (18.5 MB view details)

Uploaded Python 3

File details

Details for the file fhir4ds_v2-0.0.2.tar.gz.

File metadata

  • Download URL: fhir4ds_v2-0.0.2.tar.gz
  • Upload date:
  • Size: 18.1 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: Hatch/1.16.5 cpython/3.10.12 HTTPX/0.28.1

File hashes

Hashes for fhir4ds_v2-0.0.2.tar.gz
Algorithm Hash digest
SHA256 9adf431b63b4715801732bb017d9cdf5f1efa679ccabfaf9c2212a7d0a96611e
MD5 90e1ecfe3e14ec9b7f4cdd471b6a3558
BLAKE2b-256 0423f20aea7dd0f2faf22d7e210dba90a489711c5906072230e0aa929b943226

See more details on using hashes here.

File details

Details for the file fhir4ds_v2-0.0.2-py3-none-any.whl.

File metadata

  • Download URL: fhir4ds_v2-0.0.2-py3-none-any.whl
  • Upload date:
  • Size: 18.5 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: Hatch/1.16.5 cpython/3.10.12 HTTPX/0.28.1

File hashes

Hashes for fhir4ds_v2-0.0.2-py3-none-any.whl
Algorithm Hash digest
SHA256 8c8a8471f5b439acdca773fcf1f1bc45125ad462531daf4bf83483d76bc0f8af
MD5 cae9c307383579e0decad1034d804751
BLAKE2b-256 c72e2cd44dc70e5b6d53752d9d972f43808211bc3fc0536c6b17875886e1fc46

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page