Data processing engine for the NVEIL Toolkit. Install via: pip install nveil

Choregraph

The pure-Python data-processing engine behind NVEIL — usable on its own.


Choregraph turns declarative pipeline specifications into executable Kedro data pipelines. It is the data-processing engine inside the NVEIL platform, and it also works as a standalone library: wire inputs and transforms in Python (or portable XML), run them locally, and get DataFrames back. No NVEIL server required.

Install

pip install choregraph

Optional extras:

pip install "choregraph[extra]"    # geo (geopandas), NLP, scikit-learn, Pillow, …
pip install "choregraph[dicom]"    # DICOM medical imaging

Quick start

from choregraph import Choregraph
from choregraph.parser import InputPortSpec

cg = Choregraph(workspace_path="./my_project")
cg.add_input(id="population", location="data/population.csv", format="CSV")

cg.add_node(
    id="large",
    type="filter_greater_than",
    input_ports=[
        InputPortSpec(name="df", source_ref="population"),
        InputPortSpec(name="column", value="population"),
        InputPortSpec(name="value", value="1000000"),
    ],
)

cg.run()
df = cg.get_dataset("large_result")
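
The quick start reads data/population.csv but does not show the file. As a rough sketch, the snippet below writes a small, made-up table with the standard library and then reproduces in plain Python what the "large" node is presumably meant to keep. The city column, the sample figures, the strict ">" comparison, and both helper names are assumptions for illustration; this is not Choregraph's implementation.

```python
import csv
import tempfile
from pathlib import Path


def write_sample_csv(path: Path) -> None:
    """Write a tiny, invented population table (hypothetical columns)."""
    rows = [
        {"city": "Alpha", "population": 2_500_000},
        {"city": "Beta", "population": 1_000_000},
        {"city": "Gamma", "population": 750_000},
    ]
    path.parent.mkdir(parents=True, exist_ok=True)
    with path.open("w", newline="", encoding="utf-8") as fh:
        writer = csv.DictWriter(fh, fieldnames=["city", "population"])
        writer.writeheader()
        writer.writerows(rows)


def keep_rows_greater_than(path: Path, column: str, value: float) -> list[dict]:
    """Assumed semantics: keep rows where row[column] > value (strict)."""
    with path.open(newline="", encoding="utf-8") as fh:
        return [row for row in csv.DictReader(fh) if float(row[column]) > value]


with tempfile.TemporaryDirectory() as tmp:
    csv_path = Path(tmp) / "data" / "population.csv"
    write_sample_csv(csv_path)
    large = keep_rows_greater_than(csv_path, "population", 1_000_000)

# Under the strict comparison assumed here, only "Alpha" remains.
print([row["city"] for row in large])
```

If the real transform uses an inclusive comparison, "Beta" would be kept as well, so a boundary row like this one is a quick way to check the behaviour on your own data.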

Pipelines are portable — save and reload them as XML:

cg.export_to_xml("pipeline.xml")

with Choregraph(xml_spec="pipeline.xml") as cg:
    cg.run()
    df = cg.get_dataset("large_result")
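
To illustrate what "portable" means here, the following sketch round-trips a pipeline description through XML using only the standard library. The element names (pipeline, input, node, port) and the dictionary layout are invented for this example; the actual schema written by export_to_xml is not shown in this README and may look quite different.

```python
import xml.etree.ElementTree as ET

# Hypothetical in-memory spec; not Choregraph's real data model.
spec = {
    "inputs": [
        {"id": "population", "location": "data/population.csv", "format": "CSV"}
    ],
    "nodes": [
        {
            "id": "large",
            "type": "filter_greater_than",
            "ports": [
                {"name": "df", "source_ref": "population"},
                {"name": "column", "value": "population"},
                {"name": "value", "value": "1000000"},
            ],
        }
    ],
}


def to_xml(spec: dict) -> str:
    """Serialize the spec using invented element names."""
    root = ET.Element("pipeline")
    for inp in spec["inputs"]:
        ET.SubElement(root, "input", inp)
    for node in spec["nodes"]:
        el = ET.SubElement(root, "node", {"id": node["id"], "type": node["type"]})
        for port in node["ports"]:
            ET.SubElement(el, "port", port)
    return ET.tostring(root, encoding="unicode")


def from_xml(text: str) -> dict:
    """Rebuild the same dictionary from the XML text."""
    root = ET.fromstring(text)
    return {
        "inputs": [dict(el.attrib) for el in root.findall("input")],
        "nodes": [
            {
                "id": el.get("id"),
                "type": el.get("type"),
                "ports": [dict(p.attrib) for p in el.findall("port")],
            }
            for el in root.findall("node")
        ],
    }


xml_text = to_xml(spec)
restored = from_xml(xml_text)
print(restored == spec)  # the round trip preserves the spec
```

The point of the sketch is only that a spec made of plain identifiers and string values survives serialization unchanged, which is what makes a saved pipeline reloadable elsewhere.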

Full walkthrough → docs.nveil.com.

What's inside

  • Declarative pipelines — define inputs, transforms, and outputs in Python or portable XML, executed as Kedro pipelines.
  • 50+ transforms — filtering, aggregation, joins, pivots, normalization, discretization, row/column operations, …
  • Geo — geocode location names to coordinates; join country-boundary polygons for map visualizations.
  • NLP — multi-label binarization with automatic language detection, lemmatization, and fuzzy matching.
  • Excel intelligence — LLM-assisted detection and tidying of messy multi-table spreadsheets.
  • Biosignals — EDF/EDF+ ingestion (EEG, ECG, polysomnography, …).
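
One way to picture how a declarative specification becomes an executable pipeline: each step names the datasets it reads, and a runner derives a valid execution order from those references. The sketch below does this with the standard library's graphlib. The step names are hypothetical, and this is neither Choregraph's nor Kedro's actual scheduling code.

```python
from graphlib import TopologicalSorter

# Hypothetical spec: each step maps to the set of steps or inputs it reads.
dependencies = {
    "population": set(),              # raw input, no upstream step
    "large": {"population"},          # filter applied to the input
    "by_country": {"large"},          # aggregation of the filtered rows
    "report": {"large", "by_country"},
}

# static_order() yields every step after all of its dependencies.
order = list(TopologicalSorter(dependencies).static_order())
print(order)
```

A cyclic specification (for example, "large" reading from "report") would raise graphlib.CycleError, which is the kind of error a declarative engine can surface before running anything.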

Part of NVEIL

Choregraph is one of the open-source engines of the NVEIL platform. Most people reach it through the NVEIL Toolkit (pip install nveil) or the platform itself — but Choregraph stands on its own for anyone who wants a programmable, XML-portable data-processing engine. Its results feed the NVEIL platform's visualization engine.

Contributing

Contributions are welcome under the project's Contributor License Agreement — signed once, on your first pull request (a license grant, not an assignment; you keep your rights). Bug reports and ideas are welcome via GitHub Issues.

License

Dual-licensed: AGPL-3.0-or-later (see LICENSE) and a commercial license for closed-source / proprietary use (see COMMERCIAL-LICENSE.md) — contact pierre.jacquet@nveil.com.


Download files

Source Distribution

choregraph-1.4.2.tar.gz (5.1 MB)

Uploaded Source

Built Distribution

choregraph-1.4.2-py3-none-any.whl (5.1 MB)

Uploaded Python 3

File details

Details for the file choregraph-1.4.2.tar.gz.

File metadata

  • Download URL: choregraph-1.4.2.tar.gz
  • Size: 5.1 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for choregraph-1.4.2.tar.gz
Algorithm Hash digest
SHA256 ee4a7bb685c544b2c9d8c47320b4346b03643e5ae3dc7a9a20f78c14d151be16
MD5 e6149cef440c10667a91f730bfd0d4ef
BLAKE2b-256 3d4c0a851de8a51399219556182858d16ebf3f9218b608e9238115472efa47fa
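
A downloaded archive can be checked against the SHA256 value listed above. The following is a minimal sketch using hashlib; the helper names and the assumption that the file sits in the current directory are illustrative only.

```python
import hashlib
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Hash the file in chunks so large archives need not fit in memory."""
    digest = hashlib.sha256()
    with path.open("rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path: Path, expected_hex: str) -> bool:
    """Compare the file's digest with a published hex digest."""
    return sha256_of(path) == expected_hex.strip().lower()


# SHA256 copied from the hash table for choregraph-1.4.2.tar.gz.
EXPECTED = "ee4a7bb685c544b2c9d8c47320b4346b03643e5ae3dc7a9a20f78c14d151be16"

# Assumed download location; adjust to wherever the file was saved.
archive = Path("choregraph-1.4.2.tar.gz")
if archive.exists():
    print("hash matches:", matches_published_hash(archive, EXPECTED))
else:
    print("archive not found; download it first")
```

pip can perform the same check at install time when a requirements file pins hashes and is installed with --require-hashes.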

Provenance

The following attestation bundles were made for choregraph-1.4.2.tar.gz:

Publisher: publish.yml on nveil-ai/choregraph

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file choregraph-1.4.2-py3-none-any.whl.

File metadata

  • Download URL: choregraph-1.4.2-py3-none-any.whl
  • Size: 5.1 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for choregraph-1.4.2-py3-none-any.whl
Algorithm Hash digest
SHA256 bf9aac4104f4ff9becd4b51b96a493773488d4fb49f0d562196f3cf239b2ee0c
MD5 0b30c22bb52714f7fc8ff9115598c8bf
BLAKE2b-256 e74e489362899ec343f7eda2c27d43cf110dad02923e90396ff9d5f8b080f3c9

Provenance

The following attestation bundles were made for choregraph-1.4.2-py3-none-any.whl:

Publisher: publish.yml on nveil-ai/choregraph

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.
