Skip to main content

Data processing engine for the NVEIL Toolkit. Install via: pip install nveil

Project description

NVEIL

Choregraph

The pure-Python data-processing engine behind NVEIL — usable on its own.

PyPI Python License Docs


Choregraph turns declarative pipeline specifications into executable Kedro data pipelines. It is the data-processing engine inside the NVEIL platform — and a standalone library you can use on its own: wire inputs and transforms in Python (or portable XML), run them locally, and get DataFrames back. No NVEIL server required.

Install

pip install choregraph

Optional extras:

pip install "choregraph[extra]"    # geo (geopandas), NLP, scikit-learn, Pillow, …
pip install "choregraph[dicom]"    # DICOM medical imaging

Quick start

from choregraph import Choregraph
from choregraph.parser import InputPortSpec

cg = Choregraph(workspace_path="./my_project")
cg.add_input(id="population", location="data/population.csv", format="CSV")

cg.add_node(
    id="large",
    type="filter_greater_than",
    input_ports=[
        InputPortSpec(name="df", source_ref="population"),
        InputPortSpec(name="column", value="population"),
        InputPortSpec(name="value", value="1000000"),
    ],
)

cg.run()
df = cg.get_dataset("large_result")

Pipelines are portable — save and reload them as XML:

cg.export_to_xml("pipeline.xml")

with Choregraph(xml_spec="pipeline.xml") as cg:
    cg.run()
    df = cg.get_dataset("large_result")

Full walkthrough → docs.nveil.com.

What's inside

  • Declarative pipelines — define inputs, transforms, and outputs in Python or portable XML, executed as Kedro pipelines.
  • 50+ transforms — filtering, aggregation, joins, pivots, normalization, discretization, row/column operations, …
  • Geo — geocode location names to coordinates; join country-boundary polygons for map visualizations.
  • NLP — multi-label binarization with automatic language detection, lemmatization, and fuzzy matching.
  • Excel intelligence — LLM-assisted detection and tidying of messy multi-table spreadsheets.
  • Biosignals — EDF/EDF+ ingestion (EEG, ECG, polysomnography, …).

Part of NVEIL

Choregraph is one of the open-source engines of the NVEIL platform. Most people reach it through the NVEIL Toolkit (pip install nveil) or the platform itself — but Choregraph stands on its own for anyone who wants a programmable, XML-portable data-processing engine. Its results feed the NVEIL platform's visualization engine.

Contributing

Contributions are welcome under the project's Contributor License Agreement — signed once, on your first pull request (a license grant, not an assignment; you keep your rights). Bug reports and ideas are welcome via GitHub Issues.

License

Dual-licensed: AGPL-3.0-or-later (see LICENSE) and a commercial license for closed-source / proprietary use (see COMMERCIAL-LICENSE.md) — contact pierre.jacquet@nveil.com.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

choregraph-1.4.1.tar.gz (5.1 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

choregraph-1.4.1-py3-none-any.whl (5.1 MB view details)

Uploaded Python 3

File details

Details for the file choregraph-1.4.1.tar.gz.

File metadata

  • Download URL: choregraph-1.4.1.tar.gz
  • Upload date:
  • Size: 5.1 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for choregraph-1.4.1.tar.gz
Algorithm Hash digest
SHA256 2d838824e0c7ee73448183451d420b7f086305beb2420ddc0aa9cd1aa66a7b49
MD5 37bcbe85092da811e37f7a95ba6c31a3
BLAKE2b-256 5668b946df0a332da587fa7ac8c0f5c1a70e4f2668ca78805eb7e61f7f88bb35

See more details on using hashes here.

Provenance

The following attestation bundles were made for choregraph-1.4.1.tar.gz:

Publisher: publish.yml on nveil-ai/choregraph

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file choregraph-1.4.1-py3-none-any.whl.

File metadata

  • Download URL: choregraph-1.4.1-py3-none-any.whl
  • Upload date:
  • Size: 5.1 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for choregraph-1.4.1-py3-none-any.whl
Algorithm Hash digest
SHA256 2bad0af454445e16ab242e14021f96d303810cd8e3dca4e9f59227fce516b06a
MD5 a3865686711dfe530fb973d7dd7ffe15
BLAKE2b-256 61238fbe08c3521b3c8a4509e8e6ade9056e31985d21b290f29238b335c83b65

See more details on using hashes here.

Provenance

The following attestation bundles were made for choregraph-1.4.1-py3-none-any.whl:

Publisher: publish.yml on nveil-ai/choregraph

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page