Data processing engine for the NVEIL Toolkit. Install via: pip install nveil
Choregraph
The pure-Python data-processing engine behind NVEIL — usable on its own.
Choregraph turns declarative pipeline specifications into executable Kedro data pipelines. It is the data-processing engine inside the NVEIL platform, and it also works as a standalone library: wire inputs and transforms in Python (or portable XML), run them locally, and get DataFrames back. No NVEIL server required.
Install
pip install choregraph
Optional extras:
pip install "choregraph[extra]" # geo (geopandas), NLP, scikit-learn, Pillow, …
pip install "choregraph[dicom]" # DICOM medical imaging
Quick start
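from choregraph import Choregraph
from choregraph.parser import InputPortSpec

# Create a pipeline rooted at a local workspace directory.
cg = Choregraph(workspace_path="./my_project")

# Register a CSV file as the input dataset "population".
cg.add_input(id="population", location="data/population.csv", format="CSV")

# Add a filter node: the "df" port reads the "population" input,
# while "column" and "value" are passed as literal parameters.
cg.add_node(
    id="large",
    type="filter_greater_than",
    input_ports=[
        InputPortSpec(name="df", source_ref="population"),
        InputPortSpec(name="column", value="population"),
        InputPortSpec(name="value", value="1000000"),
    ],
)

# Execute the pipeline, then read the filter node's output back.
cg.run()
df = cg.get_dataset("large_result")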
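To make the quick start's intent concrete, here is a minimal standard-library sketch of what a "greater than" filter presumably computes. It is not Choregraph's implementation: the function body, the strict comparison, the numeric coercion of the string value, and the sample data are all assumptions made for illustration.

```python
import csv
import io

# Hypothetical sample standing in for data/population.csv.
SAMPLE_CSV = """city,population
Lyon,522000
Paris,2103000
Marseille,873000
Tokyo,13960000
"""


def filter_greater_than(rows, column, value):
    """Keep rows whose `column`, read as a number, is strictly greater than `value`.

    Assumption: both sides are coerced to float and compared strictly;
    the real transform may treat types and equality differently.
    """
    threshold = float(value)
    return [row for row in rows if float(row[column]) > threshold]


rows = list(csv.DictReader(io.StringIO(SAMPLE_CSV)))
large = filter_greater_than(rows, column="population", value="1000000")
print([row["city"] for row in large])  # ['Paris', 'Tokyo']
```

The threshold is passed as the string "1000000" to mirror the quick start, where port values are given as strings.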
Pipelines are portable — save and reload them as XML:
cg.export_to_xml("pipeline.xml")
with Choregraph(xml_spec="pipeline.xml") as cg:
    cg.run()
    df = cg.get_dataset("large_result")
Full walkthrough → docs.nveil.com.
What's inside
- Declarative pipelines — define inputs, transforms, and outputs in Python or portable XML, executed as Kedro pipelines.
- 50+ transforms — filtering, aggregation, joins, pivots, normalization, discretization, row/column operations, …
- Geo — geocode location names to coordinates; join country-boundary polygons for map visualizations.
- NLP — multi-label binarization with automatic language detection, lemmatization, and fuzzy matching.
- Excel intelligence — LLM-assisted detection and tidying of messy multi-table spreadsheets.
- Biosignals — EDF/EDF+ ingestion (EEG, ECG, polysomnography, …).
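The "declarative pipelines" idea in the list above can be illustrated with a small sketch: nodes only declare which ids they read from, and an execution order is derived from those declarations. This is not Choregraph's or Kedro's actual code. The spec dictionary, the node ids other than `population` and `large`, and the `execution_order` helper are hypothetical, and the ordering uses Python's standard `graphlib` module.

```python
from graphlib import TopologicalSorter

# Hypothetical spec: each id maps to the ids it reads from
# (the role played by source_ref in the quick start).
spec = {
    "population": [],             # input, no upstream dependency
    "regions": [],                # hypothetical second input
    "large": ["population"],      # the filter node from the quick start
    "joined": ["large", "regions"],  # hypothetical downstream join
}


def execution_order(spec):
    """Return ids ordered so that every node comes after all of its sources."""
    return list(TopologicalSorter(spec).static_order())


order = execution_order(spec)
print(order)
```

Because the order is derived from the declared references, nodes can be added in any sequence. A reference cycle would make `static_order()` raise `graphlib.CycleError`.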
Part of NVEIL
Choregraph is one of the open-source engines of the NVEIL platform. Most people reach it through the NVEIL Toolkit (pip install nveil) or the platform itself — but Choregraph stands on its own for anyone who wants a programmable, XML-portable data-processing engine. Its results feed the NVEIL platform's visualization engine.
Contributing
Contributions are welcome under the project's Contributor License Agreement — signed once, on your first pull request (a license grant, not an assignment; you keep your rights). Bug reports and ideas are welcome via GitHub Issues.
License
Dual-licensed: AGPL-3.0-or-later (see LICENSE) and a commercial license for closed-source / proprietary use (see COMMERCIAL-LICENSE.md) — contact pierre.jacquet@nveil.com.