CELINE utils
Project description
CELINE Utils
CELINE Utils is a collection of shared utilities, libraries, and command-line tools that form the technical backbone of the CELINE data platform.
It provides reusable building blocks for data pipelines, governance, lineage, metadata management, and platform integrations. The repository is designed to be embedded into CELINE applications and executed within orchestrated environments using Meltano, dbt, Prefect, and OpenLineage.
Scope and Goals
The goals of this repository are to:
- Centralize cross-cutting platform logic used by multiple CELINE projects
- Provide opinionated but extensible tooling for data pipelines
- Enforce consistent governance and lineage semantics
- Reduce duplication across pipeline applications
- Act as a stable foundation for CELINE-compatible services and workflows
This is not an end-user application; it is a platform utility layer.
Key Capabilities
Command Line Interface (CLI)
A unified CLI built with Typer exposes administrative, governance, and pipeline utilities:
celine-utils
├── governance
│ └── generate
└── pipeline
├── init
└── run
Pipeline Orchestration
CELINE Utils provides a structured execution layer for:
- Meltano ingestion pipelines
- dbt transformations and tests
- Prefect-based Python flows
The PipelineRunner coordinates execution, logging, error handling, and lineage emission in a consistent way across tools.
See the pipeline tutorial to discover how to setup and deploy a new pipeline.
OpenLineage Integration
First-class OpenLineage support includes:
- Automatic emission of START, COMPLETE, FAIL, and ABORT events
- Dataset-level schema facets
- Data quality assertions from dbt tests
- Custom CELINE governance facets
Governance Framework
A declarative governance.yaml specification allows you to define:
- Dataset ownership
- License and access level
- Classification and retention
- Tags and documentation links
Governance rules are resolved using pattern matching and injected into lineage events.
Dataset Tooling
The DatasetClient enables:
- Schema and table introspection
- Column metadata inspection
- Safe query construction
- Export to Pandas
Platform Integrations
Built-in integrations include:
- Keycloak for identity and access management
- Apache Superset for analytics platform integration
- MQTT for lightweight messaging
Repository Structure
celine/
admin/
cli/
common/
datasets/
pipelines/
schemas/
tests/
Configuration
Configuration is environment-driven using pydantic-settings:
- Environment variables first
- Optional
.envfiles - Typed validation
- Container-friendly defaults
Installation
pip install celine-utils
Intended Audience
CELINE Utils is intended for:
- Data engineers
- Platform engineers
- CELINE application developers
It is not a general-purpose data tooling library.
License
Copyright © 2025
Spindox Labs
Licensed under the Apache License, Version 2.0.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file celine_utils-1.4.2.tar.gz.
File metadata
- Download URL: celine_utils-1.4.2.tar.gz
- Upload date:
- Size: 41.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
ec94c851b9d602f9cc113b344da51f00482c7d97f78345c015add4df27de1413
|
|
| MD5 |
d547bd4d471882a52aa2f6da87a268b3
|
|
| BLAKE2b-256 |
df8223f3455c2bfe36df668d5c9ad656172536889f04e96ca2a3fa338883ea56
|
Provenance
The following attestation bundles were made for celine_utils-1.4.2.tar.gz:
Publisher:
release.yaml on celine-eu/celine-utils
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
celine_utils-1.4.2.tar.gz -
Subject digest:
ec94c851b9d602f9cc113b344da51f00482c7d97f78345c015add4df27de1413 - Sigstore transparency entry: 770637724
- Sigstore integration time:
-
Permalink:
celine-eu/celine-utils@3329f0f5296f0a97326ebbd88fb9b48397ecafdb -
Branch / Tag:
refs/tags/v1.4.2 - Owner: https://github.com/celine-eu
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release.yaml@3329f0f5296f0a97326ebbd88fb9b48397ecafdb -
Trigger Event:
push
-
Statement type:
File details
Details for the file celine_utils-1.4.2-py3-none-any.whl.
File metadata
- Download URL: celine_utils-1.4.2-py3-none-any.whl
- Upload date:
- Size: 54.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
0353601870bd7f31d92190cc3ec7713434c1d36263a99a226f8cae144175bf75
|
|
| MD5 |
85438e2c61227f708394434932730dcf
|
|
| BLAKE2b-256 |
d343aa77c8a5e475da608b52b9910b3e773b20916bfa061c6579af9464414731
|
Provenance
The following attestation bundles were made for celine_utils-1.4.2-py3-none-any.whl:
Publisher:
release.yaml on celine-eu/celine-utils
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
celine_utils-1.4.2-py3-none-any.whl -
Subject digest:
0353601870bd7f31d92190cc3ec7713434c1d36263a99a226f8cae144175bf75 - Sigstore transparency entry: 770637732
- Sigstore integration time:
-
Permalink:
celine-eu/celine-utils@3329f0f5296f0a97326ebbd88fb9b48397ecafdb -
Branch / Tag:
refs/tags/v1.4.2 - Owner: https://github.com/celine-eu
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release.yaml@3329f0f5296f0a97326ebbd88fb9b48397ecafdb -
Trigger Event:
push
-
Statement type: