CELINE Utils

CELINE Utils is a collection of shared utilities, libraries, and command-line tools that form the technical backbone of the CELINE data platform.

It provides reusable building blocks for data pipelines, governance, lineage, metadata management, and platform integrations. The repository is designed to be embedded into CELINE applications and executed within orchestrated environments using Meltano, dbt, Prefect, and OpenLineage.


Scope and Goals

The goals of this repository are to:

  • Centralize cross-cutting platform logic used by multiple CELINE projects
  • Provide opinionated but extensible tooling for data pipelines
  • Enforce consistent governance and lineage semantics
  • Reduce duplication across pipeline applications
  • Act as a stable foundation for CELINE-compatible services and workflows

This is not an end-user application; it is a platform utility layer.


Key Capabilities

Command Line Interface (CLI)

A unified CLI built with Typer exposes administrative, governance, and pipeline utilities:

celine-utils
 ├── governance
 │    └── generate
 └── pipeline
      ├── init
      └── run

Pipeline Orchestration

CELINE Utils provides a structured execution layer for:

  • Meltano ingestion pipelines
  • dbt transformations and tests
  • Prefect-based Python flows

The PipelineRunner coordinates execution, logging, error handling, and lineage emission in a consistent way across tools.
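The coordination pattern described above can be sketched in a few lines. This is a minimal illustration only: `StepRunner`, `add_step`, and the `emit` callback are hypothetical names invented for the sketch and are not the PipelineRunner API, and the steps here are plain Python callables rather than Meltano, dbt, or Prefect invocations.

```python
import logging
from dataclasses import dataclass, field
from typing import Callable

logger = logging.getLogger("pipeline_sketch")


@dataclass
class StepRunner:
    """Hypothetical stand-in for a pipeline runner (not the CELINE API)."""

    job_name: str
    # Callback that receives (event_type, job_name), e.g. a lineage emitter.
    emit: Callable[[str, str], None]
    steps: list = field(default_factory=list)

    def add_step(self, name: str, func: Callable[[], None]) -> None:
        self.steps.append((name, func))

    def run(self) -> bool:
        """Run steps in order; emit START, then COMPLETE or FAIL."""
        self.emit("START", self.job_name)
        for name, func in self.steps:
            logger.info("running step %s", name)
            try:
                func()
            except Exception:
                # Log the traceback once, report the failure, stop the run.
                logger.exception("step %s failed", name)
                self.emit("FAIL", self.job_name)
                return False
        self.emit("COMPLETE", self.job_name)
        return True


events = []
runner = StepRunner("demo.pipeline", emit=lambda kind, job: events.append((kind, job)))
runner.add_step("ingest", lambda: None)
runner.add_step("transform", lambda: None)
ok = runner.run()
```

Routing every outcome through one `run` method is what keeps logging and lineage events consistent regardless of which tool a step wraps.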

See the pipeline tutorial to learn how to set up and deploy a new pipeline.


OpenLineage Integration

First-class OpenLineage support includes:

  • Automatic emission of START, COMPLETE, FAIL, and ABORT events
  • Dataset-level schema facets
  • Data quality assertions from dbt tests
  • Custom CELINE governance facets
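To make the event shape concrete, the sketch below builds an OpenLineage run event with a dataset-level schema facet as a plain dictionary, using only the standard library. It does not use CELINE Utils or the OpenLineage client. The producer URI, namespaces, and dataset names are placeholders, and the spec versions in the schema URLs are assumptions that should be matched to whatever version your backend expects.

```python
import json
import uuid
from datetime import datetime, timezone

# Placeholder producer URI (assumption, not the value CELINE Utils emits).
PRODUCER = "https://example.org/celine-utils-sketch"
EVENT_TYPES = {"START", "COMPLETE", "FAIL", "ABORT"}


def schema_facet(columns):
    """Build a schema dataset facet from (name, type) pairs."""
    return {
        "_producer": PRODUCER,
        # Facet spec version is an assumption.
        "_schemaURL": "https://openlineage.io/spec/facets/1-0-0/SchemaDatasetFacet.json",
        "fields": [{"name": name, "type": type_} for name, type_ in columns],
    }


def build_run_event(event_type, run_id, job_namespace, job_name, outputs=None):
    """Assemble a run event as a JSON-serializable dict."""
    if event_type not in EVENT_TYPES:
        raise ValueError(f"unsupported event type: {event_type!r}")
    return {
        "eventType": event_type,
        "eventTime": datetime.now(timezone.utc).isoformat(),
        "run": {"runId": run_id},
        "job": {"namespace": job_namespace, "name": job_name},
        "inputs": [],
        "outputs": outputs or [],
        "producer": PRODUCER,
        # Core spec version is an assumption.
        "schemaURL": "https://openlineage.io/spec/2-0-2/OpenLineage.json#/$defs/RunEvent",
    }


run_id = str(uuid.uuid4())
output = {
    "namespace": "postgres://example-host:5432",  # placeholder
    "name": "raw.meters",  # placeholder
    "facets": {"schema": schema_facet([("id", "INTEGER"), ("site", "TEXT")])},
}
event = build_run_event("COMPLETE", run_id, "celine-demo", "demo.pipeline", [output])
payload = json.dumps(event)
```

A START event for the same run would reuse `run_id`, which is how a lineage backend correlates the events of one execution.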

Governance Framework

A declarative governance.yaml specification allows you to define:

  • Dataset ownership
  • License and access level
  • Classification and retention
  • Tags and documentation links

Governance rules are resolved using pattern matching and injected into lineage events.
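One way such resolution could work is sketched below. The text above only states that rules are resolved by pattern matching, so everything else here is an assumption made for illustration: the glob-style patterns, the rule keys, the "later rules override earlier ones" ordering, and the `resolve_governance` helper. The rules are written as a Python list standing in for a loaded governance.yaml file.

```python
from fnmatch import fnmatchcase

# Hypothetical rules, shaped as they might look after loading governance.yaml.
RULES = [
    {"pattern": "*", "owner": "platform-team", "access_level": "internal"},
    {"pattern": "raw.*", "retention_days": 30},
    {"pattern": "raw.customers*", "classification": "pii", "access_level": "restricted"},
]


def resolve_governance(dataset_name, rules):
    """Merge every matching rule in order; later matches override earlier ones."""
    resolved = {}
    for rule in rules:
        if fnmatchcase(dataset_name, rule["pattern"]):
            resolved.update({k: v for k, v in rule.items() if k != "pattern"})
    return resolved


customers = resolve_governance("raw.customers_eu", RULES)
sales = resolve_governance("marts.sales", RULES)
```

Under these assumptions, `customers` picks up the catch-all owner, the 30-day retention from `raw.*`, and the stricter access level from the most specific rule, while `sales` only receives the catch-all defaults. The resolved dictionary is the kind of payload that could then be attached to a lineage event as a custom facet.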


Dataset Tooling

The DatasetClient enables:

  • Schema and table introspection
  • Column metadata inspection
  • Safe query construction
  • Export to Pandas
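The ideas behind column introspection and safe query construction can be illustrated with the standard library's `sqlite3` module. This is not the DatasetClient API: the helper names are hypothetical, and SQLite stands in for whatever database a CELINE deployment uses. The point is the split between identifiers, which are validated and quoted, and values, which are always passed as bound parameters.

```python
import re
import sqlite3

# Conservative allow-list for table and column names (assumption for the sketch).
_IDENTIFIER = re.compile(r"[A-Za-z_][A-Za-z0-9_]*")


def quote_identifier(name):
    """Reject anything that is not a plain identifier, then quote it."""
    if not _IDENTIFIER.fullmatch(name):
        raise ValueError(f"unsafe identifier: {name!r}")
    return f'"{name}"'


def list_columns(conn, table):
    """Return (column_name, declared_type) pairs for a table."""
    rows = conn.execute(f"PRAGMA table_info({quote_identifier(table)})").fetchall()
    return [(row[1], row[2]) for row in rows]


def select_where(conn, table, column, value):
    """Filter on one column; the value is bound, never interpolated."""
    sql = (
        f"SELECT * FROM {quote_identifier(table)} "
        f"WHERE {quote_identifier(column)} = ?"
    )
    return conn.execute(sql, (value,)).fetchall()


conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE meters (id INTEGER, site TEXT)")
conn.executemany("INSERT INTO meters VALUES (?, ?)", [(1, "north"), (2, "south")])

columns = list_columns(conn, "meters")
rows = select_where(conn, "meters", "site", "north")
```

Passing a name such as `meters; DROP TABLE meters` to `quote_identifier` raises `ValueError` before any SQL is built.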

Platform Integrations

Built-in integrations include:

  • Keycloak for identity and access management
  • Apache Superset for analytics
  • MQTT for lightweight messaging

Repository Structure

celine/
  admin/
  cli/
  common/
  datasets/
  pipelines/
schemas/
tests/

Configuration

Configuration is environment-driven using pydantic-settings:

  • Environment variables first
  • Optional .env files
  • Typed validation
  • Container-friendly defaults
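The precedence described in the list above (environment variables, then an optional .env file, then defaults, with typed values) can be illustrated without any dependencies. The sketch below deliberately uses only the standard library instead of pydantic-settings, so it shows the behaviour rather than the library's API. The `SKETCH_` prefix, the setting names, and the helper functions are all hypothetical.

```python
import os
from dataclasses import dataclass
from pathlib import Path


def read_dotenv(path):
    """Parse simple KEY=VALUE lines; a missing file yields no values."""
    values = {}
    file = Path(path)
    if not file.is_file():
        return values
    for line in file.read_text().splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        values[key.strip()] = value.strip()
    return values


@dataclass(frozen=True)
class SketchSettings:
    db_host: str
    db_port: int


def load_settings(environ=None, dotenv_path=".env"):
    """Environment variables override .env values, which override defaults."""
    environ = os.environ if environ is None else environ
    merged = {**read_dotenv(dotenv_path), **environ}
    return SketchSettings(
        db_host=merged.get("SKETCH_DB_HOST", "localhost"),
        # int() raises ValueError on a non-numeric port (typed validation).
        db_port=int(merged.get("SKETCH_DB_PORT", "5432")),
    )


settings = load_settings({"SKETCH_DB_PORT": "6543"}, dotenv_path="missing.env")
```

Here `settings.db_port` comes from the supplied environment mapping and `settings.db_host` falls back to its default, since no .env file is present. Defaults that work out of the box are what make this style convenient inside containers.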

Installation

pip install celine-utils

Intended Audience

CELINE Utils is intended for:

  • Data engineers
  • Platform engineers
  • CELINE application developers

It is not a general-purpose data tooling library.


License

Copyright © 2025 Spindox Labs

Licensed under the Apache License, Version 2.0.
