Cogniflow basic sink StepPackage with C++ data hive parquet sink.

cf-basic-sinks

cf-basic-sinks provides native sink steps for pipeline outputs.

Steps

| Step | Category | Description |
|------|----------|-------------|
| cfsink:DataHiveParquetSinkStep | sink | Write canonical data hive parquet output through the cf_datahive_cpp gatekeeper |

The published distribution name is cf-basic-sinks. Install it with:

pip install cf-basic-sinks

Native build prerequisites

cf_basic_sinks is built with scikit-build-core and CMake and requires:

  • CPython 3.13
  • CMake on PATH
  • a Windows C++ toolchain compatible with that CMake installation
  • access to cf-pipeline-sdk from the package index
  • access to cf-datahive from the package index, which supplies the owner-provided native cf_datahive_cpp source surface
  • DuckDB native headers/library for the transitive cf_datahive_cpp build

This package does not publish cf_datahive_cpp separately. The step package consumes the owner-provided native source surface exposed by the installed cf_datahive package. Native data hive write ownership remains under cf_datahive.
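
The checkable parts of the prerequisite list can be verified before starting a build. The following is a minimal sketch, not part of this package: the function name `check_build_prerequisites` is hypothetical, and treating "access from the package index" as "already installed in the current interpreter" is an assumption that mirrors what the publish workflow does when it preinstalls both packages. It does not probe the C++ toolchain or DuckDB.

```python
import importlib.metadata
import shutil
import sys


def check_build_prerequisites(version_info=sys.version_info):
    """Return a list of human-readable problems; an empty list means no problem was found.

    Hypothetical helper for illustration only. It covers the interpreter
    version, CMake on PATH, and the two index packages named in the README.
    """
    problems = []

    # The README lists CPython 3.13 as a requirement.
    major, minor = version_info[0], version_info[1]
    if (major, minor) != (3, 13):
        problems.append(f"CPython 3.13 required, found {major}.{minor}")

    # CMake must be discoverable on PATH.
    if shutil.which("cmake") is None:
        problems.append("cmake not found on PATH")

    # Assumption: the SDK and data-hive packages are preinstalled in this
    # interpreter, as the publish workflow does before building.
    for dist in ("cf-pipeline-sdk", "cf-datahive"):
        try:
            importlib.metadata.version(dist)
        except importlib.metadata.PackageNotFoundError:
            problems.append(f"{dist} is not installed in this interpreter")

    return problems


if __name__ == "__main__":
    for problem in check_build_prerequisites():
        print("missing prerequisite:", problem)
```

An empty result only means these particular checks passed; the native build can still fail for reasons the sketch does not inspect.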

Local builds can satisfy the DuckDB requirement through either:

  • the repo-local .native_deps/duckdb layout created by the setup scripts
  • the CF_DATAHIVE_CPP_DUCKDB_INCLUDE and CF_DATAHIVE_CPP_DUCKDB_LIB environment variables
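
To make the two options concrete, here is a minimal sketch of how a helper script might choose between them. The function `resolve_duckdb_paths` is hypothetical and is not how the CMake build itself resolves DuckDB. Giving the environment variables precedence over the repo-local layout is an assumption. The repo-local paths (`.native_deps/duckdb` and `.native_deps/duckdb/lib/duckdb.lib`) are taken from the local preflight commands in this README.

```python
import os
from pathlib import Path


def resolve_duckdb_paths(repo_root, env=None):
    """Return (include_path, lib_path) for the DuckDB native dependency.

    Hypothetical helper for illustration only.
    """
    env = os.environ if env is None else env

    include = env.get("CF_DATAHIVE_CPP_DUCKDB_INCLUDE")
    lib = env.get("CF_DATAHIVE_CPP_DUCKDB_LIB")

    # Assumption: explicit environment variables win when both are set.
    if include and lib:
        return Path(include), Path(lib)

    # Otherwise fall back to the repo-local layout used by the preflight snippet.
    base = Path(repo_root) / ".native_deps" / "duckdb"
    return base, base / "lib" / "duckdb.lib"


if __name__ == "__main__":
    include_path, lib_path = resolve_duckdb_paths(Path.cwd())
    print("include:", include_path)
    print("lib:", lib_path)
```

The sketch only computes paths; it does not check that they exist.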

Publishing

cf_basic_sinks is published with the dedicated Windows workflow:

  • Workflow: .github/workflows/cf_basic_sinks_windows_publish.yml
  • Package directory: sandcastle/cf_basic_steps/cf_basic_sinks
  • PyPI tag: cf-basic-sinks-v<version>
  • TestPyPI tag: cf-basic-sinks-v<version>-test
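
The tag convention above can be illustrated with a small parser. This is a sketch under stated assumptions, not the workflow's actual logic: the function `classify_release_tag` is hypothetical, and the target label `pypi` is assumed by analogy with the `testpypi` value passed to `-PublishTarget` later in this README.

```python
import re

# cf-basic-sinks-v<version> publishes to PyPI; a trailing "-test" selects TestPyPI.
_TAG_PATTERN = re.compile(r"^cf-basic-sinks-v(?P<version>.+?)(?P<test>-test)?$")


def classify_release_tag(tag):
    """Return (target, version) for a release tag, or None if it does not match.

    Hypothetical helper for illustration only.
    """
    match = _TAG_PATTERN.match(tag)
    if match is None:
        return None
    target = "testpypi" if match.group("test") else "pypi"
    return target, match.group("version")


if __name__ == "__main__":
    for tag in ("cf-basic-sinks-v0.1.1", "cf-basic-sinks-v0.1.1-test", "unrelated-tag"):
        print(tag, "->", classify_release_tag(tag))
```

The version part is matched lazily so that a trailing `-test` is read as the TestPyPI marker rather than as part of the version string.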

Workflow notes:

  • test/build jobs provision native deps via scripts/setup_native_deps_v2.ps1
  • test/build jobs preinstall published cf-pipeline-sdk and cf-datahive into the job interpreter before invoking the shared publish helper, so the CMake Python subprocesses can discover the owner-provided SDK and data-hive surfaces
  • test/build jobs set CF_DATAHIVE_CPP_DUCKDB_INCLUDE and CF_DATAHIVE_CPP_DUCKDB_LIB to point at the repo-local .native_deps/duckdb layout, because the published cf_datahive owner surface is resolved from site-packages rather than from the repository tree

Local preflight:

$env:CF_DATAHIVE_CPP_DUCKDB_INCLUDE = (Resolve-Path .native_deps/duckdb).Path
$env:CF_DATAHIVE_CPP_DUCKDB_LIB = (Resolve-Path .native_deps/duckdb/lib/duckdb.lib).Path
powershell -ExecutionPolicy Bypass -File scripts/mimic_windows_python_publish_workflow.ps1 `
  -WorkflowFile .github/workflows/cf_basic_sinks_windows_publish.yml `
  -PackageDir sandcastle/cf_basic_steps/cf_basic_sinks `
  -PythonExe py `
  -PythonVersion 3.13

Queue a dry-run dispatch:

powershell -ExecutionPolicy Bypass -File scripts/queue_windows_python_publish_workflow.ps1 `
  -WorkflowFile .github/workflows/cf_basic_sinks_windows_publish.yml `
  -PackageDir sandcastle/cf_basic_steps/cf_basic_sinks `
  -PublishTarget testpypi `
  -Ref main `
  -RequireLocalPass `
  -DryRun
