Skip to main content

Cogniflow basic sink StepPackage with C++ data hive parquet sink.

Project description

cf-basic-sinks

cf-basic-sinks provides native sink steps for pipeline outputs.

Steps

Step Category Description
cfsink:DataHiveParquetSinkStep sink Write canonical data hive parquet output through the cf_datahive_cpp gatekeeper

Published distribution name:

pip install cf-basic-sinks

Native build prerequisites

cf_basic_sinks is built with scikit-build-core and CMake and requires:

  • CPython 3.13
  • CMake on PATH
  • a Windows C++ toolchain compatible with that CMake installation
  • access to cf-pipeline-sdk from the package index
  • access to cf-datahive from the package index for the owner-provided native cf_datahive_cpp source surface
  • DuckDB native headers/library for the transitive cf_datahive_cpp build

This package does not publish cf_datahive_cpp separately. The step package consumes the owner-provided native source surface exposed by the installed cf_datahive package. Native data hive write ownership remains under cf_datahive.

Local builds can satisfy the DuckDB requirement through either:

  • the repo-local .native_deps/duckdb layout created by the setup scripts
  • CF_DATAHIVE_CPP_DUCKDB_INCLUDE and CF_DATAHIVE_CPP_DUCKDB_LIB

Publishing

cf_basic_sinks is published with the dedicated Windows workflow:

  • Workflow: .github/workflows/cf_basic_sinks_windows_publish.yml
  • Package directory: sandcastle/cf_basic_steps/cf_basic_sinks
  • PyPI tag: cf-basic-sinks-v<version>
  • TestPyPI tag: cf-basic-sinks-v<version>-test

Workflow note:

  • test/build jobs provision native deps via scripts/setup_native_deps_v2.ps1
  • test/build jobs preinstall published cf-pipeline-sdk and cf-datahive into the job interpreter before invoking the shared publish helper, so the CMake Python subprocesses can discover the owner-provided SDK and data-hive surfaces
  • test/build jobs export CF_DATAHIVE_CPP_DUCKDB_INCLUDE and CF_DATAHIVE_CPP_DUCKDB_LIB to the repo-local .native_deps/duckdb layout, because the published cf_datahive owner surface is resolved from site-packages rather than from the repository tree

Local preflight:

$env:CF_DATAHIVE_CPP_DUCKDB_INCLUDE = (Resolve-Path .native_deps/duckdb).Path
$env:CF_DATAHIVE_CPP_DUCKDB_LIB = (Resolve-Path .native_deps/duckdb/lib/duckdb.lib).Path
powershell -ExecutionPolicy Bypass -File scripts/mimic_windows_python_publish_workflow.ps1 `
  -WorkflowFile .github/workflows/cf_basic_sinks_windows_publish.yml `
  -PackageDir sandcastle/cf_basic_steps/cf_basic_sinks `
  -PythonExe py `
  -PythonVersion 3.13

Queue a dry-run dispatch:

powershell -ExecutionPolicy Bypass -File scripts/queue_windows_python_publish_workflow.ps1 `
  -WorkflowFile .github/workflows/cf_basic_sinks_windows_publish.yml `
  -PackageDir sandcastle/cf_basic_steps/cf_basic_sinks `
  -PublishTarget testpypi `
  -Ref main `
  -RequireLocalPass `
  -DryRun

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cf_basic_sinks-0.1.0.tar.gz (52.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

cf_basic_sinks-0.1.0-cp313-cp313-win_amd64.whl (51.4 kB view details)

Uploaded CPython 3.13Windows x86-64

File details

Details for the file cf_basic_sinks-0.1.0.tar.gz.

File metadata

  • Download URL: cf_basic_sinks-0.1.0.tar.gz
  • Upload date:
  • Size: 52.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.2.0 CPython/3.13.12

File hashes

Hashes for cf_basic_sinks-0.1.0.tar.gz
Algorithm Hash digest
SHA256 3e0c1d376f2658ac24a1c92816b8fb53bc2eef14dbc37ca66f4e3127e97f82c0
MD5 58c3a0fa08bd311ec4bc814b815763ac
BLAKE2b-256 34c101f6a3ee5c0dfd064967936a3f0a334c3c73fd962b56625bc23bb539b035

See more details on using hashes here.

File details

Details for the file cf_basic_sinks-0.1.0-cp313-cp313-win_amd64.whl.

File metadata

File hashes

Hashes for cf_basic_sinks-0.1.0-cp313-cp313-win_amd64.whl
Algorithm Hash digest
SHA256 ffb053456a0cda499fdaabc564b4f59377f0b0cc558ff7e01ff96573d8238829
MD5 a0613b418974ef04437b1b8eb3f7cb20
BLAKE2b-256 b49f92b20ed3527a751d9e8de9a3b31f18868ef786d326a0e79cdfc0039ff11b

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page