Skip to main content

CLI tool and library for CSV import pipelines

Project description

Astro pipeline

Astro

CLI tool and library for CSV import pipelines.

Documentation

Full guides, API reference, and contributing docs are at https://astro-pipeline.readthedocs.io.

Build locally: pip install -e ".[docs]" and make docs.

Topic Where to read
Install and quickstart docs/getting-started/
Pipeline authoring docs/user-guide/pipelines.md
CLI reference docs/user-guide/cli.md
Contributing docs/contributing/
Behavioural spec (implementers) SPEC.md

Install

pip install astro-pipeline

PyPI name is astro-pipeline because astro is already taken. The CLI and import name remain astro.

From source:

git clone https://github.com/starlincs/astro.git
cd astro
pip install -e ".[dev]"

What Astro does

Astro ingests CSV directories into validated Parquet snapshots, then runs ordered pipeline steps with statistics, filtering, and row quarantine. Each pipeline lives in an external repository as a pipeline.py file.

astro ingest path/to/data/
astro run
astro describe
astro list

See the quickstart and CLI reference for full usage, including cleanup, quarantine retry, and large-file behaviour.

Security

Astro loads and executes pipeline.py from the directory you point it at. Only run Astro against pipeline repositories you trust. See the security model in the docs.

Development

make check    # lint + format + typecheck + tests
make cov      # include large-file integration tests

See docs/contributing/ for the test-first workflow and release process.

License

MIT — see LICENSE.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

astro_pipeline-1.0.0.tar.gz (2.2 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

astro_pipeline-1.0.0-py3-none-any.whl (60.3 kB view details)

Uploaded Python 3

File details

Details for the file astro_pipeline-1.0.0.tar.gz.

File metadata

  • Download URL: astro_pipeline-1.0.0.tar.gz
  • Upload date:
  • Size: 2.2 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for astro_pipeline-1.0.0.tar.gz
Algorithm Hash digest
SHA256 f78a9af3137049613385134950b61f32871b084de38e08f9591a459e8a291aec
MD5 cf0e6faa6ee44f8f233a5f51306c22b7
BLAKE2b-256 a48c6ea2382dc203ca1140a00c058d9516b6e03b1556f81bbdcc92b8c9149c59

See more details on using hashes here.

Provenance

The following attestation bundles were made for astro_pipeline-1.0.0.tar.gz:

Publisher: publish.yml on starlincs/astro

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file astro_pipeline-1.0.0-py3-none-any.whl.

File metadata

  • Download URL: astro_pipeline-1.0.0-py3-none-any.whl
  • Upload date:
  • Size: 60.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for astro_pipeline-1.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 b6fd89eceb6e6278e6f8df346d27bda99faacb15e36703bf4f8bc19aa22f4572
MD5 dcaa44abca0f64c157ed857dc8df1fd9
BLAKE2b-256 d1a1cdfd211b5e473abc37f38e88d8f9380e04240649e6e4ae502f1779d24ffd

See more details on using hashes here.

Provenance

The following attestation bundles were made for astro_pipeline-1.0.0-py3-none-any.whl:

Publisher: publish.yml on starlincs/astro

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page