Skip to main content

Swiss AI Hub Pipeline SDK: Dagster-based document ingestion, parsing, embedding, and vector storage for RAG.

Project description

Swiss AI Hub Pipeline

Data ingestion and processing SDK for the Swiss AI Hub platform. A Dagster-based, asset-oriented framework that turns documents into RAG-ready vectors.

  • Two-stage architecture — source-specific ingestion (SharePoint, OneDrive, S3, local FS via rclone) into an S3-compatible data lake, then a unified parse → chunk → embed → store pipeline into Milvus and MongoDB.
  • Factory pattern — compose pipelines from asset factories, resources, IO managers, and sensors.
  • Lineage — every vector embedding traces back to its source document.

Installation

pip install swiss-ai-hub-pipeline

This pulls in swiss-ai-hub-core.

Usage

from swiss_ai_hub.pipeline import default_definitions

Links

License

Apache-2.0

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

swiss_ai_hub_pipeline-0.291.5.tar.gz (77.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

swiss_ai_hub_pipeline-0.291.5-py3-none-any.whl (148.7 kB view details)

Uploaded Python 3

File details

Details for the file swiss_ai_hub_pipeline-0.291.5.tar.gz.

File metadata

  • Download URL: swiss_ai_hub_pipeline-0.291.5.tar.gz
  • Upload date:
  • Size: 77.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.10.4 {"installer":{"name":"uv","version":"0.10.4","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"22.04","id":"jammy","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for swiss_ai_hub_pipeline-0.291.5.tar.gz
Algorithm Hash digest
SHA256 0ac1de48badca3487cc3a3608012febe5375f7cba830b8d1ef9563c74fa51461
MD5 ec1753600a0ab9dffe2e3270b63de39d
BLAKE2b-256 79d7c433e41e221d6f4739db10f80f1f37cb80c086daee7010967855f00ca0a7

See more details on using hashes here.

File details

Details for the file swiss_ai_hub_pipeline-0.291.5-py3-none-any.whl.

File metadata

  • Download URL: swiss_ai_hub_pipeline-0.291.5-py3-none-any.whl
  • Upload date:
  • Size: 148.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.10.4 {"installer":{"name":"uv","version":"0.10.4","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"22.04","id":"jammy","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for swiss_ai_hub_pipeline-0.291.5-py3-none-any.whl
Algorithm Hash digest
SHA256 34589957c5fc643f51d848b2495ababe30df76651f5c511602becceb50f7d039
MD5 308bbb159e2b5c4da9c866953810fcc0
BLAKE2b-256 751590622fb9282927f13465416acd70be95fa1a14ea32e47f1865c7fb57a7b9

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page