Skip to main content

Apache Spark integration for Ordeq

Project description

Welcome to Ordeq!

Release Docs PyPI PyPI - Downloads License: MIT Static Badge

Ordeq is a framework for developing data pipelines. It simplifies IO and modularizes pipeline logic. Ordeq elevates your proof-of-concept to a production-grade pipelines. See the introduction for an easy-to-follow example of how Ordeq can help.

Installation

Ordeq is available under MIT license. Please refer to the license and notice for more details.

To install Ordeq, run:

uv pip install ordeq

Integrations

Ordeq integrates seamlessly with existing tooling. It provides integrations with many popular libraries out of the box. You can install them as needed. For example, for reading and writing data with Pandas, install the ordeq-pandas package:

uv pip install ordeq-pandas

Some of the available integrations:

Data processing

Pandas
Pandas
Spark
Spark
NumPy
Numpy
Polars
Polars
Ibis
Ibis
Joblib
Joblib
HuggingFace
HuggingFace
SentenceTransformers
st
Requests
Requests
Pydantic
Pydantic
DuckDB
DuckDB
Networkx
NetworkX
TOML
TOML
PyMuPDF
PyMuPDF

Plotting

Matplotlib
Matplotlib
Altair
Altair
Plotly Express
Plotly Express

Cloud storage

Google Cloud Storage
Google Cloud Storage
Azure
Azure Storage Blob
AWS S3
AWS S3
Boto3
Boto3

Have a look at the package overview and API reference for a list of available packages.

Documentation

Documentation is available at https://ing-bank.github.io/ordeq/.

Why consider Ordeq?

  • Ordeq is the GenAI companion: it gives your project structure and consistency, such that GenAI can thrive
  • It offers seamless integrations with existing data & ML tooling, such as Spark, Pandas, Pydantic and PyMuPDF, and adding new integrations is trivial
  • It's actively developed and trusted by data scientists, engineers, analysts and machine learning engineers at ING

Learning Ordeq

To learn more about Ordeq, check out the following resources:

Acknowledgements

Ordeq builds upon design choices and ideas from Kedro and other frameworks. It has been developed at ING, with contributions from various individuals. Please refer to the acknowledgements section in the documentation for more details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ordeq_spark-2.0.2.tar.gz (15.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

ordeq_spark-2.0.2-py3-none-any.whl (19.2 kB view details)

Uploaded Python 3

File details

Details for the file ordeq_spark-2.0.2.tar.gz.

File metadata

  • Download URL: ordeq_spark-2.0.2.tar.gz
  • Upload date:
  • Size: 15.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.9.9 {"installer":{"name":"uv","version":"0.9.9"},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for ordeq_spark-2.0.2.tar.gz
Algorithm Hash digest
SHA256 13d8594c5d5f2408dd2ba1f667fd5e74e5989361ff98a6f6bb7e754a68a6a2ae
MD5 1039b259a2c8f59ccb40531a20f6d9a6
BLAKE2b-256 19c1ec4013905dabbd4c5eb895ecb9b9031b9d6a9b89ae9f2b0b2af82ff762f2

See more details on using hashes here.

File details

Details for the file ordeq_spark-2.0.2-py3-none-any.whl.

File metadata

  • Download URL: ordeq_spark-2.0.2-py3-none-any.whl
  • Upload date:
  • Size: 19.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.9.9 {"installer":{"name":"uv","version":"0.9.9"},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for ordeq_spark-2.0.2-py3-none-any.whl
Algorithm Hash digest
SHA256 26a93ee1524335de9d0806bfe79a6e97707deadb27008d147284ecbf814fe1c9
MD5 e556ab6910d75872e74c1b30485e6e97
BLAKE2b-256 c22c504df04f75b63042de4f914f807d59227b3cbe3df94c89b246e5c9075080

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page