Prefect 2.0 collection for Soda Core

prefect-soda-core

Welcome!

Getting Started

Python setup

prefect-soda-core requires Python 3.7 or later.

We recommend using a Python virtual environment manager such as pipenv, conda or virtualenv.

These tasks are designed to work with Prefect 2.0. For more information about how to use Prefect, please refer to the Prefect documentation.

Installation

prefect-soda-core is built on soda-core.
Like soda-core, it requires you to install the extra that matches your data source.
For example, to use prefect-soda-core with Snowflake, run the following:

pip install "prefect-soda-core[snowflake]"

You can find the list of supported options in setup.py.
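
If prefect-soda-core is already installed, the declared extras can also be read from the package metadata rather than from setup.py. The following is a minimal sketch, assuming Python 3.8 or later (for importlib.metadata); the helper name list_extras is hypothetical and not part of prefect-soda-core.

```python
from importlib.metadata import PackageNotFoundError, metadata


def list_extras(distribution: str = "prefect-soda-core") -> list:
    """Return the extras declared by an installed distribution, or [] if it is absent."""
    try:
        dist_metadata = metadata(distribution)
    except PackageNotFoundError:
        # The distribution is not installed in this environment.
        return []
    # "Provides-Extra" is the core-metadata field that lists optional extras.
    return sorted(dist_metadata.get_all("Provides-Extra") or [])


if __name__ == "__main__":
    print(list_extras())
```

An empty list means either that the package is not installed in the active environment or that it declares no extras.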

Note that because this integration is built on top of the Soda CLI, it cannot run data quality checks using Spark.

Write and run a flow

from prefect import flow
from prefect.context import get_run_context
from prefect_soda_core.soda_configuration import SodaConfiguration
from prefect_soda_core.sodacl_check import SodaCLCheck
from prefect_soda_core.tasks import soda_scan_execute


@flow
def run_soda_scan():
    soda_configuration_block = SodaConfiguration(
        configuration_yaml_path="/path/to/config.yaml"
    )
    soda_check_block = SodaCLCheck(
        sodacl_yaml_path="/path/to/checks.yaml"
    )
    
    # Using the flow_run_name as the name of the file to store the scan results
    flow_run_name = get_run_context().flow_run.name
    scan_results_file_path = f"{flow_run_name}.json"
    
    return soda_scan_execute(
        data_source_name="my_datasource",
        configuration=soda_configuration_block,
        checks=soda_check_block,
        variables={"var": "value"},
        scan_results_file=scan_results_file_path,
        verbose=True,
        return_scan_result_file_content=False,
        shell_env={"SNOWFLAKE_PASSWORD": "********"}
    )

if __name__ == "__main__":
    run_soda_scan()
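
The flow above expects a Soda configuration file and a SodaCL checks file to exist at the paths given to the two blocks. As a rough illustration of what those files might contain, the sketch below writes minimal versions of both. Everything in it is an assumption made for illustration: the helper name write_example_soda_files, the dataset name my_table, the column id, the connection keys, and the environment variable names are hypothetical. The exact connection keys depend on your data source and Soda Core version.

```python
from pathlib import Path
from textwrap import dedent

# Hypothetical Snowflake connection for the data source used in the flow.
# The ${...} placeholders are meant to be resolved by Soda from environment
# variables, such as those passed to the task through shell_env.
CONFIGURATION_YAML = dedent(
    """\
    data_source my_datasource:
      type: snowflake
      username: ${SNOWFLAKE_USERNAME}
      password: ${SNOWFLAKE_PASSWORD}
      account: ${SNOWFLAKE_ACCOUNT}
      database: MY_DATABASE
      warehouse: MY_WAREHOUSE
      schema: PUBLIC
    """
)

# Hypothetical SodaCL checks against an assumed table and column.
CHECKS_YAML = dedent(
    """\
    checks for my_table:
      - row_count > 0
      - missing_count(id) = 0
    """
)


def write_example_soda_files(target_dir):
    """Write example config.yaml and checks.yaml files and return their paths."""
    directory = Path(target_dir)
    directory.mkdir(parents=True, exist_ok=True)
    config_path = directory / "config.yaml"
    checks_path = directory / "checks.yaml"
    config_path.write_text(CONFIGURATION_YAML, encoding="utf-8")
    checks_path.write_text(CHECKS_YAML, encoding="utf-8")
    return config_path, checks_path
```

The returned paths could then be passed as configuration_yaml_path and sodacl_yaml_path. The data source name in the configuration file has to match the data_source_name argument given to soda_scan_execute.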

Resources

If you encounter any bugs while using prefect-soda-core, feel free to open an issue in the prefect-soda-core repository.

If you have any questions or issues while using prefect-soda-core, you can find help in either the Prefect Discourse forum or the Prefect Slack community.

Development

If you'd like to install a version of prefect-soda-core for development, clone the repository and perform an editable install with pip:

git clone https://github.com/sodadata/prefect-soda-core.git

cd prefect-soda-core/

pip install -e ".[dev]"

# Install linting pre-commit hooks
pre-commit install
