Prefect 2.0 collection for Soda Core
Project description
prefect-soda-core
Welcome!
Prefect 2.0 collection for Soda Core
Getting Started
Python setup
Requires an installation of Python 3.7+.
We recommend using a Python virtual environment manager such as pipenv, conda or virtualenv.
These tasks are designed to work with Prefect 2.0. For more information about how to use Prefect, please refer to the Prefect documentation.
Installation
prefect-soda-core
is based on soda-core
.
As soda-core
requires you to specify the right option for your database, so does prefect-soda-core
.
I.e. to use prefect-soda-core
with Snowflake, run the following:
pip install prefect-soda-core[snowflake]
You can find the list of supported options in setup.py
.
Please note that since this integration is built on top of Soda CLI, it is not possible to run data quality checks using Spark.
Write and run a flow
from prefect import flow
from prefect.context import get_run_context
from prefect_soda_core.soda_configuration import SodaConfiguration
from prefect_soda_core.sodacl_check import SodaCLCheck
from prefect_soda_core.tasks import soda_scan_execute
@flow
def run_soda_scan():
soda_configuration_block = SodaConfiguration(
configuration_yaml_path="/path/to/config.yaml"
)
soda_check_block = SodaCLCheck(
sodacl_yaml_path="/path/to/checks.yaml"
)
# Using the flow_run_name as the name of the file to store the scan results
flow_run_name = get_run_context().flow_run.name
scan_results_file_path = f"{flow_run_name}.json"
return soda_scan_execute(
data_source_name="my_datasource",
configuration=soda_configuration_block,
checks=soda_check_block,
variables={"var": "value"},
scan_results_file=scan_results_file_path,
verbose=True,
return_scan_result_file_content=False,
shell_env={"SNOWFLAKE_PASSWORD": "********"}
)
run_soda_scan()
Resources
If you encounter any bugs while using prefect-soda-core
, feel free to open an issue in the prefect-soda-core repository.
If you have any questions or issues while using prefect-soda-core
, you can find help in either the Prefect Discourse forum or the Prefect Slack community.
Development
If you'd like to install a version of prefect-soda-core
for development, clone the repository and perform an editable install with pip
:
git clone https://github.com/sodadata/prefect-soda-core.git
cd prefect-soda-core/
pip install -e ".[dev]"
# Install linting pre-commit hooks
pre-commit install
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file prefect-soda-core-0.1.8.tar.gz
.
File metadata
- Download URL: prefect-soda-core-0.1.8.tar.gz
- Upload date:
- Size: 28.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.11.7
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | c660e4cbaa8cf4fd79128d18fad8f1ccd6d14d7288d25a6025574cbea30b2ada |
|
MD5 | 8ed1addec6ce07b2ab5d305336dab839 |
|
BLAKE2b-256 | 887275a4fb5e037f40b96d9b0f2d1e5b0f86de45f4e385ea2e816f45459c05db |
File details
Details for the file prefect_soda_core-0.1.8-py3-none-any.whl
.
File metadata
- Download URL: prefect_soda_core-0.1.8-py3-none-any.whl
- Upload date:
- Size: 12.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.11.7
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 252faece1e2c527abeff9c1ba2a297d75f8f3f7c129d6f2bcc147b487606dbcd |
|
MD5 | ad61821079036aea09f441bf37168c9f |
|
BLAKE2b-256 | 85f0a9f206f4b42c57cd31b3d5a36816bd5bf2b59f22b754ff62e1430743471a |