spark_dql_tools
spark_dql_mvp_tools
spark_dql_mvp_tools is a Python library that implements data quality rules in the sandbox.
Installation
The code is packaged for PyPI, so installation consists of running the pip command given in the Sandbox section below.
Usage
A wrapper that creates Hammurabi MVP rules.
Sandbox
Installation
!yes | pip uninstall spark-dql-mvp-tools
!pip install spark-dql-mvp-tools --user --upgrade
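After reinstalling in a notebook, it can help to confirm which version is actually visible to the interpreter. The snippet below is a minimal sketch using only the standard library; the helper name installed_version is hypothetical, and the distribution name is assumed to match the one used in the pip uninstall command above.

```python
from importlib import metadata
from typing import Optional


def installed_version(dist_name: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


# Assumption: the distribution is registered under this name.
version = installed_version("spark-dql-mvp-tools")
print(version if version else "package not installed")
```

If the helper returns None right after a --user install, restarting the notebook kernel is usually the first thing to try.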
Imports
import os
import pyspark
from pyspark.sql import functions as func
from spark_generated_rules_tools import dq_path_workspace
from spark_generated_rules_tools import dq_generated_mvp
import spark_dataframe_tools
Variables
user_sandbox="P030772"
Creating Workspace
dq_path_workspace(user_sandbox=user_sandbox)
Run
table_raw_name = 't_klau_moe_adj_id_mthly_info'
table_master_name = 't_pmfi_moe_adj_id_mthly_info'
periodicity = 'Daily'
target_staging_path = '/in/staging/datax/klau/my_file_{?YEAR_MONTH}.csv'
is_uuaa_tag = False
dq_generated_mvp(
    table_master_name=table_master_name,
    table_raw_name=table_raw_name,
    periodicity=periodicity,
    target_staging_path=target_staging_path,
    is_uuaa_tag=is_uuaa_tag,
)
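When several tables need rules, the same call can be driven from a list of parameter sets. The following is a sketch only: run_mvp_batch and REQUIRED_KEYS are hypothetical names that are not part of the library, and the generator is passed in as a callable so the example runs without Spark. The only thing assumed about dq_generated_mvp is the set of keyword arguments shown in the call above.

```python
from typing import Callable, Dict, List

# Keyword arguments used in the dq_generated_mvp call shown above.
REQUIRED_KEYS = {
    "table_master_name",
    "table_raw_name",
    "periodicity",
    "target_staging_path",
    "is_uuaa_tag",
}


def run_mvp_batch(generate: Callable[..., object],
                  jobs: List[Dict[str, object]]) -> List[str]:
    """Call `generate` once per job; return the master table names processed."""
    processed = []
    for job in jobs:
        missing = REQUIRED_KEYS - job.keys()
        if missing:
            raise ValueError(f"missing parameters: {sorted(missing)}")
        generate(**job)
        processed.append(str(job["table_master_name"]))
    return processed


# Stand-in for dq_generated_mvp so the sketch runs outside the sandbox.
def fake_generate(**kwargs):
    print("would generate rules for", kwargs["table_master_name"])


jobs = [{
    "table_master_name": "t_pmfi_moe_adj_id_mthly_info",
    "table_raw_name": "t_klau_moe_adj_id_mthly_info",
    "periodicity": "Daily",
    "target_staging_path": "/in/staging/datax/klau/my_file_{?YEAR_MONTH}.csv",
    "is_uuaa_tag": False,
}]
run_mvp_batch(fake_generate, jobs)
```

In the sandbox, the real function would be passed instead of the stand-in: run_mvp_batch(dq_generated_mvp, jobs).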
License
New features v1.0
BugFix
- choco install visualcpp-build-tools
Reference
- Jonathan Quiza github.
- Jonathan Quiza RumiMLSpark.