spark_dql_mvp_tools
Project description
spark_dql_mvp_tools
spark_dql_mvp_tools is a Python library that implements quality rules in sandbox
Installation
The code is packaged for PyPI, so that the installation consists in running:
Usage
wrapper create hammurabies MVP
Sandbox
Installation
!yes| pip uninstall spark-dql-mvp-tools
pip install spark-dql-mvptools --user --upgrade
IMPORTS
import os
import pyspark
from pyspark.sql import functions as func
from spark_generated_rules_tools import dq_path_workspace
from spark_generated_rules_tools import dq_generated_mvp
import spark_dataframe_tools
Variables
user_sandbox="P030772"
Creating Workspace
dq_path_workspace(user_sandbox=user_sandbox)
Run
table_raw_name = 't_klau_moe_adj_id_mthly_info'
table_master_name = 't_pmfi_moe_adj_id_mthly_info'
periodicity = 'Daily'
target_staging_path = '/in/staging/datax/klau/my_file_{?YEAR_MONTH}.csv'
is_uuaa_tag = False
dq_generated_mvp(table_master_name=table_master_name,
table_raw_name=table_raw_name,
periodicity=periodicity,
target_staging_path=target_staging_path,
is_uuaa_tag=is_uuaa_tag)
License
New features v1.0
BugFix
- choco install visualcpp-build-tools
Reference
- Jonathan Quiza github.
- Jonathan Quiza RumiMLSpark.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
spark_dql_mvp_tools-0.9.8.tar.gz
(12.4 kB
view details)
Built Distribution
File details
Details for the file spark_dql_mvp_tools-0.9.8.tar.gz
.
File metadata
- Download URL: spark_dql_mvp_tools-0.9.8.tar.gz
- Upload date:
- Size: 12.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.9.1
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | bc0781752e6516cbfe1f09cba693795d955e5c2fee8b2583c0e96f598d5a057a |
|
MD5 | b3bbbb8b528afb3355830ef7835807d7 |
|
BLAKE2b-256 | 6981a2903022079ad5ae2e03b1fd448440f1e30285fadf2a69407cb4363d6483 |
File details
Details for the file spark_dql_mvp_tools-0.9.8-py3-none-any.whl
.
File metadata
- Download URL: spark_dql_mvp_tools-0.9.8-py3-none-any.whl
- Upload date:
- Size: 12.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.9.1
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | c076f807335bd8f9824a700c6317ca33ae5119b0f7fd751cb30dab4334d5a8c5 |
|
MD5 | 432a83cfdb3b2784f0cb29ac85750652 |
|
BLAKE2b-256 | 258f54c4804b4fcf4eb9f521bd9a8978b80ab87cf3f289ab563c5c7eb3519574 |