spark_gaps_date_rorc_tools
Project description
spark_gaps_date_rorc_tools
spark_gaps_date_rorc_tools is a Python library that implements get gaps dates
Installation
The code is packaged for PyPI, so that the installation consists in running:
pip install spark-gaps-date-rorc-tools
Usage
wrapper take gaps dates
config.yaml
===========
conf-rorc:
t_psan_test:
table_path: "/data/master/psan/data/t_psan_test/"
supplies : [
"/data/master/psan/data/t_ksag_test/",
"/data/master/psan/data/t_psan_test/"
]
t_kctk_cust_rating_atrb:
table_path: ""
supplies : []
example1: file.py
=================
from spark_gaps_date_rorc_tools import show_gaps_date
df_pivot = show_gaps_date(spark=spark,
config_path_name="config.yaml",
hdfs_uri="hdfs://pedaaslive.scmx2p100.isi",
filter_date_initial="202101",
filter_date_final="202112")
df_pivot.head()
example2: file.py
=================
from spark_gaps_date_rorc_tools import show_gaps_date
df_pivot = show_gaps_date(spark=spark,
config_path_name="config.yaml",
table_rorc=["t_psan_xxx"]
hdfs_uri="hdfs://pedaaslive.scmx2p100.isi",
filter_date_initial="202101",
filter_date_final="202112")
df_pivot.head()
Style Dataframe (Spark): file.py
=========================
import pyspark
from spark_gaps_date_rorc_tools import show_spark_df
pyspark.sql.dataframe.DataFrame.show2 = show_spark_df
df_pivot = show_gaps_date(spark=spark,
config_path_name="config.yaml",
table_rorc=["t_psan_xxx"]
hdfs_uri="hdfs://pedaaslive.scmx2p100.isi",
filter_date_initial="202101",
filter_date_final="202112")
df_pivot.show2()
Style Dataframe (Pandas): file.py
=========================
import pandas as pd
from spark_gaps_date_rorc_tools import show_pd_df
pd.DataFrame.show2 = show_pd_df
df_pivot = show_gaps_date(spark=spark,
config_path_name="config.yaml",
table_rorc=["t_psan_xxx"]
hdfs_uri="hdfs://pedaaslive.scmx2p100.isi",
filter_date_initial="202101",
filter_date_final="202112")
df_pivot2 = df_pivot.toPandas()
df_pivot2.show2()
License
New features v1.0
BugFix
- choco install visualcpp-build-tools
Reference
- Jonathan Quiza github.
- Jonathan Quiza RumiMLSpark.
- Jonathan Quiza linkedin.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Close
Hashes for spark_gaps_date_rorc_tools-0.0.13.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | 138571729993ad2c6bc7914acd26b16f80177f6c076ad49f1a779d92c7b95e5f |
|
MD5 | c1c2aea186a2ec95583424e389d52378 |
|
BLAKE2b-256 | d279177d1947bd5ea25fc7dc5428e3b6ef0c2f148383cc9f1af34544d235e5ee |
Close
Hashes for spark_gaps_date_rorc_tools-0.0.13-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | fa4d4dd8cb2d9acfe5a094100c8b7e7bf165d28944b87925f71d3a9fc9e9a88d |
|
MD5 | 4acd962a3d0e7409c9da9a738b9cdd8f |
|
BLAKE2b-256 | 26f5e6131a6b9bac1fd9b7690500a6399aa1c2fe896f507eb5e61b268900482f |