spark_gaps_date_rorc_tools
Project description
spark_gaps_date_rorc_tools
spark_gaps_date_rorc_tools is a Python library that implements get gaps dates
Installation
The code is packaged for PyPI, so that the installation consists in running:
pip install spark-gaps-date-rorc-tools
Usage
wrapper take gaps dates
config.yaml
===========
conf-rorc:
t_psan_test:
table_path: "/data/master/psan/data/t_psan_test/"
supplies : [
"/data/master/psan/data/t_ksag_test/",
"/data/master/psan/data/t_psan_test/"
]
t_kctk_cust_rating_atrb:
table_path: ""
supplies : []
example1: file.py
=================
from spark_gaps_date_rorc_tools import show_gaps_date
df_pivot = show_gaps_date(spark=spark,
config_path_name="config.yaml",
hdfs_uri="hdfs://pedaaslive.scmx2p100.isi",
filter_date_initial="202101",
filter_date_final="202112")
df_pivot.head()
example2: file.py
=================
from spark_gaps_date_rorc_tools import show_gaps_date
df_pivot = show_gaps_date(spark=spark,
config_path_name="config.yaml",
table_rorc=["t_psan_xxx"]
hdfs_uri="hdfs://pedaaslive.scmx2p100.isi",
filter_date_initial="202101",
filter_date_final="202112")
df_pivot.head()
Style Dataframe (Spark): file.py
=========================
import pyspark
from spark_gaps_date_rorc_tools import show_spark_df
pyspark.sql.dataframe.DataFrame.show2 = show_spark_df
df_pivot = show_gaps_date(spark=spark,
config_path_name="config.yaml",
table_rorc=["t_psan_xxx"]
hdfs_uri="hdfs://pedaaslive.scmx2p100.isi",
filter_date_initial="202101",
filter_date_final="202112")
df_pivot.show2()
Style Dataframe (Pandas): file.py
=========================
import pandas as pd
from spark_gaps_date_rorc_tools import show_pd_df
pd.DataFrame.show2 = show_pd_df
df_pivot = show_gaps_date(spark=spark,
config_path_name="config.yaml",
table_rorc=["t_psan_xxx"]
hdfs_uri="hdfs://pedaaslive.scmx2p100.isi",
filter_date_initial="202101",
filter_date_final="202112")
df_pivot2 = df_pivot.toPandas()
df_pivot2.show2()
License
New features v1.0
BugFix
- choco install visualcpp-build-tools
Reference
- Jonathan Quiza github.
- Jonathan Quiza RumiMLSpark.
- Jonathan Quiza linkedin.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Close
Hashes for spark_gaps_date_rorc_tools-0.0.15.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | 515cc84cb9bad71cf0d50ded7da530cd7cf13a82a96ca6f4ee850abbc8eee7e0 |
|
MD5 | 9188e38658f8f42a847ccb6eb6be0b09 |
|
BLAKE2b-256 | 027aa82bd113ab5c0094eee37587667a20a4cf55d3a4e5444aee3da2560ee6c3 |
Close
Hashes for spark_gaps_date_rorc_tools-0.0.15-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | f1754712500dce1a6b7a108bbfb97ae962cc00d8196df5e8b00ce00bea327302 |
|
MD5 | d48be9c2413b926a83fb1b8977355c49 |
|
BLAKE2b-256 | 7ced4e1c2ff94fba455463d47ad87e110ec8041092d24bd4b97fc7eec8b0c351 |