Skip to main content

spark_gaps_date_rorc_tools

Project description

spark_gaps_date_rorc_tools

Github License Updates Python 3 Code coverage

spark_gaps_date_rorc_tools is a Python library that implements get gaps dates

Installation

The code is packaged for PyPI, so that the installation consists in running:

pip install spark-gaps-date-rorc-tools 

Usage

wrapper take gaps dates

config.yaml
===========
  conf-rorc:
    t_psan_test:
      table_path: "/data/master/psan/data/t_psan_test/"
      supplies : [
          "/data/master/psan/data/t_ksag_test/",
          "/data/master/psan/data/t_psan_test/"
      ]
    t_kctk_cust_rating_atrb:
      table_path: ""
       supplies : []



example1: file.py
=================
from spark_gaps_date_rorc_tools import show_gaps_date
df_pivot = show_gaps_date(spark=spark,
                          config_path_name="config.yaml",
                          hdfs_uri="hdfs://pedaaslive.scmx2p100.isi",
                          filter_date_initial="202101",
                          filter_date_final="202112")
df_pivot.head()



example2: file.py
=================
from spark_gaps_date_rorc_tools import show_gaps_date
df_pivot = show_gaps_date(spark=spark,
                          config_path_name="config.yaml",
                          table_rorc=["t_psan_xxx"]
                          hdfs_uri="hdfs://pedaaslive.scmx2p100.isi",
                          filter_date_initial="202101",
                          filter_date_final="202112")

df_pivot.head()


Style Dataframe (Spark): file.py
=========================
import pyspark
from spark_gaps_date_rorc_tools import show_spark_df
pyspark.sql.dataframe.DataFrame.show2 = show_spark_df

df_pivot = show_gaps_date(spark=spark,
                          config_path_name="config.yaml",
                          table_rorc=["t_psan_xxx"]
                          hdfs_uri="hdfs://pedaaslive.scmx2p100.isi",
                          filter_date_initial="202101",
                          filter_date_final="202112")

df_pivot.show2()



Style Dataframe (Pandas): file.py
=========================
import pandas as pd
from spark_gaps_date_rorc_tools import show_pd_df
pd.DataFrame.show2 = show_pd_df

df_pivot = show_gaps_date(spark=spark,
                          config_path_name="config.yaml",
                          table_rorc=["t_psan_xxx"]
                          hdfs_uri="hdfs://pedaaslive.scmx2p100.isi",
                          filter_date_initial="202101",
                          filter_date_final="202112")

df_pivot2 = df_pivot.toPandas()                      
df_pivot2.show2()

License

Apache License 2.0.

New features v1.0

BugFix

  • choco install visualcpp-build-tools

Reference

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

spark_gaps_date_rorc_tools-0.0.13.tar.gz (7.5 kB view details)

Uploaded Source

Built Distribution

File details

Details for the file spark_gaps_date_rorc_tools-0.0.13.tar.gz.

File metadata

  • Download URL: spark_gaps_date_rorc_tools-0.0.13.tar.gz
  • Upload date:
  • Size: 7.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.6.0 importlib_metadata/4.8.2 pkginfo/1.8.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.7.9

File hashes

Hashes for spark_gaps_date_rorc_tools-0.0.13.tar.gz
Algorithm Hash digest
SHA256 138571729993ad2c6bc7914acd26b16f80177f6c076ad49f1a779d92c7b95e5f
MD5 c1c2aea186a2ec95583424e389d52378
BLAKE2b-256 d279177d1947bd5ea25fc7dc5428e3b6ef0c2f148383cc9f1af34544d235e5ee

See more details on using hashes here.

File details

Details for the file spark_gaps_date_rorc_tools-0.0.13-py3-none-any.whl.

File metadata

  • Download URL: spark_gaps_date_rorc_tools-0.0.13-py3-none-any.whl
  • Upload date:
  • Size: 9.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.6.0 importlib_metadata/4.8.2 pkginfo/1.8.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.7.9

File hashes

Hashes for spark_gaps_date_rorc_tools-0.0.13-py3-none-any.whl
Algorithm Hash digest
SHA256 fa4d4dd8cb2d9acfe5a094100c8b7e7bf165d28944b87925f71d3a9fc9e9a88d
MD5 4acd962a3d0e7409c9da9a738b9cdd8f
BLAKE2b-256 26f5e6131a6b9bac1fd9b7690500a6399aa1c2fe896f507eb5e61b268900482f

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page