
A PySpark package to check missing dates in tables


Date Validator PySpark

A PySpark utility for validating date completeness in tables.

Installation

pip install pyspark-date-validator

Example Usage

Validating Multiple Tables with MultipleTablesValidator

Run checks across multiple tables in one call. The validator prints the missing dates for each table and raises a ValueError if any table has gaps.

from date_validator import MultipleTablesValidator

# Define one configuration per table: table name, date range, and expected frequency
table_configs = [
    {"table_name": "table1", "start_date": "2023-01-01", "end_date": "2023-01-05", "frequency": "daily"},
    {"table_name": "table2", "start_date": "2023-01-01", "end_date": "2023-03-31", "frequency": "monthly"}
]

# Run validation; run_checks() raises ValueError if any table has missing dates
validator = MultipleTablesValidator(table_configs)
validator.run_checks()
# Example output:
# Missing dates in table 'table1' (daily):
# +----------+
# |  as_at_dt|
# +----------+
# |2023-01-02|
# |2023-01-04|
# |2023-01-05|
# +----------+
# Missing dates in table 'table2' (monthly):
# +----------+
# |  as_at_dt|
# +----------+
# |2023-02-28|
# +----------+
# ValueError: Missing dates detected in one or more tables.
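
The package's internals are not shown here, so the following is only a plain-Python sketch of the idea behind the check, not the library's implementation. It assumes that "daily" means one row per calendar day and that "monthly" means one row per month-end date (inferred from the 2023-02-28 entry in the example output). The helper names expected_dates and missing_dates, and the sample table contents, are hypothetical.

```python
import calendar
from datetime import date, timedelta


def expected_dates(start: date, end: date, frequency: str) -> list[date]:
    """Return the dates a complete table would contain between start and end (inclusive)."""
    if frequency == "daily":
        return [start + timedelta(days=i) for i in range((end - start).days + 1)]
    if frequency == "monthly":
        # Assumption: monthly tables hold one row per month-end date.
        dates = []
        year, month = start.year, start.month
        while (year, month) <= (end.year, end.month):
            last_day = calendar.monthrange(year, month)[1]
            month_end = date(year, month, last_day)
            if start <= month_end <= end:
                dates.append(month_end)
            # Advance to the next month, rolling over the year after December.
            year, month = (year + 1, 1) if month == 12 else (year, month + 1)
        return dates
    raise ValueError(f"Unsupported frequency: {frequency!r}")


def missing_dates(present: set[date], start: date, end: date, frequency: str) -> list[date]:
    """Return the expected dates that are absent from the set of dates present in a table."""
    return [d for d in expected_dates(start, end, frequency) if d not in present]


# Hypothetical table contents, chosen to reproduce the gaps in the example output.
table1_dates = {date(2023, 1, 1), date(2023, 1, 3)}
table2_dates = {date(2023, 1, 31), date(2023, 3, 31)}

print(missing_dates(table1_dates, date(2023, 1, 1), date(2023, 1, 5), "daily"))
print(missing_dates(table2_dates, date(2023, 1, 1), date(2023, 3, 31), "monthly"))
```

In a Spark setting the same comparison would typically run as a join between the generated date range and the table's distinct dates; the set-based version above only illustrates the logic on small inputs.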

Download files

Version 0.1.5 is available as a source distribution, pyspark_date_validator-0.1.5.tar.gz (3.0 kB), and as a built distribution, pyspark_date_validator-0.1.5-py3-none-any.whl (3.4 kB, Python 3).

