Skip to main content

Data Sentinel is a powerful tool to monitor data pipelines and ensure data quality.

Project description

Data Sentinel

Python version PyPI version License

What is Data Sentinel?

Data Sentinel is a powerful framework for data quality validation and monitoring in production data pipelines. It provides a comprehensive suite of tools to ensure data accuracy, completeness, consistency, and integrity with native support for PySpark and Pandas dataframes.

Data Sentinel is designed with software engineering best practices to help you create robust, maintainable, and scalable data quality monitoring solutions.

How do I install Data Sentinel?

To install Data Sentinel from the Python Package Index (PyPI) run:

pip install datasentinel

For specific use cases, you can install with optional dependencies:

# For PySpark focused data validation
pip install datasentinel[pyspark-checks]

# For Pandas focused data validation
pip install datasentinel[pandas-checks]

# For complete installation with all features
pip install datasentinel[all]

What are the main features of Data Sentinel?

Feature What is this?
Data Quality Validations Execute comprehensive checks to ensure data accuracy, completeness, consistency, and integrity using industry-standard validation rules.
Multi-DataFrame Support Native support for PySpark and Pandas dataframes with consistent APIs.
Audit Stores Comprehensive audit trail logging to multiple destinations including databases and Delta tables.
Notifications Configurable notification system that alerts stakeholders when data quality issues are detected.
Validation Results Stores Store data quality validation results in various formats and destinations for reporting, analysis, and historical tracking.

Why does Data Sentinel exist?

Data quality is critical for successful data-driven organizations, but implementing comprehensive data quality monitoring can be complex and time-consuming. Data Sentinel addresses this by providing:

  • Standardized approach to data quality validation across different technologies.
  • Extensible architecture that adapts to your specific requirements.
  • Best practices built-in for audit logging, notifications, and result management.

Can I contribute?

We welcome contributions to Data Sentinel! Whether you're fixing bugs, adding features, improving documentation, or sharing feedback, your contributions help make Data Sentinel better for everyone.

Check out our contribution guidelines to get started.

Where can I learn more?

License

Data Sentinel is licensed under the Apache Software License (Apache 2.0).

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

datasentinel-0.2.0-py3-none-any.whl (59.9 kB view details)

Uploaded Python 3

File details

Details for the file datasentinel-0.2.0-py3-none-any.whl.

File metadata

  • Download URL: datasentinel-0.2.0-py3-none-any.whl
  • Upload date:
  • Size: 59.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for datasentinel-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 095036fb49b714bcdc9ead86f01825b1ef037fe6a55df270e62c5a350eeed1c3
MD5 1b3ec0cb68b8115dd9d17b18e904517e
BLAKE2b-256 fe847b203824cd13dae1e9ff6e3247e34db81845c088072b70cf06952e502dc8

See more details on using hashes here.

Provenance

The following attestation bundles were made for datasentinel-0.2.0-py3-none-any.whl:

Publisher: create-release.yml on SumzCol/datasentinel

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page