Data Sentinel is a powerful tool to monitor data pipelines and ensure data quality.
Data Sentinel
What is Data Sentinel?
Data Sentinel is a framework for validating and monitoring data quality in production data pipelines. It provides tools for checking data accuracy, completeness, consistency, and integrity, with native support for PySpark and Pandas dataframes.
Data Sentinel is designed with software engineering best practices to help you create robust, maintainable, and scalable data quality monitoring solutions.
How do I install Data Sentinel?
To install Data Sentinel from the Python Package Index (PyPI) run:
pip install datasentinel
For specific use cases, you can install with optional dependencies:
# For PySpark-focused data validation
pip install "datasentinel[pyspark-checks]"
# For Pandas-focused data validation
pip install "datasentinel[pandas-checks]"
# For a complete installation with all features
pip install "datasentinel[all]"
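To confirm which version ended up in the active environment, a minimal sketch using only the Python standard library might look like the following. It assumes the distribution name is `datasentinel`, as in the pip commands above; the helper name `installed_version` is illustrative and not part of Data Sentinel.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(dist_name="datasentinel"):
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return version(dist_name)
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    found = installed_version()
    if found:
        print(f"datasentinel {found} is installed")
    else:
        print("datasentinel is not installed in this environment")
```

Because it only reads package metadata, this check does not import Data Sentinel or any of its optional dependencies.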
What are the main features of Data Sentinel?
| Feature | Description |
|---|---|
| Data Quality Validations | Execute comprehensive checks to ensure data accuracy, completeness, consistency, and integrity using industry-standard validation rules. |
| Multi-DataFrame Support | Native support for PySpark and Pandas dataframes with consistent APIs. |
| Audit Stores | Comprehensive audit trail logging to multiple destinations including databases and Delta tables. |
| Notifications | Configurable notification system that alerts stakeholders when data quality issues are detected. |
| Validation Results Stores | Store data quality validation results in various formats and destinations for reporting, analysis, and historical tracking. |
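To make the table more concrete, here is a generic sketch of the pattern these features describe: run a check, record the result, and notify when a check fails. It is plain Python and does not use Data Sentinel's API; every name in it (`CheckResult`, `completeness_check`, `uniqueness_check`, `run_checks`, and the list-based store and notifier) is hypothetical.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass
class CheckResult:
    name: str
    passed: bool
    failed_rows: int


def completeness_check(column):
    """Build a check that fails when `column` is missing or None in any row."""
    def check(rows):
        failed = sum(1 for row in rows if row.get(column) is None)
        return CheckResult(f"completeness({column})", failed == 0, failed)
    return check


def uniqueness_check(column):
    """Build a check that fails when any value of `column` appears more than once."""
    def check(rows):
        counts = Counter(row.get(column) for row in rows)  # values must be hashable
        failed = sum(n for n in counts.values() if n > 1)
        return CheckResult(f"uniqueness({column})", failed == 0, failed)
    return check


def run_checks(rows, checks, store, notify):
    """Run every check, append each result to `store`, and call `notify` on failures."""
    results = []
    for check in checks:
        result = check(rows)
        store.append(result)      # stands in for a results store
        if not result.passed:
            notify(result)        # stands in for a notification channel
        results.append(result)
    return results


if __name__ == "__main__":
    rows = [
        {"id": 1, "email": "a@example.com"},
        {"id": 2, "email": None},
        {"id": 2, "email": "c@example.com"},
    ]
    store, alerts = [], []
    run_checks(rows, [completeness_check("email"), uniqueness_check("id")], store, alerts.append)
    for result in store:
        print(result)
```

In a real deployment the list-based store and notifier would be replaced by persistent destinations and alerting channels; the sketch only shows how the pieces relate.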
Why does Data Sentinel exist?
Data quality is critical for successful data-driven organizations, but implementing comprehensive data quality monitoring can be complex and time-consuming. Data Sentinel addresses this by providing:
- A standardized approach to data quality validation across different technologies.
- An extensible architecture that adapts to your specific requirements.
- Built-in best practices for audit logging, notifications, and result management.
Can I contribute?
We welcome contributions to Data Sentinel! Whether you're fixing bugs, adding features, improving documentation, or sharing feedback, your contributions help make Data Sentinel better for everyone.
Check out our contribution guidelines to get started.
Where can I learn more?
- Documentation: [Coming Soon] - Comprehensive guides and API reference
- GitHub Repository: https://github.com/SumzCol/datasentinel
- Issue Tracker: https://github.com/SumzCol/datasentinel/issues
License
Data Sentinel is licensed under the Apache License, Version 2.0.
File details
Details for the file datasentinel-0.1.3-py3-none-any.whl.
File metadata
- Download URL: datasentinel-0.1.3-py3-none-any.whl
- Upload date:
- Size: 60.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.12.9
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | e2573cfcabb39626fa0b477bb128985c7a269fdf162501e24188fd09022effb7 |
| MD5 | 89dd782d3103e6d6ad958aafeabf3bce |
| BLAKE2b-256 | 7c1b4fc4df68149dec4712f915d2f8351953cf1aa0cd447b211ab49775fda50f |
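A downloaded wheel can be checked against the SHA256 digest listed above before installing it. The sketch below uses only `hashlib` from the standard library; the helper name `file_digest` and the assumption that the wheel sits in the current directory are illustrative.

```python
import hashlib
from pathlib import Path

# SHA256 digest published for datasentinel-0.1.3-py3-none-any.whl
EXPECTED_SHA256 = "e2573cfcabb39626fa0b477bb128985c7a269fdf162501e24188fd09022effb7"


def file_digest(path, algorithm="sha256", chunk_size=65536):
    """Hash a file in chunks so large files do not need to fit in memory."""
    digest = hashlib.new(algorithm)
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


if __name__ == "__main__":
    wheel = Path("datasentinel-0.1.3-py3-none-any.whl")  # assumed download location
    if wheel.exists():
        matches = file_digest(wheel) == EXPECTED_SHA256
        print("SHA256 matches" if matches else "SHA256 MISMATCH")
    else:
        print(f"{wheel} not found in the current directory")
```

SHA256 is the digest to rely on here; MD5 is listed for completeness but is not suitable for detecting deliberate tampering.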
Provenance
The following attestation bundles were made for datasentinel-0.1.3-py3-none-any.whl:
- Publisher: create-release.yml on SumzCol/datasentinel
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: datasentinel-0.1.3-py3-none-any.whl
- Subject digest: e2573cfcabb39626fa0b477bb128985c7a269fdf162501e24188fd09022effb7
- Sigstore transparency entry: 450404175
- Sigstore integration time:
- Permalink: SumzCol/datasentinel@58b2266edd1c9cbbc9bc7b6b41524ddb34269228
- Branch / Tag: refs/heads/main
- Owner: https://github.com/SumzCol
- Access: public
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: create-release.yml@58b2266edd1c9cbbc9bc7b6b41524ddb34269228
- Trigger Event: push