A library for data governance and compliance

Data Governance Library

The Data Governance Library is a Python package designed to help organizations maintain data compliance, security, and governance standards. With features tailored to address common regulatory frameworks like GDPR, HIPAA, ISO 27001, and more, this library provides tools for automated compliance checks, metadata management, data lineage tracking, and role-based access control auditing.


Features

  1. Automated Compliance Checks
    • Support for frameworks like GDPR, HIPAA, ISO 27001, and CCPA.
    • Customizable rules for new compliance standards.
  2. Data Lineage Tracking
    • Monitor data flow and identify its origin, transformation, and destination.
  3. Role-Based Access Control (RBAC) Auditing
    • Ensure that data access policies are adhered to.
  4. Metadata Management and Cataloging
    • Store, manage, and query metadata associated with your datasets.
  5. Data Masking and Anonymization
    • Protect sensitive data with masking and anonymization techniques.

Installation

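Install the library via pip. The files published for this release are named data_governance_checkup, which suggests the package is installed as data-governance-checkup rather than data-governance-library:

pip install data-governance-checkup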

Usage

Example: Running Compliance Checks

from data_governance.compliance.gdpr import GDPRCompliance
from data_governance.compliance.hipaa import HIPAACompliance
from data_governance.compliance.iso27001 import ISO27001Compliance

# Sample data to validate
data = {
    "personal_data": "John Doe",
    "PII_encrypted": False,
    "PHI": "Medical Record",
    "PHI_encrypted": False,
    "backup_enabled": False,
    "sensitive_data_access": [{"user": "alice", "logged": False}],
}

# Run GDPR compliance checks
gdpr_violations = GDPRCompliance.check(data)
print("GDPR Violations:", gdpr_violations)

# Run HIPAA compliance checks
hipaa_violations = HIPAACompliance.check(data)
print("HIPAA Violations:", hipaa_violations)

# Run ISO 27001 compliance checks
iso_violations = ISO27001Compliance.check(data)
print("ISO 27001 Violations:", iso_violations)
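
The README does not show what `check` returns or how its rules are defined. As a rough illustration of the rule-based idea only, the sketch below evaluates a similar dictionary against a list of named predicates. `Rule`, `run_checks`, and the two example rules are hypothetical names made up for this sketch; they are not part of the library, and real regulatory requirements are far broader than these two checks.

```python
from dataclasses import dataclass
from typing import Any, Callable, Dict, List


@dataclass(frozen=True)
class Rule:
    """Hypothetical rule: a name plus a predicate that is True when data passes."""
    name: str
    passes: Callable[[Dict[str, Any]], bool]


def run_checks(data: Dict[str, Any], rules: List[Rule]) -> List[str]:
    """Return the names of all rules that the data fails, in rule order."""
    return [rule.name for rule in rules if not rule.passes(data)]


# Illustrative rules only (assumed for this sketch, not taken from the library).
EXAMPLE_RULES = [
    Rule("PII must be encrypted", lambda d: bool(d.get("PII_encrypted", False))),
    Rule(
        "Sensitive data access must be logged",
        lambda d: all(
            entry.get("logged", False)
            for entry in d.get("sensitive_data_access", [])
        ),
    ),
]

sample = {
    "personal_data": "John Doe",
    "PII_encrypted": False,
    "sensitive_data_access": [{"user": "alice", "logged": False}],
}
violations = run_checks(sample, EXAMPLE_RULES)
print("Violations:", violations)
```

Keeping each rule as a small named predicate is one way to support the "customizable rules" feature: a new standard becomes a new list of rules rather than a new code path.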

Data Lineage Tracking

from data_governance.lineage.lineage import DataLineageTracker

tracker = DataLineageTracker()
# Register each dataset together with the source it was derived from
tracker.add_record("dataset1", "source1")
tracker.add_record("dataset2", "dataset1")

# Get lineage for a specific dataset
lineage = tracker.get_lineage("dataset2")
print("Lineage:", lineage)
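
The shape of the value returned by `get_lineage` is not documented here. To make the idea concrete, the following is a minimal sketch of one way lineage can be resolved, assuming each dataset has exactly one upstream source. `SimpleLineage` is a hypothetical stand-in written for this example, not the library's `DataLineageTracker`.

```python
from typing import Dict, List


class SimpleLineage:
    """Hypothetical single-parent lineage store (illustration only)."""

    def __init__(self) -> None:
        # Maps a dataset name to the name of the source it was derived from.
        self._source_of: Dict[str, str] = {}

    def add_record(self, dataset: str, source: str) -> None:
        self._source_of[dataset] = source

    def get_lineage(self, dataset: str) -> List[str]:
        """Walk upstream from `dataset` to its root, guarding against cycles."""
        chain = [dataset]
        seen = {dataset}
        current = dataset
        while current in self._source_of:
            current = self._source_of[current]
            if current in seen:
                raise ValueError(f"cycle detected at {current!r}")
            chain.append(current)
            seen.add(current)
        return chain


lineage_store = SimpleLineage()
lineage_store.add_record("dataset1", "source1")
lineage_store.add_record("dataset2", "dataset1")
chain = lineage_store.get_lineage("dataset2")
print(" <- ".join(chain))  # dataset2 <- dataset1 <- source1
```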

Metadata Management

from data_governance.metadata.metadata_manager import MetadataManager

manager = MetadataManager()
# Attach descriptive metadata to a dataset, then read it back by name
manager.add_metadata("dataset1", {"owner": "Alice", "description": "Sales data"})
metadata = manager.get_metadata("dataset1")
print("Metadata:", metadata)

Role-Based Access Control (RBAC) Auditing

from data_governance.rbac.rbac_audit import RBACAuditor

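auditor = RBACAuditor()
# Grant Alice the "read" and "write" permissions
auditor.add_role("Alice", ["read", "write"])
# Audit an action that is outside the granted permissions
violations = auditor.audit_access("Alice", "delete")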
print("RBAC Violations:", violations)
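
What `audit_access` returns is not specified in this README. As a hedged sketch of the underlying check, the example below stores a set of granted permissions per principal and reports a violation when a requested action is not in that set. `SimpleRBAC` and its message format are assumptions made for illustration, not the library's `RBACAuditor`.

```python
from typing import Dict, Iterable, List, Set


class SimpleRBAC:
    """Hypothetical permission store used to illustrate an access audit."""

    def __init__(self) -> None:
        self._granted: Dict[str, Set[str]] = {}

    def add_role(self, principal: str, permissions: Iterable[str]) -> None:
        # Merge new permissions into whatever the principal already holds.
        self._granted.setdefault(principal, set()).update(permissions)

    def audit_access(self, principal: str, action: str) -> List[str]:
        """Return an empty list if the action is granted, else one violation."""
        if action in self._granted.get(principal, set()):
            return []
        return [f"{principal} is not permitted to {action}"]


rbac = SimpleRBAC()
rbac.add_role("Alice", ["read", "write"])
allowed = rbac.audit_access("Alice", "read")
denied = rbac.audit_access("Alice", "delete")
print("Allowed:", allowed)
print("Denied:", denied)
```

Unknown principals are treated as having no permissions, so every action they request is reported; this deny-by-default choice is an assumption of the sketch.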

Data Masking and Anonymization

from data_governance.masking.anonymizer import DataAnonymizer

anonymizer = DataAnonymizer()
# Mask a value; the second argument names the kind of data being masked
masked_data = anonymizer.mask("123-45-6789", "SSN")
print("Masked Data:", masked_data)
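
The masked format produced by `DataAnonymizer.mask` is not shown in this README. For illustration, here is a minimal sketch of one common masking scheme, which hides all but the last four digits and keeps the separators. `mask_digits` and its parameters are hypothetical and do not describe the library's behavior.

```python
def mask_digits(value: str, visible: int = 4, mask_char: str = "*") -> str:
    """Replace every digit except the last `visible` ones; keep separators."""
    total_digits = sum(ch.isdigit() for ch in value)
    remaining_to_mask = max(total_digits - visible, 0)
    masked = []
    for ch in value:
        if ch.isdigit() and remaining_to_mask > 0:
            masked.append(mask_char)
            remaining_to_mask -= 1
        else:
            masked.append(ch)
    return "".join(masked)


masked_ssn = mask_digits("123-45-6789")
print(masked_ssn)  # ***-**-6789
```

Masking of this kind only hides a value on display. It is not anonymization in the regulatory sense, since the visible digits and the surrounding record may still identify a person.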

Contributing

Contributions are welcome! Please submit pull requests or open issues for any enhancements, bugs, or additional compliance frameworks you'd like to see.


License

This project is licensed under the MIT License. See the LICENSE file for more details.


Roadmap

  1. Add support for more compliance standards (e.g., SOC 2, PCI DSS).
  2. Build visualization dashboards for compliance status.
  3. Integrate with real-time data pipelines for live compliance checks.

Contact

For questions or support, please contact pratik.lahudkar@gmail or open an issue on the GitHub repository.

Download files

Source Distribution

data_governance_checkup-0.1.0.tar.gz (3.1 kB, source)

Built Distribution

data_governance_checkup-0.1.0-py3-none-any.whl (3.4 kB, Python 3)

File details

Details for the file data_governance_checkup-0.1.0.tar.gz.

File metadata

  • Download URL: data_governance_checkup-0.1.0.tar.gz
  • Size: 3.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.0.1 CPython/3.11.9

File hashes

Hashes for data_governance_checkup-0.1.0.tar.gz

Algorithm    Hash digest
SHA256       157430a4e09a2c7a8b618e6d4e9c88f80206e16a270cf0db7f8a8f536b8beea0
MD5          0802e88fe30193a66225ef1c0c9e51fa
BLAKE2b-256  9ff7632e0e4c4a81631240b19d03975387fbf41f27dfd08b7c0184eaa9eddded

File details

Details for the file data_governance_checkup-0.1.0-py3-none-any.whl.

File hashes

Hashes for data_governance_checkup-0.1.0-py3-none-any.whl

Algorithm    Hash digest
SHA256       fc0f6d458374dd910a25291157e4b16b0b07bb58c351338a5bfdf4ac43b59b61
MD5          2776f63e5821d913eda7e397195f98da
BLAKE2b-256  3db4ac1bf8acee487f3cb3a3837d67ae140e12250872214d04f0da43bb2168e2
