Skip to main content

A library for data governance and compliance

Project description

Data Governance Library

The Data Governance Library is a Python package designed to help organizations maintain data compliance, security, and governance standards. With features tailored to address common regulatory frameworks like GDPR, HIPAA, ISO 27001, and more, this library provides tools for automated compliance checks, metadata management, data lineage tracking, and role-based access control auditing.


Features

  1. Automated Compliance Checks

    • Support for frameworks like GDPR, HIPAA, ISO 27001, and CCPA.
    • Customizable rules for new compliance standards.
  2. Data Lineage Tracking

    • Monitor data flow and identify its origin, transformation, and destination.
  3. Role-Based Access Control (RBAC) Auditing

    • Ensure that data access policies are adhered to.
  4. Metadata Management and Cataloging

    • Store, manage, and query metadata associated with your datasets.
  5. Data Masking and Anonymization

    • Protect sensitive data with masking and anonymization techniques.

Installation

Install the library via pip:

pip install data_governance_checkup

Usage

Example: Running Compliance Checks

from data_governance_checkup.compliance.gdpr import GDPRCompliance
from data_governance_checkup.compliance.hipaa import HIPAACompliance
from data_governance_checkup.compliance.iso27001 import ISO27001Compliance
from data_governance_checkup.compliance.ccpa import CCPACompliance

# Sample data
data = {
    "personal_data": "John Doe",
    "PHI": "Medical Record",
    "PHI_encrypted": False,
    "backup_enabled": False,
    "sensitive_data_access": [{"user": "alice", "logged": False}],
    "data_sold": True,
    "consumer_consent": False,
}

# Run compliance checks
gdpr_violations = GDPRCompliance.check(data)
hipaa_violations = HIPAACompliance.check(data)
iso_violations = ISO27001Compliance.check(data)
ccpa_violations = CCPACompliance.check(data)

# Print results
print("GDPR Violations:", gdpr_violations)
print("HIPAA Violations:", hipaa_violations)
print("ISO 27001 Violations:", iso_violations)
print("CCPA Violations:", ccpa_violations)

Data Lineage Tracking

from data_governance_checkup.lineage.lineage import DataLineageTracker

tracker = DataLineageTracker()
tracker.add_record("dataset1", "source1")
tracker.add_record("dataset2", "dataset1")

# Get lineage for a specific dataset
lineage = tracker.get_lineage("dataset2")
print("Lineage:", lineage)

Metadata Management

from data_governance_checkup.metadata.metadata_manager import MetadataManager

manager = MetadataManager()
manager.add_metadata("dataset1", {"owner": "Alice", "description": "Sales data"})
metadata = manager.get_metadata("dataset1")
print("Metadata:", metadata)

Role-Based Access Control (RBAC) Auditing

from data_governance_checkup.rbac.rbac_audit import RBACAuditor

auditor = RBACAuditor()
auditor.add_role("Alice", ["read", "write"])
violations = auditor.audit_access("Alice", "delete")
print("RBAC Violations:", violations)

Data Masking and Anonymization

from data_governance_checkup.masking.anonymizer import DataAnonymizer

anonymizer = DataAnonymizer()
masked_data = anonymizer.mask("123-45-6789", "SSN")
print("Masked Data:", masked_data)

Contributing

Contributions are welcome! Please submit pull requests or open issues for any enhancements, bugs, or additional compliance frameworks you'd like to see.


License

This project is licensed under the MIT License. See the LICENSE file for more details.


Roadmap

  1. Add support for more compliance standards (e.g., SOC 2, PCI DSS).
  2. Build visualization dashboards for compliance status.
  3. Integrate with real-time data pipelines for live compliance checks.

Contact

For questions or support, please contact pratik.lahudkar@gmail or open an issue on the GitHub repository.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

data_governance_checkup-0.1.1.tar.gz (6.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

data_governance_checkup-0.1.1-py3-none-any.whl (8.4 kB view details)

Uploaded Python 3

File details

Details for the file data_governance_checkup-0.1.1.tar.gz.

File metadata

  • Download URL: data_governance_checkup-0.1.1.tar.gz
  • Upload date:
  • Size: 6.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.0.1 CPython/3.11.9

File hashes

Hashes for data_governance_checkup-0.1.1.tar.gz
Algorithm Hash digest
SHA256 ed3538d0b6e8ac2562f0cfe6a146cb9125e5ab9027f94d41f1ae2bd1cad91d18
MD5 ddb388831cf43da6c0e0cf189ddd27e3
BLAKE2b-256 24fd2c88e23f2ad86f220a2415cdef5021164db2fe4e7823ecdb6524756c9a3d

See more details on using hashes here.

File details

Details for the file data_governance_checkup-0.1.1-py3-none-any.whl.

File metadata

File hashes

Hashes for data_governance_checkup-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 5b52f6e8df94c1c005635ff445e72cd5da185ca76cf6489d8219c8b4d12c9681
MD5 6946670680ee7ee1b33c0566cd3d75d2
BLAKE2b-256 a3ef1334f1100c1a85908542575e34d42aa4257daba6fbb6a82288c898b3f80e

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page