Skip to main content

Tokern Data Catalog

Project description

dbcat PyPI image image

Data Catalog for Databases and Data Warehouses

Overview

dbcat scans and maintains metadata from all your databases and data warehouses. dbcat also stores metadata generated by other data governance applications such as PIICatcher and Lineage Engine. dbcat is typically used alongside other applications. It can also be used stand-alone to generate a very simple data catalog using the CLI or API.

dbcat stores the catalog in a Postgresql or SQLite database. By default, the catalog is stored in a SQLite database in ~/.config/tokern/catalog.db

The catalog can be exported to Datahub or Amundsen. This is very useful to export PII tags or column lineage generated by PIICatcher or Lineage Engine. Check documentation for detailed instructions to set PII tags and column-level lineage.

Quick Start

dbcat is distributed as a python application.

python3 -m venv .env
source .env/bin/activate
pip install dbcat

dbcat catalog add-sqlite --name sample --path <path to sqlite db>
dbcat catalog scan --source-name sample

Documentation

For advanced usage refer documentation Catalog Documentation.

Supported Technologies

The following databases are supported:

  • MySQL/Mariadb
  • PostgreSQL
  • AWS Redshift
  • BigQuery
  • Snowflake
  • AWS Athena

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Files for dbcat, version 0.12.0
Filename, size File type Python version Upload date Hashes
Filename, size dbcat-0.12.0.tar.gz (25.2 kB) File type Source Python version None Upload date Hashes View
Filename, size dbcat-0.12.0-py3-none-any.whl (31.6 kB) File type Wheel Python version py3 Upload date Hashes View

Supported by

AWS AWS Cloud computing Datadog Datadog Monitoring Facebook / Instagram Facebook / Instagram PSF Sponsor Fastly Fastly CDN Google Google Object Storage and Download Analytics Huawei Huawei PSF Sponsor Microsoft Microsoft PSF Sponsor NVIDIA NVIDIA PSF Sponsor Pingdom Pingdom Monitoring Salesforce Salesforce PSF Sponsor Sentry Sentry Error logging StatusPage StatusPage Status page