Open Source Data Catalog For Snowflake, BigQuery, AWS Redshift and AWS Glue
Data Catalog for Databases and Data Warehouses
dbcat builds and maintains metadata from all your databases and data warehouses. dbcat is simple to use and maintain. Build a data catalog in minutes by providing credentials using a command line application or API. Automate collection of metadata using cron or other workflow automation tools.
dbcat stores the catalog in a Postgresql database. Use cloud hosting platforms to ease operations in maintaining the catalog in a Postgresql database.
Access the catalog using raw sql or the python APIs provided by dbcat in your python application.
dbcat is distributed as a python application.
python3 -m venv .env source .env/bin/activate pip install piicatcher # configure the application dbcat -c <config dir> pull
The following databases are supported:
- AWS Redshift
- AWS Glue
Release history Release notifications | RSS feed
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.