Generate SVG Entity Relationship Diagrams from PostgreSQL database dumps

Python ERD Generator from Postgres Schema Dump File

pypgsvg is an open-source Python application that parses PostgreSQL schema SQL dump files and generates Directed Entity Relationship Diagrams (ERDs) using Graphviz. It includes features such as overview controls, edge highlighting, and introspection to display the SQL used to generate table nodes and edges, making it easier to debug PostgreSQL data structures.


In a past life I was tasked with showing what a normalized PostgreSQL database looks like for employers who did not want to pay for Postgres tools with graphical features. There are certainly usable free ones out there, but at the time they required installing Java. So admittedly this started as an academic exercise. It is by no means a full-featured tool, but it accepts most Graphviz digraph arguments.

Some versions of this saved me hours of explanations, and the results are now easier to share.

Features

  • Parse PostgreSQL dump files to extract table definitions and relationships.
  • Generate interactive SVG ERDs with automatic color coding.
  • Support for complex SQL structures, including foreign keys, constraints, and various data types.
  • Table filtering to exclude temporary or utility tables.
  • Accessible color palette with proper contrast for readability.
  • Comprehensive test suite with >80% code coverage.

Installation

  1. Install pypgsvg:

    pip install pypgsvg
    
  2. Ensure Graphviz is installed:

    • macOS: brew install graphviz
    • Ubuntu/Debian: sudo apt-get install graphviz
    • Windows: Download from Graphviz.org.
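
To confirm that the Graphviz binaries are reachable before generating a diagram, a small check along these lines may help. It is a minimal sketch using only the standard library; the helper name `graphviz_available` is hypothetical and not part of pypgsvg.

```python
import shutil


def graphviz_available(executable: str = "dot") -> bool:
    """Return True if the given Graphviz executable is found on PATH."""
    return shutil.which(executable) is not None


if __name__ == "__main__":
    if graphviz_available():
        print("Graphviz found at:", shutil.which("dot"))
    else:
        print("Graphviz not found; install it with one of the commands above.")
```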

Usage

Command-Line Usage

Generate an ERD from your SQL dump file:

python -m src.pypgsvg Samples/schema.dump --output your_database_erd --rankdir TB --node-sep 4 --packmode graph
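
The `--rankdir`, `--node-sep`, and `--packmode` options share their names with the Graphviz graph attributes `rankdir`, `nodesep`, and `packmode`. The sketch below shows what a DOT graph carrying those attributes could look like. It assumes the options are passed through unchanged, which this README does not confirm, and `build_dot` is a hypothetical helper rather than pypgsvg code.

```python
def build_dot(edges, rankdir="TB", nodesep=4, packmode="graph"):
    """Build DOT source for a directed graph of (child, parent) table pairs.

    Assumption: the CLI values map one-to-one onto Graphviz graph attributes.
    """
    lines = ["digraph ERD {"]
    lines.append(
        f'  graph [rankdir="{rankdir}", nodesep="{nodesep}", packmode="{packmode}"];'
    )
    lines.append("  node [shape=box];")
    for child, parent in edges:
        # One edge per foreign key, pointing from the referencing table
        # to the referenced table.
        lines.append(f'  "{child}" -> "{parent}";')
    lines.append("}")
    return "\n".join(lines)


if __name__ == "__main__":
    print(build_dot([("orders", "customers"), ("order_items", "orders")]))
```

Saving the printed text to a file and running `dot -Tsvg` on it is one way to preview how a layout option changes the result.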

View the diagram immediately after generation:

python -m src.pypgsvg Samples/schema.dump --view

The following screenshots were generated from the dump file in the Samples directory.

Demo Images

Python API Usage

For programmatic use:

from src.pypgsvg import parse_sql_dump, generate_erd_with_graphviz

# Load SQL dump
with open("your_database_dump.sql", "r", encoding='utf-8') as file:
    sql_content = file.read()

# Parse tables and relationships
tables, foreign_keys, errors = parse_sql_dump(sql_content)

# Generate ERD
if not errors:
    generate_erd_with_graphviz(tables, foreign_keys, "database_diagram")
    print("ERD generated successfully!")
else:
    print("Parsing errors found:", errors)

Components

Metadata

Displays information about the generated SVG and the options used.

Overview

A minimap to quickly navigate larger SVG files.

Selection SQL

Shows the SQL that generated the selected elements. A single click inside the content area converts it to a text area for copy and paste.


Testing

Run the complete test suite:

PYTHONPATH=src python -m pytest tests/tests

Run specific test categories:

pytest -m unit          # Unit tests only
pytest -m integration   # Integration tests only

Generate an HTML coverage report:

pytest --cov=src --cov-report=html
open htmlcov/index.html

Project Structure

├── src/
│   └── create_graph.py          # Main application code
├── tests/
│   ├── conftest.py              # Test fixtures and configuration
│   ├── test_utils.py            # Tests for utility functions
│   ├── test_parser.py           # Tests for SQL parsing
│   ├── test_erd_generation.py   # Tests for ERD generation
│   └── test_integration.py      # Integration tests
├── requirements.txt             # Python dependencies
├── pyproject.toml               # pytest configuration
└── README.md                    # Documentation

Configuration

Table Exclusion

The application automatically excludes tables matching certain patterns:

  • Views (vw_)
  • Backup tables (bk)
  • Temporary fix tables (fix)
  • Duplicate tables (dups, duplicates)
  • Match tables (matches)
  • Version logs (versionlog)
  • Old tables (old)
  • Member data (memberdata)
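
The exclusion rules above can be sketched as a simple name filter. This is an illustration only: it assumes the patterns are matched as case-insensitive substrings, which the README does not state, and `should_exclude` is a hypothetical name.

```python
# Patterns taken from the list above.
EXCLUDE_PATTERNS = (
    "vw_", "bk", "fix", "dups", "duplicates",
    "matches", "versionlog", "old", "memberdata",
)


def should_exclude(table_name: str, patterns=EXCLUDE_PATTERNS) -> bool:
    """Return True if the table name contains any exclusion pattern.

    Assumption: case-insensitive substring matching.
    """
    name = table_name.lower()
    return any(pattern in name for pattern in patterns)


if __name__ == "__main__":
    tables = ["customers", "vw_active_customers", "orders_old", "orders"]
    kept = [t for t in tables if not should_exclude(t)]
    print(kept)  # ['customers', 'orders']
```

If matching really is substring-based, short patterns such as `old` or `bk` would also catch names like `household`, so check the generated diagram for unexpectedly missing tables.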

Color Palette

The ERD uses an accessible color palette with automatic contrast calculation for text readability following WCAG guidelines.
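
For reference, the WCAG contrast ratio between two colors is computed from their relative luminance. The sketch below implements that published formula and uses it to choose black or white text for a background. The function names are hypothetical, and pypgsvg's own implementation may differ.

```python
def relative_luminance(hex_color: str) -> float:
    """WCAG relative luminance of an sRGB color given as '#rrggbb'."""
    hex_color = hex_color.lstrip("#")
    channels = [int(hex_color[i:i + 2], 16) / 255 for i in (0, 2, 4)]
    # Convert each gamma-encoded sRGB channel to linear light.
    linear = [
        c / 12.92 if c <= 0.03928 else ((c + 0.055) / 1.055) ** 2.4
        for c in channels
    ]
    r, g, b = linear
    return 0.2126 * r + 0.7152 * g + 0.0722 * b


def contrast_ratio(color_a: str, color_b: str) -> float:
    """WCAG contrast ratio, from 1 (identical) to 21 (black on white)."""
    lum_a, lum_b = relative_luminance(color_a), relative_luminance(color_b)
    lighter, darker = max(lum_a, lum_b), min(lum_a, lum_b)
    return (lighter + 0.05) / (darker + 0.05)


def pick_text_color(background: str) -> str:
    """Return black or white, whichever contrasts more with the background."""
    black, white = "#000000", "#ffffff"
    if contrast_ratio(background, black) >= contrast_ratio(background, white):
        return black
    return white


if __name__ == "__main__":
    for bg in ("#ffff00", "#000080"):
        print(bg, "->", pick_text_color(bg))
```

WCAG level AA asks for a ratio of at least 4.5:1 for normal-size text, so a palette can be checked by comparing `contrast_ratio` against that threshold.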


Supported SQL Features

  • CREATE TABLE statements with various column types.
  • ALTER TABLE ... ADD CONSTRAINT ... FOREIGN KEY.
  • Quoted identifiers and complex data types (e.g., numeric, timestamp, jsonb).
  • Multiple constraint variations.
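
To illustrate the kind of statement involved, the sketch below extracts foreign keys from `ALTER TABLE ... ADD CONSTRAINT ... FOREIGN KEY` statements with a regular expression. It is not pypgsvg's parser: the pattern, the `find_foreign_keys` helper, and the sample SQL are assumptions made for this example, and the pattern covers only the simple form shown.

```python
import re

# Hypothetical pattern for one common pg_dump foreign key form.
FK_PATTERN = re.compile(
    r'ALTER\s+TABLE\s+(?:ONLY\s+)?(?P<table>[\w."]+)\s+'
    r'ADD\s+CONSTRAINT\s+(?P<name>[\w"]+)\s+'
    r'FOREIGN\s+KEY\s*\((?P<column>[^)]+)\)\s+'
    r'REFERENCES\s+(?P<ref_table>[\w."]+)\s*\((?P<ref_column>[^)]+)\)',
    re.IGNORECASE,
)


def find_foreign_keys(sql: str) -> list:
    """Return one dict per foreign key found, with quotes stripped."""
    return [
        {key: value.strip().replace('"', "") for key, value in m.groupdict().items()}
        for m in FK_PATTERN.finditer(sql)
    ]


SAMPLE_SQL = """
ALTER TABLE ONLY public.orders
    ADD CONSTRAINT orders_customer_id_fkey FOREIGN KEY (customer_id) REFERENCES public.customers(id);
"""

if __name__ == "__main__":
    for fk in find_foreign_keys(SAMPLE_SQL):
        print(fk["table"], "->", fk["ref_table"], "via", fk["column"])
```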

Error Handling

The application includes comprehensive error handling for:

  • Malformed SQL syntax.
  • Missing table references in foreign keys.
  • Unicode encoding issues.
  • File reading errors.
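
When calling the API from your own script, file and encoding problems can also be caught before the SQL reaches the parser. The following is a minimal sketch with a hypothetical `read_dump` helper. It collects error messages in a list instead of raising, which only loosely mirrors the `errors` value in the API example and is not taken from pypgsvg itself.

```python
from pathlib import Path


def read_dump(path: str) -> tuple[str, list[str]]:
    """Read a dump file, returning (text, errors) instead of raising."""
    errors: list[str] = []
    try:
        raw = Path(path).read_bytes()
    except OSError as exc:
        # Covers missing files, permission problems, and directories.
        return "", [f"Could not read {path}: {exc}"]
    try:
        return raw.decode("utf-8"), errors
    except UnicodeDecodeError as exc:
        # Keep going with replacement characters, but record the problem.
        errors.append(f"{path} is not valid UTF-8 ({exc.reason}); bad bytes were replaced")
        return raw.decode("utf-8", errors="replace"), errors


if __name__ == "__main__":
    text, errors = read_dump("your_database_dump.sql")
    print("errors:", errors)
```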

Contributing

  1. Follow PEP 8 style guidelines.
  2. Write tests for new functionality.
  3. Maintain >90% test coverage.
  4. Use type hints where appropriate.
  5. Update documentation as needed.

Dependencies

  • graphviz>=0.20.1 - For generating diagrams.
  • pytest>=7.4.0 - Testing framework.
  • pytest-cov>=4.1.0 - Coverage reporting.
  • pytest-mock>=3.11.0 - Mocking utilities.
