Skip to main content

Generate SVG Entity Relationship Diagrams from PostgreSQL database dumps

Project description

Python ERD Generator from Postgres schema dump file

pypgsvg is an open source Python application that parses postgresql schema SQL dump files and generates Directed Entity Relationship Diagrams (Diagraph, ERDs) using Graphviz with overview controls, user edge highilight and introspection displaying the SQL used to generate the table nodes, and edges to quickly diagnose or debug postgres sql data structures

In a past life I had been tasked with showing what the normalized postgresql database looks like for employers who do not want to pay for postgres tools that have the graphical tools. There are certainly usable free ones out there, and at the time required the installation of Java. So admittedly this started as an academic excersise. By no means is this even an alledgedly full throated tool, but takes most diagraph args.

Some versions of this saved me hours of time in explainations, now easier to share.


Supported Command-Line Arguments

The following arguments are supported and passed to Graphviz for generating the ERD:

Input and Output

  • input_file: Path to the PostgreSQL dump file to be parsed.
  • -o, --output: Output file name (without extension). Default: schema_erd.

Diagram Customization

  • --show-standalone: Hide standalone tables. Default: 'true'.
  • --view: Open the generated SVG in a browser.
  • --saturate: Saturation factor for table colors. Default: 1.8.
  • --brightness: Brightness factor for table colors. Default: 1.0.

Graphviz Parameters

  • --packmode: Graphviz packmode (array, cluster, graph). Default: array.
  • --rankdir: Direction of graph layout (TB, LR, BT, RL). Default: TB (top-to-bottom).
  • --esep: Edge separation value. Default: 8.

Font Customization

  • --fontname: Font name for graph, nodes, and edges. Default: Arial.
  • --fontsize: Font size for graph label. Default: 18.
  • --node-fontsize: Font size for node labels. Default: 14.
  • --edge-fontsize: Font size for edge labels. Default: 12.

Node Style and Layout

  • --node-style: Node style (e.g., filled, rounded,filled). Default: rounded,filled.
  • --node-shape: Node shape (e.g., rect, ellipse). Default: rect.
  • --node-sep: Node separation distance. Default: 0.5.
  • --rank-sep: Rank separation distance. Default: 1.2.

 PYTHONPATH=src python -m pytest tests/tests/ 
python -m src.pypgsvg Samples/schema.dump  --show-standalone=true --output=test  --rankdir TB --node-sep 4 --packmode 'graph' 
 python -m src.pypgsvg Samples/schema.dump --output=test

BETA, TODO, and Features

  • TODO: Allow all argument options for Diagraph..
    • allow show/hide FK based on cascade type.
    • CSS facelift.
  • Parse PostgreSQL dump files to extract table definitions and relationships
  • Generate interactive SVG Entity Relationship Diagrams
  • Automatic color coding for tables with accessible color palette
  • Support for complex SQL structures including foreign keys, constraints, and various data types
  • Table filtering to exclude temporary/utility tables
  • Comprehensive test suite with >80% code coverage

Installation

  1. Install pypgsvg:
pip install pypgsvg
  1. Ensure Graphviz is installed on your system:

Usage

Basic Usage

Generate an ERD from your SQL dump file:

pypgsvg your_database.sql

This will create an SVG file with the same name as your input file (e.g., your_database_erd.svg).

Example Output

Here's an example of the generated ERD from a sample database schema with users, posts, and comments:

Sample ERD

Simple Blog Schema - showing basic relationships between users, posts, and comments


For more complex databases, pypgsvg can handle extensive schemas with many tables and relationships:

Complex Database Schema

Complex Database Schema - demonstrating pypgsvg's ability to visualize large, real-world database structures

View the interactive SVG diagrams:

The diagram shows:

  • Tables as nodes with their column definitions
  • Foreign key relationships as directed edges between tables
  • Automatic color coding for visual distinction
  • Accessible color palette with proper contrast for readability

Usage

Specify a custom output filename:

pypgsvg your_database_dump.sql --output custom_diagram.svg

View the diagram immediately after generation:

pypgsvg your_database_dump.sql --view

Python API Usage

For programmatic use:

from src.pypgsvg import parse_sql_dump, generate_erd_with_graphviz

# Load SQL dump
with open("your_database_dump.sql", "r", encoding='utf-8') as file:
    sql_content = file.read()

# Parse tables and relationships
tables, foreign_keys, errors = parse_sql_dump(sql_content)

# Generate ERD
if not errors:
    generate_erd_with_graphviz(tables, foreign_keys, "database_diagram")
    print("ERD generated successfully!")
else:
    print("Parsing errors found:", errors)

Testing

Run the complete test suite with coverage:

# Run all tests with coverage
PYTHONPATH=src python -m pytest tests/tests/


# Run specific test categories
pytest -m unit          # Unit tests only
pytest -m integration   # Integration tests only

# Run with verbose output
pytest -v

# Generate HTML coverage report
pytest --cov-report=html
open htmlcov/index.html  # View coverage report

Project Structure

├── src/
│   └── create_graph.py          # Main application code
├── tests/
│   ├── conftest.py              # Test fixtures and configuration
│   ├── test_utils.py            # Tests for utility functions
│   ├── test_parser.py           # Tests for SQL parsing
│   ├── test_erd_generation.py   # Tests for ERD generation
│   └── test_integration.py      # Integration tests
├── requirements.txt             # Python dependencies
├── pyproject.toml              # pytest configuration
└── README.md                   # This file

Configuration

Table Exclusion

The application automatically excludes tables matching certain patterns (defined in should_exclude_table):

  • Views (vw_)
  • Backup tables (bk)
  • Temporary fix tables (fix)
  • Duplicate tables (dups, duplicates)
  • Match tables (matches)
  • Version logs (versionlog)
  • Old tables (old)
  • Member data (memberdata)

Color Palette

The ERD uses an accessible color palette with automatic contrast calculation for text readability following WCAG guidelines.

Supported SQL Features

  • CREATE TABLE statements with various column types
  • CREATE TABLE IF NOT EXISTS
  • ALTER TABLE ... ADD CONSTRAINT ... FOREIGN KEY
  • Quoted identifiers
  • Complex data types (numeric, timestamp, jsonb, etc.)
  • Multiple constraint variations

Error Handling

The application includes comprehensive error handling for:

  • Malformed SQL syntax
  • Missing table references in foreign keys
  • Unicode encoding issues
  • File reading errors

Contributing

  1. Follow PEP 8 style guidelines
  2. Write tests for new functionality
  3. Maintain >90% test coverage
  4. Use type hints where appropriate
  5. Update documentation as needed

Dependencies

  • graphviz>=0.20.1 - For generating diagrams
  • pytest>=7.4.0 - Testing framework
  • pytest-cov>=4.1.0 - Coverage reporting
  • pytest-mock>=3.11.0 - Mocking utilities

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pypgsvg-1.0.71.tar.gz (162.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pypgsvg-1.0.71-py3-none-any.whl (20.8 kB view details)

Uploaded Python 3

File details

Details for the file pypgsvg-1.0.71.tar.gz.

File metadata

  • Download URL: pypgsvg-1.0.71.tar.gz
  • Upload date:
  • Size: 162.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.2

File hashes

Hashes for pypgsvg-1.0.71.tar.gz
Algorithm Hash digest
SHA256 e6288880318ec6b9c98e64c88aa0aa929761f69f08aab0f8e409f3d7217097d3
MD5 da74669db8daa6f85511bd2c46fe069b
BLAKE2b-256 c73b69f97c42538b0ef1a89912b070e14ac008b2dbbfd6ea70241ae5e43ee706

See more details on using hashes here.

File details

Details for the file pypgsvg-1.0.71-py3-none-any.whl.

File metadata

  • Download URL: pypgsvg-1.0.71-py3-none-any.whl
  • Upload date:
  • Size: 20.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.2

File hashes

Hashes for pypgsvg-1.0.71-py3-none-any.whl
Algorithm Hash digest
SHA256 7aa5e049781f549e179fe60d28bec2debfb4d2f940998f57115231a80c47de1b
MD5 a89d87602340efcb3917950bce9010cf
BLAKE2b-256 45df093e826a044c15e77ddc51653082211d5b56cd2b1a9b4d1d6342204bf882

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page