Generate SVG Entity Relationship Diagrams from PostgreSQL database dumps
Python ERD Generator from Postgres schema dump file
pypgsvg is an open source Python application that parses PostgreSQL schema SQL dump files and generates directed Entity Relationship Diagrams (Digraph ERDs) using Graphviz. The generated diagrams include overview controls, user edge highlighting, and introspection that displays the SQL used to generate the table nodes and edges, so you can quickly diagnose or debug PostgreSQL data structures.
In a past life I was tasked with showing what a normalized PostgreSQL database looks like for employers who did not want to pay for Postgres tools with graphical features. There are certainly usable free ones out there, but at the time they required installing Java. So admittedly this started as an academic exercise. By no means is this even an allegedly full-throated tool, but it takes most Digraph args.
Some versions of this saved me hours of time in explanations, and the output is now easier to share.
Supported Command-Line Arguments
The following arguments are supported and passed to Graphviz for generating the ERD:
Input and Output
- input_file: Path to the PostgreSQL dump file to be parsed.
- -o, --output: Output file name (without extension). Default: schema_erd.
Diagram Customization
- --show-standalone: Show or hide standalone tables. Default: 'true'.
- --view: Open the generated SVG in a browser.
- --saturate: Saturation factor for table colors. Default: 1.8.
- --brightness: Brightness factor for table colors. Default: 1.0.
Graphviz Parameters
- --packmode: Graphviz packmode (array, cluster, graph). Default: array.
- --rankdir: Direction of graph layout (TB, LR, BT, RL). Default: TB (top-to-bottom).
- --esep: Edge separation value. Default: 8.
Font Customization
- --fontname: Font name for graph, nodes, and edges. Default: Arial.
- --fontsize: Font size for graph label. Default: 18.
- --node-fontsize: Font size for node labels. Default: 14.
- --edge-fontsize: Font size for edge labels. Default: 12.
Node Style and Layout
- --node-style: Node style (e.g., filled or rounded,filled). Default: rounded,filled.
- --node-shape: Node shape (e.g., rect, ellipse). Default: rect.
- --node-sep: Node separation distance. Default: 0.5.
- --rank-sep: Rank separation distance. Default: 1.2.
PYTHONPATH=src python -m pytest tests/tests/
python -m src.pypgsvg Samples/schema.dump --show-standalone=true --output=test --rankdir TB --node-sep 4 --packmode 'graph'
python -m src.pypgsvg Samples/schema.dump --output=test
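To make the mapping from these options to Graphviz concrete, here is a minimal sketch of how the documented defaults could be turned into DOT graph, node, and edge attribute statements. The helper `dot_header` and the `DEFAULTS` dictionary are hypothetical names used only for illustration and are not part of pypgsvg; the attribute names (`packmode`, `rankdir`, `esep`, `nodesep`, `ranksep`, `fontname`, `fontsize`, `shape`, `style`) are standard Graphviz attributes.

```python
# Hypothetical helper, not pypgsvg's implementation.
# Default values mirror the ones documented in the argument list above.
DEFAULTS = {
    "packmode": "array",
    "rankdir": "TB",
    "esep": "8",
    "fontname": "Arial",
    "fontsize": "18",
    "node_fontsize": "14",
    "edge_fontsize": "12",
    "node_style": "rounded,filled",
    "node_shape": "rect",
    "node_sep": "0.5",
    "rank_sep": "1.2",
}


def dot_header(**overrides) -> str:
    """Return the opening lines of a DOT digraph for the given options."""
    unknown = set(overrides) - set(DEFAULTS)
    if unknown:
        raise ValueError(f"unknown options: {sorted(unknown)}")
    # Merge user overrides on top of the defaults, as strings.
    o = {**DEFAULTS, **{k: str(v) for k, v in overrides.items()}}
    return "\n".join([
        "digraph ERD {",
        f'  graph [packmode="{o["packmode"]}", rankdir="{o["rankdir"]}", '
        f'esep="{o["esep"]}", nodesep="{o["node_sep"]}", '
        f'ranksep="{o["rank_sep"]}", fontname="{o["fontname"]}", '
        f'fontsize="{o["fontsize"]}"];',
        f'  node [shape="{o["node_shape"]}", style="{o["node_style"]}", '
        f'fontname="{o["fontname"]}", fontsize="{o["node_fontsize"]}"];',
        f'  edge [fontname="{o["fontname"]}", fontsize="{o["edge_fontsize"]}"];',
    ])


# Roughly what "--rankdir LR --node-sep 4" would translate to in this sketch.
print(dot_header(rankdir="LR", node_sep=4))
```

The function returns only the opening of the graph; node and edge statements plus a closing brace would follow it.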
BETA, TODO, and Features
- TODO: Allow all argument options for Digraph.
- Allow showing or hiding FKs based on cascade type.
- CSS facelift.
- Parse PostgreSQL dump files to extract table definitions and relationships
- Generate interactive SVG Entity Relationship Diagrams
- Automatic color coding for tables with accessible color palette
- Support for complex SQL structures including foreign keys, constraints, and various data types
- Table filtering to exclude temporary/utility tables
- Comprehensive test suite with >80% code coverage
Installation
- Install pypgsvg:
pip install pypgsvg
- Ensure Graphviz is installed on your system:
  - macOS: brew install graphviz
  - Ubuntu/Debian: sudo apt-get install graphviz
  - Windows: Download from https://graphviz.org/download/
Usage
Basic Usage
Generate an ERD from your SQL dump file:
pypgsvg your_database.sql
This will create an SVG file named after your input file (e.g., your_database_erd.svg).
Example Output
Here's an example of the generated ERD from a sample database schema with users, posts, and comments:
Simple Blog Schema - showing basic relationships between users, posts, and comments
For more complex databases, pypgsvg can handle extensive schemas with many tables and relationships:
Complex Database Schema - demonstrating pypgsvg's ability to visualize large, real-world database structures
The diagram shows:
- Tables as nodes with their column definitions
- Foreign key relationships as directed edges between tables
- Automatic color coding for visual distinction
- Accessible color palette with proper contrast for readability
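As a rough illustration of the first two points, the sketch below turns tables and foreign keys into DOT text, with one node per table and one directed edge per foreign key. The function `erd_to_dot` and the data shapes it accepts (a dict of table name to column strings, and 4-tuples for foreign keys) are assumptions made for this example; they are not the structures pypgsvg uses internally.

```python
def erd_to_dot(tables: dict[str, list[str]],
               foreign_keys: list[tuple[str, str, str, str]]) -> str:
    """Render tables as nodes and foreign keys as directed edges (DOT text).

    Hypothetical shapes: tables maps a table name to its column definitions;
    each foreign key is (table, column, referenced_table, referenced_column).
    """
    lines = ["digraph ERD {", "  node [shape=rect];"]
    for name, columns in tables.items():
        # In a Graphviz label, "\l" ends a line and left-justifies it.
        label = name + r"\l" + "".join(f"  {col}" + r"\l" for col in columns)
        lines.append(f'  "{name}" [label="{label}"];')
    for table, column, ref_table, ref_column in foreign_keys:
        # The edge points from the referencing table to the referenced table.
        lines.append(
            f'  "{table}" -> "{ref_table}" [label="{column} -> {ref_column}"];'
        )
    lines.append("}")
    return "\n".join(lines)


print(erd_to_dot(
    {"users": ["id integer"], "posts": ["id integer", "user_id integer"]},
    [("posts", "user_id", "users", "id")],
))
```

The resulting text can be saved to a file and rendered with the Graphviz command-line tool, for example `dot -Tsvg erd.dot -o erd.svg`.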
Usage
Specify a custom output filename:
pypgsvg your_database_dump.sql --output custom_diagram
View the diagram immediately after generation:
pypgsvg your_database_dump.sql --view
Python API Usage
For programmatic use:
from src.pypgsvg import parse_sql_dump, generate_erd_with_graphviz

# Load SQL dump
with open("your_database_dump.sql", "r", encoding='utf-8') as file:
    sql_content = file.read()

# Parse tables and relationships
tables, foreign_keys, errors = parse_sql_dump(sql_content)

# Generate ERD
if not errors:
    generate_erd_with_graphviz(tables, foreign_keys, "database_diagram")
    print("ERD generated successfully!")
else:
    print("Parsing errors found:", errors)
Testing
Run the complete test suite with coverage:
# Run all tests with coverage
PYTHONPATH=src python -m pytest tests/tests/
# Run specific test categories
pytest -m unit # Unit tests only
pytest -m integration # Integration tests only
# Run with verbose output
pytest -v
# Generate HTML coverage report
pytest --cov-report=html
open htmlcov/index.html # View coverage report
Project Structure
├── src/
│ └── create_graph.py # Main application code
├── tests/
│ ├── conftest.py # Test fixtures and configuration
│ ├── test_utils.py # Tests for utility functions
│ ├── test_parser.py # Tests for SQL parsing
│ ├── test_erd_generation.py # Tests for ERD generation
│ └── test_integration.py # Integration tests
├── requirements.txt # Python dependencies
├── pyproject.toml # pytest configuration
└── README.md # This file
Configuration
Table Exclusion
The application automatically excludes tables matching certain patterns (defined in should_exclude_table):
- Views (vw_)
- Backup tables (bk)
- Temporary fix tables (fix)
- Duplicate tables (dups, duplicates)
- Match tables (matches)
- Version logs (versionlog)
- Old tables (old)
- Member data (memberdata)
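A minimal sketch of this kind of pattern-based filter is shown below. It assumes case-insensitive substring matching, which may differ from what should_exclude_table actually does; the name `matches_exclusion_pattern` is hypothetical and chosen to avoid implying it is the real function.

```python
# Patterns taken from the list above. The matching rule (case-insensitive
# substring search) is an assumption, not pypgsvg's confirmed behavior.
EXCLUDE_PATTERNS = (
    "vw_", "bk", "fix", "dups", "duplicates",
    "matches", "versionlog", "old", "memberdata",
)


def matches_exclusion_pattern(table_name: str) -> bool:
    """Return True if the table name contains any exclusion pattern."""
    name = table_name.lower()
    return any(pattern in name for pattern in EXCLUDE_PATTERNS)


for candidate in ("vw_active_users", "orders_old", "users", "posts"):
    print(candidate, matches_exclusion_pattern(candidate))
```

Substring matching is deliberately broad: under this assumption a table named folders would also be excluded because it contains old, so check your table names if something is missing from the diagram.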
Color Palette
The ERD uses an accessible color palette with automatic contrast calculation for text readability following WCAG guidelines.
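For readers unfamiliar with the WCAG calculation, the following sketch computes relative luminance and contrast ratio as defined in WCAG 2.x and picks black or white text for a given background. The function names are illustrative, and pypgsvg's own palette code may be organized differently.

```python
def _linearize(channel: int) -> float:
    """Convert an 8-bit sRGB channel to linear light (WCAG 2.x formula)."""
    c = channel / 255
    return c / 12.92 if c <= 0.03928 else ((c + 0.055) / 1.055) ** 2.4


def relative_luminance(hex_color: str) -> float:
    """Relative luminance of a color given as '#rrggbb'."""
    h = hex_color.lstrip("#")
    r, g, b = (int(h[i:i + 2], 16) for i in (0, 2, 4))
    return 0.2126 * _linearize(r) + 0.7152 * _linearize(g) + 0.0722 * _linearize(b)


def contrast_ratio(color_a: str, color_b: str) -> float:
    """WCAG contrast ratio, from 1 (identical) up to 21 (black on white)."""
    la, lb = relative_luminance(color_a), relative_luminance(color_b)
    lighter, darker = max(la, lb), min(la, lb)
    return (lighter + 0.05) / (darker + 0.05)


def readable_text_color(background: str) -> str:
    """Pick black or white text, whichever contrasts more with the background."""
    return max(("#000000", "#ffffff"),
               key=lambda fg: contrast_ratio(fg, background))


print(readable_text_color("#ffff00"))  # yellow background
print(readable_text_color("#00008b"))  # dark blue background
```

WCAG AA asks for a ratio of at least 4.5 for normal-size text, so `contrast_ratio` can also be used to check a palette entry against that threshold.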
Supported SQL Features
- CREATE TABLE statements with various column types
- CREATE TABLE IF NOT EXISTS
- ALTER TABLE ... ADD CONSTRAINT ... FOREIGN KEY
- Quoted identifiers
- Complex data types (numeric, timestamp, jsonb, etc.)
- Multiple constraint variations
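To show what extracting a relationship from an ALTER TABLE ... ADD CONSTRAINT ... FOREIGN KEY statement can look like, here is a simplified regular-expression sketch. It is not pypgsvg's parser: `FK_RE` and `find_foreign_keys` are hypothetical names, and the pattern only handles simple or double-quoted identifiers without spaces or embedded dots.

```python
import re

# Simplified pattern for pg_dump-style foreign key statements (an assumption;
# a real parser needs to handle more identifier and constraint variations).
FK_RE = re.compile(
    r'ALTER\s+TABLE\s+(?:ONLY\s+)?(?P<table>[\w."]+)\s+'
    r'ADD\s+CONSTRAINT\s+(?P<name>[\w"]+)\s+'
    r'FOREIGN\s+KEY\s*\((?P<cols>[^)]+)\)\s*'
    r'REFERENCES\s+(?P<ref_table>[\w."]+)\s*\((?P<ref_cols>[^)]+)\)',
    re.IGNORECASE,
)


def _clean(identifier: str) -> str:
    """Drop a schema prefix, surrounding whitespace, and double quotes."""
    return identifier.split(".")[-1].strip().strip('"')


def find_foreign_keys(sql: str) -> list[tuple[str, str, str, str]]:
    """Return (table, column, referenced_table, referenced_column) tuples."""
    found = []
    for m in FK_RE.finditer(sql):
        cols = [_clean(c) for c in m["cols"].split(",")]
        ref_cols = [_clean(c) for c in m["ref_cols"].split(",")]
        # Composite keys yield one tuple per column pair.
        for col, ref_col in zip(cols, ref_cols):
            found.append((_clean(m["table"]), col, _clean(m["ref_table"]), ref_col))
    return found


sample = """
ALTER TABLE ONLY public.posts
    ADD CONSTRAINT posts_user_id_fkey FOREIGN KEY (user_id) REFERENCES public.users(id);
"""
print(find_foreign_keys(sample))
```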
Error Handling
The application includes comprehensive error handling for:
- Malformed SQL syntax
- Missing table references in foreign keys
- Unicode encoding issues
- File reading errors
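One way to handle the Unicode case without aborting is to collect problems in a list, in the same spirit as the errors value returned in the Python API example. The sketch below is an assumption about how such handling could look, not a description of pypgsvg's code; `decode_dump` is a hypothetical helper.

```python
def decode_dump(raw: bytes) -> tuple[str, list[str]]:
    """Decode dump bytes as UTF-8, reporting problems instead of raising."""
    errors: list[str] = []
    try:
        return raw.decode("utf-8"), errors
    except UnicodeDecodeError as exc:
        # Record where decoding failed, then fall back to a lossy decode so
        # the rest of the dump can still be parsed.
        errors.append(
            f"not valid UTF-8 at byte {exc.start}; undecodable bytes replaced"
        )
        return raw.decode("utf-8", errors="replace"), errors


text, problems = decode_dump(b"CREATE TABLE caf\xe9 (id integer);")
print(text)
print(problems)
```

Reading the file in binary mode (`open(path, "rb")`) and passing the bytes to a helper like this keeps file errors (raised by `open`) separate from encoding errors.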
Contributing
- Follow PEP 8 style guidelines
- Write tests for new functionality
- Maintain >90% test coverage
- Use type hints where appropriate
- Update documentation as needed
Dependencies
- graphviz>=0.20.1 - For generating diagrams
- pytest>=7.4.0 - Testing framework
- pytest-cov>=4.1.0 - Coverage reporting
- pytest-mock>=3.11.0 - Mocking utilities