Skip to main content

SQL Lineage Analysis Tool powered by Python

Project description

SQLLineage

SQL Lineage Analysis Tool powered by Python

image image image image Build Status Documentation Status codecov Code style: black security: bandit

Never get the hang of a SQL parser? SQLLineage comes to the rescue. Given a SQL command, SQLLineage will tell you its source and target tables, without worrying about Tokens, Keyword, Identifier and all the jagons used by SQL parsers.

Behind the scene, SQLLineage uses the fantastic sqlparse library to parse the SQL command, and bring you all the human-readable result with ease.

Documentation

Documentation is online hosted by readthedocs, and you can check the release note there.

Quick Start

Install sqllineage via PyPI:

$ pip install sqllineage

Using sqllineage command to parse a quoted-query-string:

$ sqllineage -e "insert into db1.table1 select * from db2.table2"
Statements(#): 1
Source Tables:
    db2.table2
Target Tables:
    db1.table1

Or you can parse a SQL file with -f option:

$ sqllineage -f foo.sql
Statements(#): 1
Source Tables:
    db1.table_foo
    db1.table_bar
Target Tables:
    db2.table_baz

Advanced Usage

Multiple SQL Statements

Lineage result combined for multiple SQL statements, with intermediate tables identified:

$ sqllineage -e "insert into db1.table1 select * from db2.table2; insert into db3.table3 select * from db1.table1;"
Statements(#): 2
Source Tables:
    db2.table2
Target Tables:
    db3.table3
Intermediate Tables:
    db1.table1

Verbose Lineage Result

And if you want to see lineage result for every SQL statement, just toggle verbose option

$ sqllineage -v -e "insert into db1.table1 select * from db2.table2; insert into db3.table3 select * from db1.table1;"
Statement #1: insert into db1.table1 select * from db2.table2;
    table read: [Table: db2.table2]
    table write: [Table: db1.table1]
    table rename: []
    table drop: []
    table intermediate: []
Statement #2: insert into db3.table3 select * from db1.table1;
    table read: [Table: db1.table1]
    table write: [Table: db3.table3]
    table rename: []
    table drop: []
    table intermediate: []
==========
Summary:
Statements(#): 2
Source Tables:
    db2.table2
Target Tables:
    db3.table3
Intermediate Tables:
    db1.table1

Lineage Visualization

One more cool feature, if you want a graph visualization for the lineage result, toggle graphviz option

sqllineage -g -e "insert into db1.table11 select * from db2.table21 union select * from db2.table22; insert into db3.table3 select * from db1.table11 join db1.table12;"

An interactive matplotlib graph will then pop up, showing DAG representation of the lineage result:

For visualization to work, you must have graphviz installed. With Ubuntu, it's simply

sudo apt install graphviz

For mac user, brew can help

brew install graphviz

graphviz also comes with Windows support, see Graphviz Download for details.

After that, all the extra dependencies for sqllineage must be installed as well. It will install matplotlib and pygraphviz :

pip install sqllineage[all]

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sqllineage-1.0.1.tar.gz (10.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

sqllineage-1.0.1-py3-none-any.whl (11.6 kB view details)

Uploaded Python 3

File details

Details for the file sqllineage-1.0.1.tar.gz.

File metadata

  • Download URL: sqllineage-1.0.1.tar.gz
  • Upload date:
  • Size: 10.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/49.2.1 requests-toolbelt/0.9.1 tqdm/4.50.2 CPython/3.9.0

File hashes

Hashes for sqllineage-1.0.1.tar.gz
Algorithm Hash digest
SHA256 cc12d9a9196f8bfaed2f2eb4f5a7812aa16406fab09032b66087c96a334f1785
MD5 8ce142f47fc3d1595fef94e491ed80a6
BLAKE2b-256 a201d1c552271d5b8a0f0c42640af4c8d1c2777b59a000512ffec91e8b17aa69

See more details on using hashes here.

File details

Details for the file sqllineage-1.0.1-py3-none-any.whl.

File metadata

  • Download URL: sqllineage-1.0.1-py3-none-any.whl
  • Upload date:
  • Size: 11.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/49.2.1 requests-toolbelt/0.9.1 tqdm/4.50.2 CPython/3.9.0

File hashes

Hashes for sqllineage-1.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 965da12b3ec8180efeac76f5a0f72645d36f123a57184a34ca16901a1c2c4a60
MD5 e64536eeba3f62aa02f854e817f69b18
BLAKE2b-256 8aeed55241d029b93a8a7da2e602c5a6bb63011c1e91ce132614c15dbb2f311b

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page