Skip to main content

The MySQL adapter plugin for dbt (data build tool)

Project description

dbt-mysql

This plugin ports dbt functionality to MySQL.

This is an experimental plugin:

  • We have not tested it extensively
  • Storage engines other than the default of InnoDB are untested
  • MariaDB compatibility is untested
  • Only tested with dbt-adapter-tests with MySQL 5.6, 5.7, and 8.0
  • Compatiblity with other dbt packages (like dbt_utils) is also untested

Please read these docs carefully and use at your own risk. Issues and PRs welcome!

Table of Contents

Installation

This plugin can be installed via pip:

$ pip install dbt-mysql

dbt-mysql creates connections via an ODBC driver that requires pyodbc.

See https://github.com/mkleehammer/pyodbc/wiki/Install for more info about installing pyodbc.

Supported features

MySQL 8.0

Supported? Feature
Table materialization
View materialization
Incremental materialization
Ephemeral materialization
Seeds
Sources
Custom data tests
Docs generate
Snapshots

MySQL 5.6 and 5.7

Supported? Feature
Table materialization
View materialization
Incremental materialization
Ephemeral materialization
Seeds
Sources
Custom data tests
Docs generate
🤷 Snapshots

Notes:

  • Ephemeral materializations rely upon Common Table Expressions (CTE), which is not supported until MySQL 8.0
  • MySQL 5.6 and 5.7 have some configuration gotchas that affect snapshots (see below).
MySQL 5.6 configuration gotchas

dbt snapshots might not work properly due to automatic initialization and updating for TIMESTAMP if:

  • the output of SHOW VARIABLES LIKE 'sql_mode' includes NO_ZERO_DATE
  • the output of SHOW GLOBAL VARIABLES LIKE 'explicit_defaults_for_timestamp' has a value of OFF

A solution is to include the following in a *.cnf file: Configuration to include in a *.cnf file:

[mysqld]
explicit_defaults_for_timestamp = true
MySQL 5.7 configuration gotchas

dbt snapshots might not work properly due to automatic initialization and updating for TIMESTAMP if: dbt snapshots might not work properly if:

  • the output of SHOW VARIABLES LIKE 'sql_mode' includes NO_ZERO_DATE

A solution is to include the following in a *.cnf file: Configuration to include in a *.cnf file:

[mysqld]
explicit_defaults_for_timestamp = true
sql_mode = "ALLOW_INVALID_DATES,{other_sql_modes}"

where {other_sql_modes} is the rest of the modes from the SHOW VARIABLES LIKE 'sql_mode' output.

Configuring your profile

A dbt profile can be configured to run against MySQL using the following configuration:

Example entry for profiles.yml:

your_profile_name:
  target: dev
  outputs:
    dev:
      type: mysql
      server: localhost
      schema: analytics
      username: your_mysql_username
      password: your_mysql_password
      driver: MySQL ODBC 8.0 ANSI Driver
Option Description Required? Example
server The server (hostname) to connect to Required yourorg.mysqlhost.com
schema Specify the schema (database) to build models into Required analytics
username The username to use to connect to the server Required dbt_admin
password The password to use for authenticating to the server Required correct-horse-battery-staple
driver ODBC DSN configured Required MySQL ODBC 8.0 ANSI Driver

Notes

Conflicting terminology is used between:

  • dbt
  • Database management systems (DBMS) like MySQL, Postgres, and Snowflake
  • metadata in the ANSI-standard information_schema

The conflicts include both:

  • the same word meaning different things
  • different words meaning the same thing

For example, a "database" in MySQL is not the same as a "database" in dbt, but it is equivalent to a "schema" in Postgres 🤯.

dbt-mysql uses the dbt terms. The native MySQL verbiage is restricted to SQL statements.

This cross-walk aligns the terminology:

information_schema dbt (and Postgres) MySQL
catalog database undefined / not implemented
schema schema database
relation (table/view) relation (table/view) relation (table/view)
column column column

Additionally, many DBMS have relation names with three parts whereas MySQL has only two. E.g., a fully-qualified table name in Postgres is database.schema.table versus database.table in MySQL. The missing part in MySQL is the information_schema "catalog".

DBMS Fully-qualified relation name Parts
Postgres database.schema.table 3
MySQL database.table 2

dbt-mysql borrows from dbt-spark and dbt-sqlite since Spark and SQLite also use two-part relation names.

Running Tests

  1. Modify test/mysql.dbtspec with your server, username, password, and (optionally) port
  2. Install the pytest-dbt-adapter package
  3. Run the test specs in this repository
$ pip install pytest-dbt-adapter
$ pytest test/mysql.dbtspec

Reporting bugs and contributing code

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dbt-mysql-0.18.0rc3.tar.gz (15.6 kB view details)

Uploaded Source

Built Distribution

dbt_mysql-0.18.0rc3-py3-none-any.whl (21.2 kB view details)

Uploaded Python 3

File details

Details for the file dbt-mysql-0.18.0rc3.tar.gz.

File metadata

  • Download URL: dbt-mysql-0.18.0rc3.tar.gz
  • Upload date:
  • Size: 15.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.6.1 requests/2.23.0 setuptools/50.3.2 requests-toolbelt/0.9.1 tqdm/4.54.0 CPython/3.6.11

File hashes

Hashes for dbt-mysql-0.18.0rc3.tar.gz
Algorithm Hash digest
SHA256 ebcf0ba9939b14f418336b389048375cfee5c981bc0bc963745cf8c3b81f864e
MD5 58b82f47363e28da24178b42638b2ec8
BLAKE2b-256 4525333cded7539f95e7098e1b680c7ef06d5b7bc1aad234c61db029e4dfec7b

See more details on using hashes here.

File details

Details for the file dbt_mysql-0.18.0rc3-py3-none-any.whl.

File metadata

  • Download URL: dbt_mysql-0.18.0rc3-py3-none-any.whl
  • Upload date:
  • Size: 21.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.6.1 requests/2.23.0 setuptools/50.3.2 requests-toolbelt/0.9.1 tqdm/4.54.0 CPython/3.6.11

File hashes

Hashes for dbt_mysql-0.18.0rc3-py3-none-any.whl
Algorithm Hash digest
SHA256 c28bbc1a99ca872ea645fa40b7ec9b4296555a41c7fa786f8e65973a581b8a51
MD5 c7f0aac8cd2ba4ec86ab3cf5e5bcb9dc
BLAKE2b-256 bd2bd272b667deb56707ab383bbe1205b0332f38f2fb84ddc3e23c27d1d2c05e

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page