Skip to main content

MySQL loader for mkpipe.

Project description

mkpipe-loader-mysql

MySQL loader plugin for MkPipe. Writes Spark DataFrames into MySQL tables via JDBC.

Documentation

For more detailed documentation, please visit the GitHub repository.

License

This project is licensed under the Apache 2.0 License - see the LICENSE file for details.


Connection Configuration

connections:
  mysql_target:
    variant: mysql
    host: localhost
    port: 3306
    database: mydb
    user: myuser
    password: mypassword

Table Configuration

pipelines:
  - name: pg_to_mysql
    source: pg_source
    destination: mysql_target
    tables:
      - name: public.orders
        target_name: stg_orders
        replication_method: full
        batchsize: 10000

Write Strategy

Control how data is written to MySQL:

      - name: public.orders
        target_name: stg_orders
        write_strategy: upsert       # append | replace | upsert | merge
        write_key: [id]              # required for upsert/merge
Strategy MySQL Behavior
append Plain INSERT via JDBC (default for incremental)
replace Drop and recreate table, then insert (default for full). Use if_exists: append to preserve existing table
upsert INSERT ... ON DUPLICATE KEY UPDATE via temp table
merge Same as upsert for MySQL

Note: upsert/merge requires write_key. The target table must have a unique index/primary key on the write_key columns.


Write Parallelism & Throughput

      - name: public.orders
        target_name: stg_orders
        replication_method: full
        batchsize: 10000
        write_partitions: 4
  • batchsize: rows per JDBC batch INSERT. MySQL handles 5,000–20,000 well.
  • write_partitions: reduces concurrent JDBC connections via coalesce(N).

All Table Parameters

Parameter Type Default Description
name string required Source table name
target_name string required MySQL destination table name
replication_method full / incremental full Replication strategy
batchsize int 10000 Rows per JDBC batch insert
write_partitions int Coalesce DataFrame to N partitions before writing
write_strategy string append, replace, upsert, merge
write_key list Key columns for upsert/merge (required)
if_exists string replace (drop+create) or append (preserve table). Inherits from settings
dedup_columns list Columns used for mkpipe_id hash deduplication
tags list [] Tags for selective pipeline execution
pass_on_error bool false Skip table on error instead of failing

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mkpipe_loader_mysql-0.6.0.tar.gz (7.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

mkpipe_loader_mysql-0.6.0-py3-none-any.whl (7.6 kB view details)

Uploaded Python 3

File details

Details for the file mkpipe_loader_mysql-0.6.0.tar.gz.

File metadata

  • Download URL: mkpipe_loader_mysql-0.6.0.tar.gz
  • Upload date:
  • Size: 7.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.15

File hashes

Hashes for mkpipe_loader_mysql-0.6.0.tar.gz
Algorithm Hash digest
SHA256 90463b4acf363de9d193ca875db0598c93904ab0ea8392112fae67a987bda855
MD5 f3b0748c132a8a0a4749bca391c3af02
BLAKE2b-256 03c53373f897ffb5a22710be4cbfc032d4f2fa37e94705e49f39ffbe52b0ddac

See more details on using hashes here.

File details

Details for the file mkpipe_loader_mysql-0.6.0-py3-none-any.whl.

File metadata

File hashes

Hashes for mkpipe_loader_mysql-0.6.0-py3-none-any.whl
Algorithm Hash digest
SHA256 c6779b827198383799d4294d208140421add3f6818d98ae2ae9cdd930f39e75c
MD5 9084cb370e2d00936a47ea6ea35888a6
BLAKE2b-256 d7299f6c786c3f9718ba296e4e755b2a26e096252985186862acbec74b4dedc7

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page