MariaDB loader for mkpipe.

mkpipe-loader-mariadb

MariaDB loader plugin for MkPipe. Writes Spark DataFrames into MariaDB tables via JDBC.

Documentation

For more detailed documentation, please visit the GitHub repository.

License

This project is licensed under the Apache 2.0 License - see the LICENSE file for details.


Connection Configuration

connections:
  mariadb_target:
    variant: mariadb
    host: localhost
    port: 3306
    database: mydb
    user: myuser
    password: mypassword
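
As an illustration of how these fields could map onto a JDBC connection, the sketch below assembles a MariaDB Connector/J URL and a properties dict from one `connections` entry. The helper name `build_jdbc_settings` is hypothetical, and whether the plugin builds its URL exactly this way is an assumption; the `jdbc:mariadb://host:port/database` URL form and the `org.mariadb.jdbc.Driver` class name are those of MariaDB Connector/J.

```python
def build_jdbc_settings(conn: dict) -> tuple[str, dict]:
    """Hypothetical helper: turn a connection entry into (url, properties)."""
    port = conn.get("port", 3306)  # 3306 is MariaDB's default port
    url = f"jdbc:mariadb://{conn['host']}:{port}/{conn['database']}"
    properties = {
        "user": conn["user"],
        "password": conn["password"],
        "driver": "org.mariadb.jdbc.Driver",
    }
    return url, properties


# Same values as the mariadb_target entry above.
mariadb_target = {
    "variant": "mariadb",
    "host": "localhost",
    "port": 3306,
    "database": "mydb",
    "user": "myuser",
    "password": "mypassword",
}

url, properties = build_jdbc_settings(mariadb_target)
print(url)  # jdbc:mariadb://localhost:3306/mydb
```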

Table Configuration

pipelines:
  - name: pg_to_mariadb
    source: pg_source
    destination: mariadb_target
    tables:
      - name: public.orders
        target_name: stg_orders
        replication_method: full
        batchsize: 10000

Write Strategy

Control how data is written to MariaDB:

      - name: public.orders
        target_name: stg_orders
        write_strategy: upsert       # append | replace | upsert | merge
        write_key: [id]              # required for upsert/merge

| Strategy | MariaDB Behavior |
|----------|------------------|
| append | Plain INSERT via JDBC (default for incremental) |
| replace | Drop and recreate table, then insert (default for full) |
| upsert | INSERT ... ON DUPLICATE KEY UPDATE via temp table |
| merge | Same as upsert for MariaDB |
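
To make the upsert strategy more concrete, here is a minimal sketch of the kind of statement that could move rows from a temporary table into the target. It is not the plugin's actual implementation: the helper `build_upsert_sql`, the staging table name `stg_orders_tmp`, and the columns `status` and `amount` are all hypothetical.

```python
def quote(name: str) -> str:
    """Quote an identifier with backticks, doubling any embedded backtick."""
    return "`" + name.replace("`", "``") + "`"


def build_upsert_sql(target: str, staging: str,
                     columns: list[str], write_key: list[str]) -> str:
    """Hypothetical helper: build INSERT ... SELECT ... ON DUPLICATE KEY UPDATE."""
    if not write_key:
        raise ValueError("write_key is required for upsert/merge")
    col_list = ", ".join(quote(c) for c in columns)
    # Update only non-key columns; if every column is a key, fall back to a
    # no-op assignment so the statement stays syntactically valid.
    update_cols = [c for c in columns if c not in write_key] or [write_key[0]]
    assignments = ", ".join(
        f"{quote(c)} = VALUES({quote(c)})" for c in update_cols
    )
    return (
        f"INSERT INTO {quote(target)} ({col_list}) "
        f"SELECT {col_list} FROM {quote(staging)} "
        f"ON DUPLICATE KEY UPDATE {assignments}"
    )


# Assumed example: staging table and column names are illustrative only.
sql = build_upsert_sql(
    target="stg_orders",
    staging="stg_orders_tmp",
    columns=["id", "status", "amount"],
    write_key=["id"],
)
print(sql)
```

Note that MariaDB detects the "duplicate" through a PRIMARY KEY or UNIQUE index on the target table, not through a column list in the statement, so the `write_key` columns need to be covered by such an index for existing rows to be updated rather than inserted again.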

Write Parallelism & Throughput

      - name: public.orders
        target_name: stg_orders
        replication_method: full
        batchsize: 10000
        write_partitions: 4
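
  • batchsize: number of rows sent per JDBC batch INSERT.
  • write_partitions: coalesces the DataFrame to N partitions via coalesce(N) before writing. Spark opens one JDBC connection per partition, so this limits the number of concurrent connections to MariaDB.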
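
The two settings above can be sketched in PySpark terms as follows. This is an assumption about how such a write might look, not the plugin's code: `write_table` is a hypothetical helper, and the `append` save mode is chosen only for the example. `coalesce`, the `jdbc` format, and the `url`, `dbtable`, and `batchsize` options are standard Spark APIs.

```python
def write_table(df, url, table, properties, batchsize=10000, write_partitions=None):
    """Hypothetical helper: write a Spark DataFrame to a JDBC table.

    `df` is expected to be a pyspark.sql.DataFrame; `properties` carries
    JDBC options such as user, password, and driver.
    """
    if write_partitions is not None:
        # Fewer partitions means fewer simultaneous JDBC connections.
        df = df.coalesce(write_partitions)

    (
        df.write.format("jdbc")
        .option("url", url)
        .option("dbtable", table)
        .option("batchsize", batchsize)  # rows per JDBC batch INSERT
        .options(**properties)
        .mode("append")  # assumed mode, for illustration only
        .save()
    )
```

With `write_partitions: 4` and `batchsize: 10000`, a call such as `write_table(df, url, "stg_orders", properties, batchsize=10000, write_partitions=4)` would write over at most four connections, each sending rows in batches of up to 10,000.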

All Table Parameters

| Parameter | Type | Default | Description |
|-----------|------|---------|-------------|
| name | string | required | Source table name |
| target_name | string | required | MariaDB destination table name |
| replication_method | full / incremental | full | Replication strategy |
| batchsize | int | 10000 | Rows per JDBC batch insert |
| write_partitions | int | | Coalesce DataFrame to N partitions before writing |
| write_strategy | string | | One of append, replace, upsert, merge; defaults to replace for full and append for incremental replication |
| write_key | list | | Key columns; required when write_strategy is upsert or merge |
| dedup_columns | list | | Columns used for mkpipe_id hash deduplication |
| tags | list | [] | Tags for selective pipeline execution |
| pass_on_error | bool | false | Skip table on error instead of failing |
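
The interaction between `replication_method`, `write_strategy`, and `write_key` can be summarized in a small validation sketch. The function `resolve_write_settings` is hypothetical and only encodes the rules stated in this document (default strategy per replication method, and `write_key` being required for upsert/merge); the plugin's own validation may differ.

```python
VALID_STRATEGIES = {"append", "replace", "upsert", "merge"}


def resolve_write_settings(table: dict) -> dict:
    """Hypothetical helper: apply documented defaults and check write_key."""
    method = table.get("replication_method", "full")
    if method not in ("full", "incremental"):
        raise ValueError(f"unknown replication_method: {method!r}")

    # Documented defaults: replace for full, append for incremental.
    default_strategy = "replace" if method == "full" else "append"
    strategy = table.get("write_strategy") or default_strategy
    if strategy not in VALID_STRATEGIES:
        raise ValueError(f"unknown write_strategy: {strategy!r}")

    write_key = list(table.get("write_key") or [])
    if strategy in ("upsert", "merge") and not write_key:
        raise ValueError(f"write_key is required for {strategy}")

    return {
        "replication_method": method,
        "write_strategy": strategy,
        "write_key": write_key,
        "batchsize": table.get("batchsize", 10000),
    }


settings = resolve_write_settings(
    {"name": "public.orders", "target_name": "stg_orders",
     "write_strategy": "upsert", "write_key": ["id"]}
)
print(settings["write_strategy"], settings["write_key"])  # upsert ['id']
```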
