A liquibase datasource client for python

These details have not been verified by PyPI

Project links

License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3

Project description

liquibase数据源驱动

liquibase引入python3脚本，统一管理管理mongo、clickhouse的库表结构。changelog记录还是选在记录到mysql中，这样业务上会更加灵活

<changeSet id="xxxxx" author="xxxxxx" labels="mongo">
    <comment>xxxxx</comment>
    <executeCommand executable="python3">
        <arg value="script/db_tag/creat_collection.py"/>
    </executeCommand>
</changeSet>

#!/usr/bin/env python3
# -*- coding: utf-8 -*-

from liquibase_datasource import *


def create_tag_database():
    # 获取mongo链接实例
    client = get_client(filepath)


    return client[db]


def create_tag_collection():
    # 获取mongo链接实例
    db_name = get_tenant_shard(tag_database)
    client = get_client()
    db = client[db_name]

    # db开启分片
    client.admin.command("enableSharding", db_name)

    db.create_collection(tag_collection)

    # 创建索引
    coll = db[tag_collection]
    coll.create_index(
        [("id", 1), ("name", 1)])


if __name__ == "__main__":
    # 创建标签集合
    create_tag_database()[run.py](..%2F..%2F..%2FDownloads%2Frun.py)
    create_tag_collection()

Iceberg 建表

通过 PySpark + REST Catalog（如 Polaris）执行 Iceberg SQL 语句，支持完整的 Spark SQL 语法和多种存储后端。

前置要求

Java 8/11/17（PySpark 运行需要 JVM 环境）
Python 3.8+

# 检查 Java 版本
java -version

# 安装依赖
pip install -r requirements.txt

配置说明

在 liquibase.properties 中添加 Iceberg 相关配置：

基础配置（必填）：

iceberg.catalog.name=my_catalog
iceberg.catalog.type=rest
iceberg.catalog.uri=http://your-polaris-server:8080/api/catalog
iceberg.catalog.warehouse=my_catalog

REST Catalog 认证（Polaris 等需要 OAuth2 认证的 Catalog）：

iceberg.catalog.credential=root:s3cr3t
iceberg.catalog.scope=PRINCIPAL_ROLE:ALL

存储配置 — 腾讯云 COS：

iceberg.s3.type=cos
iceberg.s3.endpoint=cos.ap-guangzhou.myqcloud.com
iceberg.s3.access_key=your-secret-id
iceberg.s3.secret_key=your-secret-key

存储配置 — AWS S3：

iceberg.s3.type=s3
iceberg.s3.endpoint=s3.amazonaws.com
iceberg.s3.access_key=your-access-key
iceberg.s3.secret_key=your-secret-key
iceberg.s3.region=us-east-1

存储配置 — MinIO：

iceberg.s3.type=minio
iceberg.s3.endpoint=http://minio:9000
iceberg.s3.access_key=minioadmin
iceberg.s3.secret_key=minioadmin

存储配置 — 阿里云 OSS：

iceberg.s3.type=oss
iceberg.s3.endpoint=oss-cn-hangzhou.aliyuncs.com
iceberg.s3.access_key=your-access-key
iceberg.s3.secret_key=your-secret-key

集群模式 配置前缀改为 {cluster}.iceberg.*，例如：

cluster1.iceberg.catalog.name=my_catalog
cluster1.iceberg.s3.type=cos
cluster1.iceberg.s3.endpoint=cos.ap-guangzhou.myqcloud.com
...

代码示例

方式一：Python API 建表

from liquiclient.iceberg_client import get_iceberg_client
from pyiceberg.schema import Schema
from pyiceberg.types import LongType, StringType, NestedField

# 获取 pyiceberg catalog 实例
catalog = get_iceberg_client()

# 列出 namespaces
namespaces = catalog.list_namespaces()

# 创建 namespace
catalog.create_namespace_if_not_exists("my_db")

# 建表
schema = Schema(
    NestedField(field_id=1, name="id", field_type=LongType(), required=True),
    NestedField(field_id=2, name="name", field_type=StringType(), required=False),
)
table = catalog.create_table("my_db.my_table", schema=schema)

方式二：Spark SQL 建表（推荐）

基于 PySpark，支持完整的 Spark SQL Iceberg 语法，包括 DDL、DML、查询等：

from liquiclient.iceberg_client import execute_iceberg_sql, stop_iceberg_spark

# 创建 Iceberg 表
execute_iceberg_sql("""
    CREATE TABLE IF NOT EXISTS my_db.users (
        id BIGINT NOT NULL COMMENT '用户ID',
        name STRING COMMENT '用户名',
        age INT,
        created_at TIMESTAMP
    )
    USING iceberg
    PARTITIONED BY (day(created_at))
""")

# 带分区 + 表属性
execute_iceberg_sql("""
    CREATE TABLE my_db.events (
        id BIGINT NOT NULL,
        event_type STRING,
        amount DECIMAL(18, 2),
        event_time TIMESTAMP
    )
    USING iceberg
    PARTITIONED BY (day(event_time), bucket(16, id))
    TBLPROPERTIES ('write.format.default' = 'parquet')
""")

# 插入数据
execute_iceberg_sql("INSERT INTO my_db.users VALUES (1, 'test', 25, current_timestamp())")

# 查询数据
df = execute_iceberg_sql("SELECT * FROM my_db.users")
df.show()

# ALTER TABLE
execute_iceberg_sql("ALTER TABLE my_db.users ADD COLUMN email STRING COMMENT '邮箱'")

# 删除表
execute_iceberg_sql("DROP TABLE IF EXISTS my_db.users")

# 创建 namespace
execute_iceberg_sql("CREATE NAMESPACE IF NOT EXISTS my_db")

# 集群模式
execute_iceberg_sql("CREATE TABLE my_db.t1 (id BIGINT) USING iceberg", cluster="cluster1")

# 批量执行
from liquiclient.iceberg_client import execute_iceberg_sql_batch
execute_iceberg_sql_batch("""
    CREATE NAMESPACE IF NOT EXISTS my_db;
    CREATE TABLE IF NOT EXISTS my_db.t1 (id BIGINT, name STRING) USING iceberg;
    INSERT INTO my_db.t1 VALUES (1, 'hello')
""")

# 使用完毕后停止 SparkSession
stop_iceberg_spark()

支持的 SQL 语法（Spark SQL 完整语法）：

功能	语法示例
建表	`CREATE TABLE ... USING iceberg`
删表	`DROP TABLE IF EXISTS ...`
改表	`ALTER TABLE ... ADD/DROP/RENAME COLUMN`
插入	`INSERT INTO ...`, `INSERT OVERWRITE ...`
查询	`SELECT ...`, `MERGE INTO ...`
分区	`PARTITIONED BY (col)`, `PARTITIONED BY (year(ts), bucket(16, id))`
表属性	`TBLPROPERTIES ('key' = 'value')`
Namespace	`CREATE/DROP NAMESPACE/SCHEMA/DATABASE`
快照	`SELECT * FROM my_table.snapshots`
时间旅行	`SELECT * FROM my_table TIMESTAMP AS OF '2024-01-01'`

验证脚本

项目提供了 test_iceberg.py 验证脚本，可用于快速验证连接和建表：

# 1. 编辑 test_iceberg.py 顶部的配置区域，填入你的实际信息
# 2. 运行验证
python test_iceberg.py

发布包

python3 -m pip install --upgrade build
python3 -m build
python3 -m pip install --upgrade twine
python3 -m twine upload  dist/*

Project details

These details have not been verified by PyPI

Project links

License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3

Release history Release notifications | RSS feed

1.3.6

Apr 20, 2026

1.3.5

Apr 9, 2026

1.3.4

Apr 9, 2026

1.3.3

Apr 8, 2026

This version

1.3.2

Apr 7, 2026

1.3.1

Mar 25, 2026

1.3.0

Mar 24, 2026

1.2.21

Jul 18, 2025

1.2.20

Jul 18, 2025

1.2.19

Jul 16, 2025

1.2.18

Jul 16, 2025

1.2.17

Jul 8, 2025

1.2.16

Oct 11, 2024

1.2.15

Sep 14, 2024

1.2.13

Apr 8, 2024

1.2.12

Apr 8, 2024

1.2.11

Apr 1, 2024

1.2.10

Mar 7, 2024

1.2.9

Feb 27, 2024

1.2.7

Jul 28, 2023

1.2.6

Jul 24, 2023

1.2.5

Jul 19, 2023

1.2.3

Jul 19, 2023

1.2.1

Jun 6, 2023

1.2.0

Jun 6, 2023

1.1.9

Jun 6, 2023

1.1.8

Jun 6, 2023

1.1.7

Jun 6, 2023

1.1.6

Jun 6, 2023

1.1.5

Jun 6, 2023

1.1.4

Jun 6, 2023

1.1.3

Jun 6, 2023

1.1.2

Jun 6, 2023

1.1.1

Jun 6, 2023

1.1.0

Jun 6, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

liquisource-1.3.2.tar.gz (13.0 kB view details)

Uploaded Apr 7, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

liquisource-1.3.2-py3-none-any.whl (13.5 kB view details)

Uploaded Apr 7, 2026 Python 3

File details

Details for the file liquisource-1.3.2.tar.gz.

File metadata

Download URL: liquisource-1.3.2.tar.gz
Upload date: Apr 7, 2026
Size: 13.0 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.9.6

File hashes

Hashes for liquisource-1.3.2.tar.gz
Algorithm	Hash digest
SHA256	`fd001d63b17e27b0d8198201ce7b4756c0c27a0b85d0ae41dbc11446ea836342`
MD5	`40e48d5804c99084ea8b94da592cb418`
BLAKE2b-256	`dfc7bf8f886eeb76b4d2e906b644dea2c43667a78934af95706c5ecfebdb2f3a`

See more details on using hashes here.

File details

Details for the file liquisource-1.3.2-py3-none-any.whl.

File metadata

Download URL: liquisource-1.3.2-py3-none-any.whl
Upload date: Apr 7, 2026
Size: 13.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.9.6

File hashes

Hashes for liquisource-1.3.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`3b983ffb812541cea65b207b8ed9938239ab54f71b53fff645d408a103cc526e`
MD5	`f4628be2cec4c35d18037bc8eb463b4c`
BLAKE2b-256	`4255e4af7ac4c600901dcb90d42f29ad0a25b96b9287a4a5401efe0a84412f52`

See more details on using hashes here.

liquisource 1.3.2

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

liquibase数据源驱动

Iceberg 建表

前置要求

配置说明

代码示例

方式一：Python API 建表

方式二：Spark SQL 建表（推荐）

验证脚本

发布包

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes