Skip to main content

Python SDK for Kimberlite database

Project description

Kimberlite Python SDK

Status: 🚧 In Progress (Phase 11.2)

Pythonic client library for Kimberlite database.

Installation

pip install kimberlite

Quick Start

Stream Operations

from kimberlite import Client, DataClass

# Connect with context manager
with Client.connect(
    addresses=["localhost:5432"],
    tenant_id=1,
    auth_token="secret"
) as client:
    # Create stream
    stream_id = client.create_stream("events", DataClass.PHI)

    # Append events
    events = [b"event1", b"event2", b"event3"]
    offset = client.append(stream_id, events)

    # Read events
    results = client.read(stream_id, from_offset=0, max_bytes=1024)
    for event in results:
        print(f"Offset {event.offset}: {event.data}")

SQL Queries

from kimberlite import Client, Value

with Client.connect(addresses=["localhost:5432"], tenant_id=1) as client:
    # Create table
    client.execute("""
        CREATE TABLE users (
            id BIGINT PRIMARY KEY,
            name TEXT,
            email TEXT,
            active BOOLEAN,
            created_at TIMESTAMP
        )
    """)

    # Insert data with parameterized queries
    client.execute(
        "INSERT INTO users (id, name, email, active, created_at) VALUES ($1, $2, $3, $4, $5)",
        [
            Value.bigint(1),
            Value.text("Alice"),
            Value.text("alice@example.com"),
            Value.boolean(True),
            Value.timestamp(1609459200_000_000_000)  # 2021-01-01 UTC
        ]
    )

    # Query data
    result = client.query(
        "SELECT * FROM users WHERE active = $1",
        [Value.boolean(True)]
    )

    for row in result.rows:
        id_val = row[result.columns.index('id')]
        name_val = row[result.columns.index('name')]
        print(f"User {id_val.data}: {name_val.data}")

    # Point-in-time query (compliance audit)
    from kimberlite.types import Offset
    historical_offset = Offset(1000)
    historical_result = client.query_at(
        "SELECT COUNT(*) FROM users",
        [],
        historical_offset
    )

Features

Stream Operations

  • Create and manage event streams
  • Append events with automatic batching
  • Read events with offset-based pagination
  • Type hints for IDE autocomplete

SQL Query Engine

  • Core SQL: SELECT (aggregates, GROUP BY/HAVING, UNION, INNER/LEFT JOIN, CTEs, subqueries, window functions), INSERT, UPDATE, DELETE, DDL
  • Parameterized queries with type-safe Value objects
  • Point-in-time queries (AT OFFSET) for compliance audits; AS OF TIMESTAMP planned v0.6
  • All SQL types: NULL, BIGINT, TEXT, BOOLEAN, TIMESTAMP

Python Integration

  • Context managers for automatic resource cleanup
  • Type hints and mypy strict mode support
  • Rich exception hierarchy for error handling
  • Pythonic API design

Compliance Features

  • Query historical state at any log position
  • Immutable audit trail
  • Data classification (PHI, Non-PHI, De-identified)

Usage Examples

Working with Value Types

from kimberlite import Value
from datetime import datetime

# Create values
null_val = Value.null()
int_val = Value.bigint(42)
text_val = Value.text("Hello, 世界!")
bool_val = Value.boolean(True)
ts_val = Value.timestamp(1609459200_000_000_000)

# From Python datetime
dt = datetime(2024, 1, 1, 12, 0, 0)
ts_from_dt = Value.from_datetime(dt)

# Convert timestamp back to datetime
dt_back = ts_val.to_datetime()
print(dt_back.isoformat())  # "2021-01-01T00:00:00"

CRUD Operations

# CREATE
client.execute("""
    CREATE TABLE products (
        id BIGINT PRIMARY KEY,
        name TEXT,
        price BIGINT,
        in_stock BOOLEAN
    )
""")

# INSERT
client.execute(
    "INSERT INTO products (id, name, price, in_stock) VALUES ($1, $2, $3, $4)",
    [Value.bigint(1), Value.text("Widget"), Value.bigint(1999), Value.boolean(True)]
)

# UPDATE
client.execute(
    "UPDATE products SET price = $1 WHERE id = $2",
    [Value.bigint(2499), Value.bigint(1)]
)

# DELETE
client.execute(
    "DELETE FROM products WHERE id = $1",
    [Value.bigint(1)]
)

# SELECT
result = client.query("SELECT * FROM products WHERE in_stock = $1", [Value.boolean(True)])
for row in result.rows:
    print(row)

Compliance Audit Example

from kimberlite.types import Offset

# Record initial state
checkpoint_offset = Offset(client.log_position())  # Hypothetical API

# Make changes
client.execute("UPDATE users SET email = $1 WHERE id = $2", [
    Value.text("newemail@example.com"),
    Value.bigint(1)
])

# Later: Audit what the state was at checkpoint
historical_result = client.query_at(
    "SELECT email FROM users WHERE id = $1",
    [Value.bigint(1)],
    checkpoint_offset
)
# Returns the old email, proving what the state was at that point in time

Documentation

Installation (Development)

# Build FFI library
cd ../../
cargo build -p kimberlite-ffi

# Install Python SDK in development mode
cd sdks/python
pip install -e .

Development Status

SDK Implementation:

  • ctypes-based FFI wrapper
  • Stream operations (create, append, read)
  • SQL query engine (SELECT, INSERT, UPDATE, DELETE, DDL)
  • Parameterized queries with Value types
  • Point-in-time queries (query_at)
  • Type hints and mypy strict mode
  • Comprehensive unit tests (48+ tests for values, 5+ for queries)
  • Integration tests
  • Wheel distribution with bundled binaries
  • PyPI publishing

Value Type System:

  • NULL, BIGINT, TEXT, BOOLEAN, TIMESTAMP
  • DateTime conversion helpers
  • Equality and hashing support
  • Type-safe constructors

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

kimberlite-0.8.0-py3-none-win_amd64.whl (376.0 kB view details)

Uploaded Python 3Windows x86-64

kimberlite-0.8.0-py3-none-manylinux_2_17_x86_64.whl (516.7 kB view details)

Uploaded Python 3manylinux: glibc 2.17+ x86-64

kimberlite-0.8.0-py3-none-macosx_11_0_arm64.whl (469.2 kB view details)

Uploaded Python 3macOS 11.0+ ARM64

File details

Details for the file kimberlite-0.8.0-py3-none-win_amd64.whl.

File metadata

  • Download URL: kimberlite-0.8.0-py3-none-win_amd64.whl
  • Upload date:
  • Size: 376.0 kB
  • Tags: Python 3, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.15

File hashes

Hashes for kimberlite-0.8.0-py3-none-win_amd64.whl
Algorithm Hash digest
SHA256 07358dc5590d6c9db29e767c9d0ab9c7b5da3b4f1c00bb9450fd13480230d9fd
MD5 3d9e312c15e8fd2d41aa54ab775541ff
BLAKE2b-256 1872cf5f36632d331d41ccad4d422ae1b94309c15fd19d422e066a850112562c

See more details on using hashes here.

File details

Details for the file kimberlite-0.8.0-py3-none-manylinux_2_17_x86_64.whl.

File metadata

File hashes

Hashes for kimberlite-0.8.0-py3-none-manylinux_2_17_x86_64.whl
Algorithm Hash digest
SHA256 c6bc89ea622ba830be9ca267ec55657bd6f12fe1ba47cc1aafde1e45fc384161
MD5 433da610be6e62b1e93f675c770c9a3b
BLAKE2b-256 b9c989640fc2159f244989fbec0f23272faf572df0595835649ad507f288ff94

See more details on using hashes here.

File details

Details for the file kimberlite-0.8.0-py3-none-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for kimberlite-0.8.0-py3-none-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 a978f2ddf5b84a0c0829beb7843170c25e0c1b97b54ab2101c2e1c79296621b2
MD5 067e6db40073763323d97e031c2e94c1
BLAKE2b-256 54a471332ae7789c4727c4e8862b87e58f001871f0cf6b97a3272724ebf34491

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page