Skip to main content

Python SDK for Kimberlite database

Project description

Kimberlite Python SDK

Status: 🚧 In Progress (Phase 11.2)

Pythonic client library for Kimberlite database.

Installation

pip install kimberlite

Quick Start

Stream Operations

from kimberlite import Client, DataClass

# Connect with context manager
with Client.connect(
    addresses=["localhost:5432"],
    tenant_id=1,
    auth_token="secret"
) as client:
    # Create stream
    stream_id = client.create_stream("events", DataClass.PHI)

    # Append events
    events = [b"event1", b"event2", b"event3"]
    offset = client.append(stream_id, events)

    # Read events
    results = client.read(stream_id, from_offset=0, max_bytes=1024)
    for event in results:
        print(f"Offset {event.offset}: {event.data}")

SQL Queries

from kimberlite import Client, Value

with Client.connect(addresses=["localhost:5432"], tenant_id=1) as client:
    # Create table
    client.execute("""
        CREATE TABLE users (
            id BIGINT PRIMARY KEY,
            name TEXT,
            email TEXT,
            active BOOLEAN,
            created_at TIMESTAMP
        )
    """)

    # Insert data with parameterized queries
    client.execute(
        "INSERT INTO users (id, name, email, active, created_at) VALUES ($1, $2, $3, $4, $5)",
        [
            Value.bigint(1),
            Value.text("Alice"),
            Value.text("alice@example.com"),
            Value.boolean(True),
            Value.timestamp(1609459200_000_000_000)  # 2021-01-01 UTC
        ]
    )

    # Query data
    result = client.query(
        "SELECT * FROM users WHERE active = $1",
        [Value.boolean(True)]
    )

    for row in result.rows:
        id_val = row[result.columns.index('id')]
        name_val = row[result.columns.index('name')]
        print(f"User {id_val.data}: {name_val.data}")

    # Point-in-time query (compliance audit)
    from kimberlite.types import Offset
    historical_offset = Offset(1000)
    historical_result = client.query_at(
        "SELECT COUNT(*) FROM users",
        [],
        historical_offset
    )

Features

Stream Operations

  • Create and manage event streams
  • Append events with automatic batching
  • Read events with offset-based pagination
  • Type hints for IDE autocomplete

SQL Query Engine

  • Core SQL: SELECT (aggregates, GROUP BY/HAVING, UNION, INNER/LEFT JOIN, CTEs, subqueries, window functions), INSERT, UPDATE, DELETE, DDL
  • Parameterized queries with type-safe Value objects
  • Point-in-time queries (AT OFFSET) for compliance audits; AS OF TIMESTAMP planned v0.6
  • All SQL types: NULL, BIGINT, TEXT, BOOLEAN, TIMESTAMP

Python Integration

  • Context managers for automatic resource cleanup
  • Type hints and mypy strict mode support
  • Rich exception hierarchy for error handling
  • Pythonic API design

Compliance Features

  • Query historical state at any log position
  • Immutable audit trail
  • Data classification (PHI, Non-PHI, De-identified)

Usage Examples

Working with Value Types

from kimberlite import Value
from datetime import datetime

# Create values
null_val = Value.null()
int_val = Value.bigint(42)
text_val = Value.text("Hello, 世界!")
bool_val = Value.boolean(True)
ts_val = Value.timestamp(1609459200_000_000_000)

# From Python datetime
dt = datetime(2024, 1, 1, 12, 0, 0)
ts_from_dt = Value.from_datetime(dt)

# Convert timestamp back to datetime
dt_back = ts_val.to_datetime()
print(dt_back.isoformat())  # "2021-01-01T00:00:00"

CRUD Operations

# CREATE
client.execute("""
    CREATE TABLE products (
        id BIGINT PRIMARY KEY,
        name TEXT,
        price BIGINT,
        in_stock BOOLEAN
    )
""")

# INSERT
client.execute(
    "INSERT INTO products (id, name, price, in_stock) VALUES ($1, $2, $3, $4)",
    [Value.bigint(1), Value.text("Widget"), Value.bigint(1999), Value.boolean(True)]
)

# UPDATE
client.execute(
    "UPDATE products SET price = $1 WHERE id = $2",
    [Value.bigint(2499), Value.bigint(1)]
)

# DELETE
client.execute(
    "DELETE FROM products WHERE id = $1",
    [Value.bigint(1)]
)

# SELECT
result = client.query("SELECT * FROM products WHERE in_stock = $1", [Value.boolean(True)])
for row in result.rows:
    print(row)

Compliance Audit Example

from kimberlite.types import Offset

# Record initial state
checkpoint_offset = Offset(client.log_position())  # Hypothetical API

# Make changes
client.execute("UPDATE users SET email = $1 WHERE id = $2", [
    Value.text("newemail@example.com"),
    Value.bigint(1)
])

# Later: Audit what the state was at checkpoint
historical_result = client.query_at(
    "SELECT email FROM users WHERE id = $1",
    [Value.bigint(1)],
    checkpoint_offset
)
# Returns the old email, proving what the state was at that point in time

Documentation

Installation (Development)

# Build FFI library
cd ../../
cargo build -p kimberlite-ffi

# Install Python SDK in development mode
cd sdks/python
pip install -e .

Development Status

SDK Implementation:

  • ctypes-based FFI wrapper
  • Stream operations (create, append, read)
  • SQL query engine (SELECT, INSERT, UPDATE, DELETE, DDL)
  • Parameterized queries with Value types
  • Point-in-time queries (query_at)
  • Type hints and mypy strict mode
  • Comprehensive unit tests (48+ tests for values, 5+ for queries)
  • Integration tests
  • Wheel distribution with bundled binaries
  • PyPI publishing

Value Type System:

  • NULL, BIGINT, TEXT, BOOLEAN, TIMESTAMP
  • DateTime conversion helpers
  • Equality and hashing support
  • Type-safe constructors

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

kimberlite-0.7.0-py3-none-win_amd64.whl (374.1 kB view details)

Uploaded Python 3Windows x86-64

kimberlite-0.7.0-py3-none-manylinux_2_17_x86_64.whl (515.0 kB view details)

Uploaded Python 3manylinux: glibc 2.17+ x86-64

kimberlite-0.7.0-py3-none-macosx_11_0_arm64.whl (467.4 kB view details)

Uploaded Python 3macOS 11.0+ ARM64

File details

Details for the file kimberlite-0.7.0-py3-none-win_amd64.whl.

File metadata

  • Download URL: kimberlite-0.7.0-py3-none-win_amd64.whl
  • Upload date:
  • Size: 374.1 kB
  • Tags: Python 3, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.15

File hashes

Hashes for kimberlite-0.7.0-py3-none-win_amd64.whl
Algorithm Hash digest
SHA256 5983eb1459b69e7f8a26a85dc29523eb7a523f7a46022a3f7ba409b7e9671d39
MD5 de385a694a08561c038c5a9a857e2ac4
BLAKE2b-256 a5c878b41e17a2b11ed9000717e6448a7704445b1578226eac3ca7d795563f1d

See more details on using hashes here.

File details

Details for the file kimberlite-0.7.0-py3-none-manylinux_2_17_x86_64.whl.

File metadata

File hashes

Hashes for kimberlite-0.7.0-py3-none-manylinux_2_17_x86_64.whl
Algorithm Hash digest
SHA256 d14aa04c95f3d49caed09889966df8e4f18788b9fb4e659900ea67c99f34e417
MD5 9de1fecea52ae4787ed4497846a7529a
BLAKE2b-256 00d0b02c9042f011a18e1af4c336b661f08b085f3ed6efeb65930a0db74a4232

See more details on using hashes here.

File details

Details for the file kimberlite-0.7.0-py3-none-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for kimberlite-0.7.0-py3-none-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 0a17bc95d631ace918bf6d1aa705acc400795c1163a5aabff78568e436a6245e
MD5 84d2a845fda5302e3204855ffba9a877
BLAKE2b-256 c692e4f1628d879c3c853b142949e500c60c9839a54cf093a5b92760cf9fd055

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page