Skip to main content

A DuckDB server for SQLRooms

Project description

SQLRooms DuckDB Server

PyPi

A Python-based server that runs a local DuckDB instance and supports queries over WebSockets, returning data in either Apache Arrow or JSON format. The server was originally developed for use with SQLRooms, but can be readily used as a generic DuckDB server in other contexts.

Note: This server was initially created as a fork of Mosaic DuckDB Server, with additional features and improvements.

Note: This package provides a local DuckDB server. To instead use SQLRooms with DuckDB-WASM in the browser, stick to the default WasmDuckDbConnector.

Features

  • WebSocket endpoint (WS-only)
  • Arrow IPC results (binary framed) and JSON responses
  • Concurrent query execution using a shared thread pool (per-task cursors)
  • Per-query cancellation via duckdb.interrupt (WebSocket type:"cancel")
  • WebSocket multiplexing with queryId correlation
  • Subscribe/notify over WebSocket using Socketify publish/subscribe
  • Disk-backed result cache with per-key locking (prevents duplicate compute)
  • One-time retry on transaction conflicts (e.g., concurrent UPDATE vs ALTER)
  • Graceful shutdown (SIGINT/SIGTERM): cancel queries, FORCE CHECKPOINT, close, stop executor
  • Optional bearer authentication for HTTP and WebSocket endpoints

Installation and usage

We recommend running the server in an isolated environment with uvx. For example, to directly run the server, use:

uvx sqlrooms-duckdb-server

Alternatively, you can install the server with pip install sqlrooms-duckdb-server. Then you can start the server with sqlrooms-duckdb-server --db-path /absolute/path/to/my.db --port 4000.

Command-line arguments

  • --db-path (optional): Path to DuckDB database file. Defaults to :memory:.

  • --port (default: 4000): Port to listen on.

  • --extensions (optional): Comma-separated list of extensions to preload. Examples:

    • httpfs
    • spatial
    • h3@community
  • --auth-token (optional): If provided, enables bearer authentication. WebSocket clients must first send { "type": "auth", "token": "<TOKEN>" }.

Examples:

# In-memory DB with httpfs only (default)
uv run sqlrooms-duckdb-server

# File-backed DB with multiple extensions
uv run sqlrooms-duckdb-server --db-path /tmp/my.db --port 4000 --extensions httpfs,spatial,h3@community

Developer Setup

We use uv to manage our development setup.

Start the server with:

uv run sqlrooms-duckdb-server --db-path /absolute/path/to/my.db

Run uv run ruff check --fix and uv run ruff format to lint the code.

To run the tests, use uv run pytest.

To set up a local certificate for SSL, use https://github.com/FiloSottile/mkcert.

API

The server supports queries via WebSockets only. Minimal HTTP endpoints are provided for health and diagnostics.

Health Endpoints (HTTP)

  • GET /healthz: returns ok when the process is healthy.
  • GET /readyz: returns ok when DuckDB is initialized; 503 otherwise.
  • GET /version: returns JSON with version info.

WebSocket

  • URL: ws://localhost:<port>/
  • Messages are JSON text frames unless returning Arrow bytes.

Authentication (optional):

  • If started with an auth token, the client must send the first message:

    {"type": "auth", "token": "<TOKEN>"}
    
  • On success, server replies { "type": "authAck" }. Any other message before auth results in an error and the connection is closed.

Supported messages:

  • Query (optional queryId):

    {"type":"arrow","sql":"select 1 as x","queryId":"q1"}
    {"type":"json","sql":"select 1 as x","queryId":"q2"}
    {"type":"exec","sql":"create table t(x int)","queryId":"q3"}
    
  • Result correlation (Arrow): binary frame

    • Layout: [4-byte big-endian length][header JSON][arrow bytes]
    • Header JSON example: { "type": "arrow", "queryId": "q1" }
  • Result correlation (JSON/OK): text frame

    {"type":"json","queryId":"q2","data":"[{\"x\":1}]"}
    {"type":"ok","queryId":"q3"}
    
  • Cancel in-flight query:

    {"type": "cancel", "queryId": "q2"}
    
    • Response: { "type":"cancelAck","queryId":"q2","cancelled":true }
    • If the query is already finishing, you may receive the final result instead of an error.
  • Subscribe/Notify (server-side notifications):

    {"type":"subscribe","channel":"table:orders"}
    {"type":"notify","channel":"table:orders","payload":{"op":"update"}}
    
    • Subscribe ack: { "type":"subscribed","channel":"table:orders" }
    • Notify:
      • Broadcast to subscribers via app.publish
      • Immediate echo to the sender: { "type":"notify","channel":"table:orders","payload":{"op":"update"} }
      • Ack: { "type":"notifyAck","channel":"table:orders" }

Concurrency & Cancellation

  • DuckDB work runs in a shared thread pool with per-task cursors.
  • Per-query cancellation is supported via duckdb.interrupt.
  • WebSocket multiplexing uses queryId correlation in headers/payloads.
  • One-time retry on transaction conflicts (e.g., concurrent UPDATE vs ALTER).

Notes

  • Graceful shutdown: SIGINT/SIGTERM cancel in-flight queries, FORCE CHECKPOINT, close connection, stop executor.
  • Auth token can be supplied via CLI --auth-token.

Publishing

Run the build with uv build. Then test publish with pnpm prerelease. We publish using tokens so when asked, set the username to __token__ and then use your token as the password. Alternatively, create a .pypirc file.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sqlrooms_duckdb_server-0.2.2.tar.gz (114.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

sqlrooms_duckdb_server-0.2.2-py3-none-any.whl (21.2 kB view details)

Uploaded Python 3

File details

Details for the file sqlrooms_duckdb_server-0.2.2.tar.gz.

File metadata

  • Download URL: sqlrooms_duckdb_server-0.2.2.tar.gz
  • Upload date:
  • Size: 114.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.11

File hashes

Hashes for sqlrooms_duckdb_server-0.2.2.tar.gz
Algorithm Hash digest
SHA256 5bdbe905027dd18bcfbe57e68c9c3406773a1845c07efc00767a9b44bfcfdece
MD5 4052c110f46befdd20db348d5d5cfcb5
BLAKE2b-256 76522cd56acd984fdcc96b5c86e6de3181938f33345a1140878b6765afc440df

See more details on using hashes here.

File details

Details for the file sqlrooms_duckdb_server-0.2.2-py3-none-any.whl.

File metadata

File hashes

Hashes for sqlrooms_duckdb_server-0.2.2-py3-none-any.whl
Algorithm Hash digest
SHA256 9f0f5d48279c2b09e89d893a22fd8d2faa684d24a724ddd751f4dada465540e4
MD5 7e1c152eeb345c681952ece16c2838af
BLAKE2b-256 08b19b296e086a98eea80b8c3cbe7140ae4140df63185f622683da13e90e660f

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page