A Pythonic client library for streaming data with ZebraStream.

These details have not been verified by PyPI

Project description

zebrastream-io

Python IO interface for ZebraStream data streaming services.

Disclaimer:
The code in this package is considered pre-production quality. APIs and functionality may change without notice. Use with caution in production environments.

Features

File-like synchronous interface for ZebraStream data streams
Async interface (internal, subject to change)
Easily extensible for other IO interfaces

Installation

pip install zebrastream-io

Usage

Synchronous file-like interface

The synchronous interface provides a familiar, file-like API for reading from and writing to ZebraStream data streams. This design allows you to interact with remote streams using standard Python file IO, making integration with existing codebases straightforward. The goal is to offer a simple and reliable way to handle streaming data without requiring knowledge of asynchronous programming or custom protocols.

Producer

import zebrastream.io.file as zsfile
import time

with zsfile.open(mode="w", stream_path="/my-stream", access_token=token) as f:
    f.write("Hello!")
    f.flush()  # force send buffer
    time.sleep(10)
    f.write("This is ZebraStream")

Real-time log streaming

For real-time applications like log streaming, use auto_flush_delay to automatically flush buffered data:

import zebrastream.io.file as zsfile

# Instant flush (0) - flushes immediately on every write (low-latency)
with zsfile.open(mode="w", stream_path="/alerts", access_token=token, auto_flush_delay=0) as f:
    f.write("CRITICAL: system down\n")  # Flushed immediately

# Timer-based flush (5 seconds) - balances latency and efficiency
with zsfile.open(mode="w", stream_path="/logs", access_token=token, auto_flush_delay=5) as f:
    for log_line in generate_logs():
        f.write(log_line + "\n")
        # Data is automatically flushed within 5 seconds

Use auto_flush_delay=0 for alerts and metrics where every millisecond counts. Use timer-based flushing (1+ seconds) for logs where you can tolerate a few seconds of delay.

Consumer

import zebrastream.io.file as zsfile

with zsfile.open(mode="r", stream_path="/my-stream", access_token=token) as f:
    for line in f:
        print(line, end="")

End-to-End Encryption

⚠️ Experimental: End-to-end encryption support is currently experimental and subject to change.

ZebraStream supports passphrase-based end-to-end encryption using an encryption scheme derived from age, a simple and secure file encryption format. When encryption is enabled, data is encrypted on the sender side before transmission and can only be decrypted by receivers with the correct passphrase. Follow the general security descriptions of the age project.

Two encryption modes are available via the encryption_mode parameter:

Mode	Description	Real-time suitability
`"age-rt"`	age-rt format — variable-size chunks, sends data as it becomes available	✅ Suitable for real-time streaming
`"age"`	Standard age v1 format — fixed 64 KiB blocks	⚠️ Not suitable for real-time use (see below)

Standard age ("age") and real-time streaming: The standard age format requires each encrypted block to be exactly 64 KiB (the final block may be shorter). This means the writer must accumulate a full 64 KiB of plaintext before it can encrypt and transmit a chunk. As a result, auto_flush_delay and manual flush() calls have no effect on transmission timing — data will not be sent until the buffer reaches 64 KiB or the stream is closed. Use "age-rt" for real-time applications such as log streaming, alerts, or interactive pipelines. Use "age" only when compatibility with standard age tooling is required and real-time delivery is not needed (e.g. bulk file transfers).

import zebrastream.io.file as zsfile
import time

# Producer - real-time streaming with encryption (age-rt mode)
with zsfile.open(mode="w", stream_path="/my-stream",
                 access_token=token,
                 passphrase="secret",
                 encryption_mode="age-rt") as f:
    f.write("This is")
    f.flush()      # data is transmitted immediately
    time.sleep(10)
    f.write("encrypted data")

# Producer - bulk transfer with standard age format
with zsfile.open(mode="w", stream_path="/my-stream",
                 access_token=token,
                 passphrase="secret",
                 encryption_mode="age") as f:
    with open("large-file.bin", "rb") as src:
        f.write(src.read())  # transmitted in 64 KiB blocks

# Consumer - decryption is automatic; both formats are detected
with zsfile.open(mode="r", stream_path="/my-stream",
                 access_token=token,
                 passphrase="secret") as f:
    for line in f:
        print(line)

Async interface (unstable)

Async interface for performing network operations using the asyncio event loop.

This interface is currently non-public and subject to change, as it is under active development. The primary goal is to provide an internal, robust reference implementation for ZebraStream, leveraging Python's async/await syntax. At present, the implementation exclusively supports execution within the asyncio event loop, as it relies on the httpio library — the only request library currently offering reliable, full-duplex communication required for complete ZebraStream protocol support.

Future plans include stabilizing the API and exposing standard async streaming interfaces such as asyncio StreamReader/StreamWriter.

Producer

from zebrastream.io._core import AsyncWriter
import asyncio

async def main():
    async with AsyncWriter(stream_path="/my-stream", access_token=token) as writer:
        await writer.write(b"Hello!")
        await writer.flush()
        await asyncio.sleep(10)
        await writer.write("This is ZebraStream")

asyncio.run(main())

Consumer

from zebrastream.io._core import AsyncReader
import asyncio

async def main():
    async with AsyncReader(stream_path="/my-stream", access_token=token) as reader:
        while data := await reader.read_variable_block(4096):
            print(data.decode(), end="")

asyncio.run(main())

Command-Line Interface

A zebrastream CLI is included as an optional extra for streaming data between Unix pipelines and ZebraStream streams. The CLI serves as a reference implementation showcasing the Python SDK's capabilities, with a focus on correctness and protocol compliance rather than maximum performance.

Installation

pip install zebrastream-io[cli]

Usage

The CLI provides write and read subcommands with global options:

# Write from stdin
echo "Hello ZebraStream" | zebrastream write -s /my-stream

# Write from a producer command
zebrastream write -s /my-stream -- pg_dump mydb
zebrastream write --stream-path /my-stream -- sh -c "cat data.txt | gzip"

# Real-time log streaming with instant flush (low-latency)
zebrastream write -s /alerts -d 0 -- tail -f /var/log/critical.log

# Real-time log streaming with timer-based auto-flush (5 seconds)
zebrastream write -s /my-stream --auto-flush-delay 5 -- tail -f /var/log/app.log

# Encrypted real-time streaming (age-rt, default when passphrase is set)
zebrastream write -s /my-stream --passphrase secret -- tail -f /var/log/app.log
zebrastream write -s /my-stream --passphrase secret --encryption-mode age-rt -- tail -f /var/log/app.log

# Encrypted bulk transfer with standard age format (not real-time)
zebrastream write -s /my-stream --passphrase secret --encryption-mode age < large-file.bin

# Read to stdout
zebrastream read -s /my-stream > output.txt
zebrastream read --stream-path /my-stream | tar -xz

# Real-time log streaming with unbuffered output
zebrastream read -s /logs -u | grep ERROR

# Pipe into a consumer command
zebrastream read -s /my-stream -- tar -xz
zebrastream read -s /my-stream -- python process.py

# Global options (--log-level, --config-name, --config-file) come before subcommand
zebrastream --log-level info write -s /my-stream --connect-timeout 30 < data.bin
zebrastream --config-name production read -s /my-stream | jq .

# Stream path can come from config
zebrastream --config-name production write < data.txt

# Using explicit config file path
zebrastream --config-file ~/my-zebrastream-config.yaml write -s /my-stream

# Using environment variable for authentication
ZEBRASTREAM_ACCESS_TOKEN='your_token_here' zebrastream write -s /my-stream < file.txt

Configuration Files: Named configuration files should use the .yaml extension and be stored in ~/.config/zebrastream/streams/ for reusable settings. You can also specify an explicit file path with --config-file. When both are provided, --config-file takes precedence.

Config files must include a mode field (read or write) that matches the subcommand used — this prevents accidentally using a write config with read or vice versa. If stream_path is included, the -s/--stream-path CLI option can be omitted.

You can also specify a command in the config to define the producer (for write mode) or consumer (for read mode) command. This makes it easy to reuse complex pipeline configurations. Commands provided via CLI (-- syntax) take precedence over config commands.

# ~/.config/zebrastream/streams/my-feed.yaml
# Use with: zebrastream --config-name my-feed read

# Required: must match the subcommand (read or write)
mode: read

# Stream path (optional if provided on command line)
stream_path: /userspace/project/my-stream

# Access token — prefer ZEBRASTREAM_ACCESS_TOKEN env var to keep it out of the file
access_token: YOUR_ACCESS_TOKEN

# Producer/consumer command (optional)
# For write mode: producer command that generates data
# For read mode: consumer command that processes data
# Omit to use stdin (write) or stdout (read) instead
# command: tar -xz -C /output

# Passphrase for symmetric end-to-end encryption (optional)
# Both sender and receiver must use the same passphrase
# Prefer ZEBRASTREAM_PASSPHRASE env var
# passphrase: your-secret-passphrase

# Encryption mode (optional, write mode only, requires passphrase)
# "age-rt": variable-size chunks, suitable for real-time streaming (default when passphrase is set)
# "age":    standard age v1 format, fixed 64 KiB blocks — NOT suitable for real-time use
# encryption_mode: age-rt

# Content-Type header (optional, write mode only)
# content_type: application/octet-stream

# Automatic flush delay in seconds (optional, write mode only)
# Set to 0 for instant flush on every write (low-latency)
# Set to 1+ for timer-based flushing (balances latency and efficiency)
# Omit or comment out to disable automatic flushing
# auto_flush_delay: 5

# Unbuffered output (optional, read mode only, default: false)
# When true, disables output buffering for real-time streaming
# Useful for log tailing and interactive output
# unbuffered_output: true

# Override connect API URL (optional, defaults to ZebraStream cloud)
# connect_url: https://connect.zebrastream.io/v0/

# Connection timeout in seconds (optional, default: no timeout)
# connect_timeout: 30

Command Configuration Examples:

# Producer config - backup database to stream
# ~/.config/zebrastream/streams/db-backup.yaml
mode: write
stream_path: /backups/postgres/production
command: pg_dump mydb

# Consumer config - extract tarball from stream
# ~/.config/zebrastream/streams/deploy.yaml
mode: read
stream_path: /releases/app/latest
command: tar -xz -C /var/www/app

Usage with command in config:

# Uses pg_dump command from config
zebrastream --config-name db-backup write

# Override config command with CLI
zebrastream --config-name db-backup write -- pg_dumpall

# Omit command to use stdin instead
cat local-file.txt | zebrastream --config-name db-backup write

For more details on configuration, authentication, and advanced options:

zebrastream --help
zebrastream write --help
zebrastream read --help

Documentation

See ZebraStream documentation for more details.

License

MIT License. See LICENSE for details.

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.5.4

May 28, 2026

0.5.3

May 22, 2026

0.5.2

May 5, 2026

0.5.1

Apr 2, 2026

0.5.0

Apr 1, 2026

0.4.0

Feb 26, 2026

0.3.1

Feb 24, 2026

0.3.0

Jan 20, 2026

0.2.0

Aug 27, 2025

0.1.1

Aug 7, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

zebrastream_io-0.5.4.tar.gz (39.2 kB view details)

Uploaded May 28, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

zebrastream_io-0.5.4-py3-none-any.whl (39.4 kB view details)

Uploaded May 28, 2026 Python 3

File details

Details for the file zebrastream_io-0.5.4.tar.gz.

File metadata

Download URL: zebrastream_io-0.5.4.tar.gz
Upload date: May 28, 2026
Size: 39.2 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: poetry/2.4.1 CPython/3.13.1 Linux/6.17.0-1015-azure

File hashes

Hashes for zebrastream_io-0.5.4.tar.gz
Algorithm	Hash digest
SHA256	`56039eb06bf7663e60dd6dd55b1871a9a7cde46c64a407aa02362749f5d41ec5`
MD5	`2a39b89c025d1f9b46b8c1ae347414a8`
BLAKE2b-256	`a15b1599a8c6ee4e282553de67b06119ad927bc20489d98ed0bd3727feee76a4`

See more details on using hashes here.

File details

Details for the file zebrastream_io-0.5.4-py3-none-any.whl.

File metadata

Download URL: zebrastream_io-0.5.4-py3-none-any.whl
Upload date: May 28, 2026
Size: 39.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: poetry/2.4.1 CPython/3.13.1 Linux/6.17.0-1015-azure

File hashes

Hashes for zebrastream_io-0.5.4-py3-none-any.whl
Algorithm	Hash digest
SHA256	`c211ca2d766395f9f3010d47625e18919e00ac87e6bad434d90b48c5df1bb686`
MD5	`bb6ce6e9c312dbe8d0cd0108d2b38672`
BLAKE2b-256	`ec3aa71bd69b2ebca4944f465294bccc43ad665bd0728f2b35f46a3448af21f5`

See more details on using hashes here.

zebrastream-io 0.5.4

Navigation

Verified details

Maintainers

Unverified details

Meta

Classifiers

Project description

zebrastream-io

Features

Installation

Usage

Synchronous file-like interface

Producer

Real-time log streaming

Consumer

End-to-End Encryption

Async interface (unstable)

Producer

Consumer

Command-Line Interface

Installation

Usage

Documentation

License

See also

Project details

Verified details

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes