Skip to main content

Next-generation Jupyter kernel management with decoupled client connections

Project description

nextgen-kernels-api

A next-generation Jupyter kernel client architecture that enables shared kernel connections and centralized message routing.

Motivation

In the upstream Jupyter Server implementation, each WebSocket connection establishes its own set of ZMQ sockets to the kernel:

graph LR
    WS1[WebSocket 1] --> ZMQ1[ZMQ Client 1]
    WS2[WebSocket 2] --> ZMQ2[ZMQ Client 2]
    WS3[WebSocket 3] --> ZMQ3[ZMQ Client 3]
    ZMQ1 --> K[Kernel]
    ZMQ2 --> K
    ZMQ3 --> K

Problems:

  • Resource overhead: Multiple redundant ZMQ connections per kernel
  • State fragmentation: No centralized view of kernel execution state
  • Lost messages: No way to route kernel messages to server-side consumers (YDocs, etc.)

Architecture

This project introduces a shared kernel client managed by the kernel manager itself:

graph TB
    subgraph Consumers
        WS1[WebSocket 1]
        WS2[WebSocket 2]
        WS3[WebSocket 3]
        YD[YDoc / Server Documents]
    end

    subgraph "Kernel Manager"
        KC[Shared Kernel Client]
    end

    WS1 -->|register as listener| KC
    WS2 -->|register as listener| KC
    WS3 -->|register as listener| KC
    YD -.->|route messages directly| KC

    KC <-->|single ZMQ connection| K[Kernel]

    style YD stroke-dasharray: 5 5

Client Lifecycle

sequenceDiagram
    participant WS as WebSocket
    participant KM as Kernel Manager
    participant KC as Kernel Client
    participant K as Kernel

    Note over KM,KC: Kernel start creates client
    KM->>KC: create kernel_client instance
    KM->>K: start kernel
    KM->>KC: load connection info
    KM->>KC: connect()
    KC->>K: test communication
    KC->>KC: mark connection ready

    WS->>KC: WebSocket connects
    KC->>WS: add_listener(websocket_callback)
    KC-->>WS: broadcast current state

    WS->>KC: send message to kernel
    K-->>KC: kernel response
    KC-->>WS: route to all listeners
    KC-->>YD: route to YDoc (if registered)

Key Design Principles

  1. Single Client per Kernel: One shared ZMQ connection per kernel, used by all consumers
  2. Listener Pattern: WebSockets and server-side components register as message listeners
  3. Centralized State: Track execution state, activity, and lifecycle from one place
  4. Message Queuing: Queue messages during startup, deliver when connection ready
  5. Server-Side Routing: Enable direct message flow to YDocs for accurate state tracking

Features

  • Shared Kernel Client: Single ZMQ connection per kernel, shared across all consumers
  • Message Listener API: Register callbacks to receive kernel messages from all channels
  • Message Filtering: Filter messages by type and channel when adding listeners (see docs)
  • Configurable WebSocket Filtering: Configure message filtering for WebSocket connections via Jupyter Server config (see docs)
  • Connection Management: Robust connect/disconnect/reconnect with health checks
  • State Tracking: Monitor execution state (idle, busy, starting) via status messages
  • Message Queuing: Queue messages during connection setup, deliver when ready
  • Message ID Encoding: Track message origin (channel and cell ID) by encoding directly in message IDs

Integration with Jupyter Server Documents

This architecture is designed to work seamlessly with jupyter-server-documents, enabling server-side YDocs to receive kernel messages directly:

Benefits:

  • Accurate execution state: YDoc always knows if kernel is busy/idle
  • No lost outputs: Cell outputs flow directly to YDoc, even if no WebSocket connected
  • Real-time collaboration: All clients (WebSockets + YDoc) see kernel state simultaneously

To integrate, simply register the YDoc as a listener on the kernel manager's client:

# In your YDoc initialization or extension
# Get the multi-kernel manager from the server app
mkm = app.kernel_manager

# Get the specific kernel manager for your kernel
km = mkm.get_kernel(kernel_id)

# Access the shared kernel client
client = km.kernel_client

# Listen to all messages
client.add_listener(ydoc.handle_kernel_message)

# Or filter to only specific message types
client.add_listener(
    ydoc.handle_kernel_message,
    msg_types=[("execute_result", "iopub"), ("stream", "iopub")]
)

Installation

pip install nextgen-kernels-api

Enable the extension:

jupyter server extension enable nextgen_kernels_api

This extension will automatically override the default Jupyter Server kernel APIs when the server starts.

Configuration

To enable the shared kernel client architecture with message ID encoding, configure Jupyter Server to use the custom classes.

Quick Start

Create a jupyter_config.py file in your Jupyter config directory (or use --config=path/to/config.py):

c = get_config()

# Use the enhanced kernel manager with shared kernel clients
c.ServerApp.kernel_manager_class = "nextgen_kernels_api.services.kernels.kernelmanager.MultiKernelManager"

# Configure which KernelManager class each kernel uses
c.MultiKernelManager.kernel_manager_class = "nextgen_kernels_api.services.kernels.kernelmanager.KernelManager"

# Configure which client class the KernelManager uses
c.KernelManager.client_class = "nextgen_kernels_api.services.kernels.client.JupyterServerKernelClient"

# Configure the WebSocket connection class
c.ServerApp.kernel_websocket_connection_class = "nextgen_kernels_api.services.kernels.connection.kernel_client_connection.KernelClientWebsocketConnection"

Then start Jupyter Server:

jupyter server --config=jupyter_config.py

What Gets Configured

  1. MultiKernelManager: Manages all kernel instances
  2. KernelManager: Creates and manages individual kernels with pre-created shared clients
  3. JupyterServerKernelClient: Shared kernel client with:
    • Custom channel classes (ShellChannel, ControlChannel, StdinChannel) that auto-encode channel names
    • Listener API for multiple consumers
    • Message queuing during startup
  4. KernelClientWebsocketConnection: WebSocket handler that:
    • Registers as a listener on the shared client
    • Encodes cell IDs in outgoing messages
    • Strips encoding from incoming messages

Optional: Message Filtering

You can also configure message filtering for WebSocket connections:

# Only send specific message types to WebSockets
c.KernelClientWebsocketConnection.msg_types = [
    ("execute_result", "iopub"),
    ("stream", "iopub"),
    ("error", "iopub"),
]

# Or exclude specific message types
c.KernelClientWebsocketConnection.exclude_msg_types = [
    ("status", "iopub"),
]

See the included jupyter_config.minimal.py for a complete minimal configuration example.

Technical Details: Message ID Encoding

The Problem

In a shared kernel client architecture, we need to track:

  1. Which channel a message came from (shell vs control)
  2. Which cell originated a request (for routing responses back)

Originally, an approach I took was to have a separate message cache to store this metadata, but this leads to unbounded memory growth.

The Solution: Encode Metadata in Message IDs

Instead of maintaining a separate cache, we encode routing information directly into the message ID itself:

Format: {channel}:{base_msg_id}#{cell_id}

Examples:

  • shell:a1b2c3d4_12345_0#cell-abc123 - Execute request from a cell
  • control:a1b2c3d4_12345_1 - Interrupt request (no cell)
  • a1b2c3d4_12345_2 - Legacy format (backward compatible)

Message Flow

sequenceDiagram
    participant FE as Frontend (Cell)
    participant WS as WebSocket
    participant KC as Kernel Client
    participant K as Kernel

    Note over FE,K: Outgoing: Frontend → Kernel
    FE->>WS: execute_request<br/>msg_id="abc123"<br/>metadata.cellId="cell-xyz"
    WS->>WS: Append cellId<br/>msg_id="abc123#cell-xyz"
    WS->>KC: handle_incoming_message("shell", msg)
    KC->>KC: Prepend channel<br/>msg_id="shell:abc123#cell-xyz"
    KC->>K: execute_request

    Note over FE,K: Incoming: Kernel → Frontend
    K-->>KC: status (busy)<br/>parent_msg_id="shell:abc123#cell-xyz"
    KC->>KC: Parse channel from parent_msg_id<br/>→ channel="shell"
    KC->>KC: Update execution state<br/>(only for shell channel)
    KC-->>WS: Route to listeners
    WS->>WS: Strip encoding<br/>msg_id="abc123"
    WS-->>FE: status message with original msg_id

Encoding Points

1. WebSocket Layer (kernel_client_connection.py):

  • Incoming: Appends #cell_id from metadata to msg_id
  • Outgoing: Strips both channel and cell_id encoding before sending to frontend

2. Kernel Client Layer (client.py):

  • Outgoing: Custom channel classes (ShellChannel, ControlChannel, StdinChannel) automatically prepend channel name to all messages
  • Incoming: Parses channel from parent_msg_id in status messages for state tracking

License

BSD-3-Clause

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

nextgen_kernels_api-0.11.0.tar.gz (27.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

nextgen_kernels_api-0.11.0-py3-none-any.whl (26.1 kB view details)

Uploaded Python 3

File details

Details for the file nextgen_kernels_api-0.11.0.tar.gz.

File metadata

  • Download URL: nextgen_kernels_api-0.11.0.tar.gz
  • Upload date:
  • Size: 27.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: python-httpx/0.28.1

File hashes

Hashes for nextgen_kernels_api-0.11.0.tar.gz
Algorithm Hash digest
SHA256 8d80a59ff9d63c739671b74e32bfb62b9556f8e589ca953ce4c203274be74f7d
MD5 a75fcedfa21e5710abbfe77e1a3f7c48
BLAKE2b-256 f29b9a53d881f47f23825667477cdc1a805a2b8deed6413ad98eeadc770a8eb4

See more details on using hashes here.

File details

Details for the file nextgen_kernels_api-0.11.0-py3-none-any.whl.

File metadata

File hashes

Hashes for nextgen_kernels_api-0.11.0-py3-none-any.whl
Algorithm Hash digest
SHA256 5c9d03ec6c6d5785aa7db8d616878be2ebff5cabcfd07892dbb1738935b97a5c
MD5 89ff9d2b58909b61367f65c052830aa9
BLAKE2b-256 0c05cc862176607294308588776ae51693356d8ef34d147cdf906e055e19a90e

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page