Skip to main content

Premium Python SDK for Superbrain Distributed Memory Fabric

Project description

🧠 superbrain-sdk v0.2.0 — Python

PyPI version License: MIT Python

The Distributed RAM Fabric for AI Agents — Share memory across machines at microsecond speeds. Now with Phase 3: Automated AI Memory Controller, LangChain & PyTorch integration, and self-healing KV cache pooling.


🚀 What Is SuperBrain?

SuperBrain is a distributed RAM network where multiple AI agents on different machines share memory via 36-byte UUID pointers — instead of copying massive JSON blobs over slow APIs.

Key Numbers:

  • ~100 MB/s write throughput per node (gigabit saturation)
  • ~1–2ms read/write latency on LAN
  • 36 bytes to share any amount of memory between agents
  • Zero-copy context passing for multi-agent workflows

📦 Installation

pip install superbrain-sdk

✨ New in v0.2.0 — Phase 3: Automated AI Memory Controller

Zero-Config Cluster Discovery

from superbrain import AutoMemoryController

# Finds your SuperBrain cluster automatically via mDNS
memory = AutoMemoryController()

Shared Context Across Multiple LLMs

@memory.shared_context("research-session")
def researcher(ctx, document):
    ctx.write("findings", {"summary": "...", "confidence": 0.95})

@memory.shared_context("research-session")  # Same context!
def strategist(ctx, findings_ptr):
    return ctx.read("findings")             # Microsecond access

# Different LLMs, same shared memory:
researcher("War and Peace, all 1200 pages")
result = strategist(None)   # Claude reads what GPT-4 wrote!

Automatic KV Cache Deduplication

from superbrain import DistributedContextFabric

fabric = DistributedContextFabric(coordinator="localhost:50050")

# Same system prompts across 1000 agents → stored ONCE
ptr = fabric.store_kv_cache(b"You are a helpful assistant", model="gpt-4")
# Claude, Llama, and GPT-4 all reuse the same pointer

LangChain Memory Adapter

from superbrain.integrations.langchain import SuperBrainMemory
from langchain.chains import ConversationChain

memory = AutoMemoryController()
sb_memory = SuperBrainMemory(memory, session_id="user-123")

chain = ConversationChain(llm=your_llm, memory=sb_memory)
# Conversation history persisted in distributed RAM!
# Survives LLM restarts. Shared across machines.

PyTorch / HuggingFace KV-Cache Offloading

from superbrain.integrations.pytorch import enable_distributed_kv_cache

enable_distributed_kv_cache(fabric, max_local_layers=4)

# NOW: When GPU VRAM is full, KV caches page to cluster RAM
# instead of crashing or swapping to slow disk
model.generate(input_ids, max_length=100_000)  # Long context just works!

🔧 Core API

from superbrain import DistributedContextFabric
from superbrain.monitor import MonitorServer

# Initialize with all Phase 3 subsystems
fabric = DistributedContextFabric(coordinator="your-host:50050")

# Start live monitoring dashboard at http://localhost:9090
MonitorServer(fabric).start()

# Allocate + write data to distributed RAM
ptr = fabric.allocate_and_write(b"My huge AI context", agent_id="agent-1")

# Any machine anywhere can read it with just the pointer
data = fabric.read(ptr, 0, 0)

# Named shared contexts
ctx = fabric.create_context("agent-swarm")
ctx.write("state", {"step": 42, "done": False})
state = ctx.read("state")

# Get full telemetry
fabric.print_stats()

📊 Performance Telemetry

stats = fabric.stats()

# {
#   "telemetry": {
#     "throughput": {"write_mbps": 98.4, "read_mbps": 102.1},
#     "kv_cache": {"hit_ratio": 0.87},
#     "operations": {"write": {"p50_ms": 1.2, "p95_ms": 3.1, "p99_ms": 5.4}}
#   },
#   "kv_pool": {"total_segments": 142, "compressed_segments": 32},
#   "anomalies": []
# }

🔐 Zero-Trust Security

from superbrain.security import KeyManager, AnomalyDetector

# Per-context AES-256 key derivation
km = KeyManager(master_secret=os.urandom(32))
key = km.key_for("session-user-abc")
km.schedule_rotation("session-user-abc", interval_s=3600)

# Anomaly detection on access patterns (Z-score, 3σ)
det = AnomalyDetector()
# Automatically alerts when an agent accesses 100x more bytes than normal

🧹 Memory Management — When to Call free()

TL;DR — Use SharedContext or store_kv_cache() and you never need to call free().

What you call Need free()? Best for
client.allocate() Yes Raw low-level control
ctx.write("key", data) No Agent-to-agent context sharing
fabric.create_context("name") No Multi-LLM session state
fabric.store_kv_cache(prefix) No Shared system prompts, long contexts
SuperBrainMemory (LangChain) No Chat history across restarts
enable_distributed_kv_cache() No PyTorch/HuggingFace VRAM overflow
# ❌ Raw Client — you must free manually
ptr = client.allocate(100 * 1024 * 1024)
client.write(ptr, 0, b"data")
client.free(ptr)  # ← required!

# ✅ SharedContext — no free, ever
ctx = fabric.create_context("my-session")
ctx.write("findings", {"summary": "..."})   # stored in distributed RAM
ctx.read("findings")                        # read from anywhere

# ✅ KV Cache Pool — no free, auto-evicted
ptr = fabric.store_kv_cache(b"System prompt", model="gpt-4")
# 1000 agents → same ptr, stored once ✅

Full Memory Management Guide with diagrams


🗺️ Roadmap

Version Milestone Status
v0.1.0 Core Distributed RAM (Allocate/Read/Write/Free) ✅ Shipped
v0.1.1 Secure Fabric (mTLS, E2EE, Pub/Sub) ✅ Shipped
v0.2.0 Phase 3: Automated AI Memory Controller Current
v0.3.0 Raft Replication (Fault-Tolerant Memory) 🚧 Planned
v0.4.0 NVMe Spilling ("Infinite Memory") 🚧 Planned
v0.5.0 GPUDirect RDMA (Zero-Copy GPU→Network) 🔬 Research

🖥️ Server Requirements

This SDK connects to a SuperBrain cluster. To run one locally:

docker compose up -d   # From the main repo: github.com/anispy211/memorypool
# Dashboard: http://localhost:8080

📚 Documentation

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

superbrain_sdk-0.2.1.tar.gz (31.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

superbrain_sdk-0.2.1-py3-none-any.whl (34.5 kB view details)

Uploaded Python 3

File details

Details for the file superbrain_sdk-0.2.1.tar.gz.

File metadata

  • Download URL: superbrain_sdk-0.2.1.tar.gz
  • Upload date:
  • Size: 31.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.3

File hashes

Hashes for superbrain_sdk-0.2.1.tar.gz
Algorithm Hash digest
SHA256 3de5b26ddbb56f5d1984bba8327b5aaa2d09911648cd838ae98b850c86aa63b2
MD5 d7ce7dc1b518f0aed9a2d4a8c918fd17
BLAKE2b-256 b358c167c2eda6b05995942822cdd38905c37a630ac74de39fadf99a5c9b1553

See more details on using hashes here.

File details

Details for the file superbrain_sdk-0.2.1-py3-none-any.whl.

File metadata

  • Download URL: superbrain_sdk-0.2.1-py3-none-any.whl
  • Upload date:
  • Size: 34.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.3

File hashes

Hashes for superbrain_sdk-0.2.1-py3-none-any.whl
Algorithm Hash digest
SHA256 7edb17535775eff4b681f8d5d35e7a0c9fd857caf6a199b502f283f52ba4ee00
MD5 84c4e2aeaafd4610aa90cfdba98eb6d0
BLAKE2b-256 66c54255d22177c86f97dbfd2ae9150d2df89ad5e09ede02ba1ee1545fba1aa9

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page