Python SDK for the Sonzai Mind Layer API

These details have not been verified by PyPI

Project links

Project description

Sonzai Python SDK

The official Python SDK for the Sonzai Mind Layer API. Build AI agents with persistent memory, evolving personality, and proactive behaviors.

Installation

pip install sonzai

Staying in sync with the production API

This SDK tracks https://api.sonz.ai/docs/openapi.json. A git pre-push hook checks for drift; run just install-hooks once after cloning. To refresh the committed spec snapshot, run just sync-spec and commit the diff.

Benchmarks

Sonzai leads on three independent benchmarks (LoCoMo, LongMemEval, SOTOPIA), running on the cheap end of the LLM stack — chat, judge, reader, and partner agent all run on Gemini 3.1 Flash Lite. No frontier-model arms race propping up the numbers; the lift is from the memory architecture. Drop in a heavier model and the ceiling goes up from there.

LoCoMo — long-term conversational memory (mem0's home turf)

10 peer-to-peer dialogues, 19–35 sessions each, 1540 QAs across 4 reasoning categories. Run via mem0's published evaluation pipeline byte-for-byte (their ANSWER_PROMPT + ACCURACY_PROMPT, dual-perspective ingest, dual search) so numbers are directly comparable.

Category	n	Sonzai (J)	mem0 (J, published)
1. single-hop	282	0.720	~0.65
2. multi-hop	321	0.723	~0.55
3. temporal reasoning	96	0.531	~0.55
4. open-domain	841	0.762	~0.71
Overall	1540	0.732 ✅	~0.67

Multi-hop is Sonzai's strongest category (+~17 points over mem0) — the hardest LoCoMo bucket and the one mem0's graph variant typically claims its lift on. Sonzai matches/beats without graph-specific machinery.

LongMemEval — retrieval (MemPalace's home turf)

Metric	Sonzai	MemPalace (hybrid_v4)
R@G (overall recall)	0.773	0.741
R@1 (top-hit accuracy)	0.800	0.770
Recall@10, multi-session	1.000	1.000

SOTOPIA longitudinal — compounding across sessions

Sonzai's USP: agents that compound. Same agent, same partner, N sessions, advance_time between each. Canonical SOTOPIA scores session 1 only — we also run it at s10, s20, s30 and add an 8th judge-scored dim memory_continuity (0..10) grading whether the agent treats the relationship as continuous with prior sessions.

Head-to-head at session 1 (no accumulated memory, standard SOTOPIA):

Dimension (session 1)	Sonzai	MemPalace	Δ
Believability (0..10)	9.00	9.00	tie
Relationship (−5..5)	4.25	4.00	+0.25
Knowledge (0..10)	7.75	6.50	+1.25
Goal (0..10)	9.00	8.75	+0.25
Overall	8.44	8.03	+0.41 ✅

Sonzai improves across sessions (same agent, rolling history):

Dim	s1	s10	s20	s30	Δ s1→s30
Believability (0..10)	9.00	9.75	9.62	10.00 (ceiling)	+1.00 ↑
Relationship (−5..5)	4.25	5.00	4.75	5.00 (ceiling)	+0.75 ↑
Knowledge (0..10)	7.75	8.50	7.75	8.50	+0.75 ↑
Goal (0..10)	9.00	9.75	9.50	9.75	+0.75 ↑
`memory_continuity` (0..10)	5.00	10.00 (ceiling)	9.75	10.00 (ceiling)	+5.00 ↑
Overall	8.44	9.45	9.38	9.56	+1.13 ↑

Every non-floor dim climbs. Believability and relationship hit the rubric ceiling by s30; memory_continuity hits the ceiling by s10 — Sonzai's identity model is producing accurate unprompted callbacks before a verbatim-retrieval baseline has history to compete.

Full scores, methodology, per-question-type breakdown, and reproduction steps (including comparison against MemPalace's canonical longmemeval_bench.py):

→ benchmarks/README.md

Quick Start

from sonzai import Sonzai

client = Sonzai(api_key="your-api-key")

# Chat with an agent
response = client.agents.chat(
    "your-agent-id",
    messages=[{"role": "user", "content": "Hello! What's your favorite hobby?"}],
    user_id="user-123",
)
print(response.content)

client.close()

Authentication

Get your API key from the Sonzai Dashboard under Projects > API Keys.

# Pass directly
client = Sonzai(api_key="sk-...")

# Or set the environment variable
# export SONZAI_API_KEY=sk-...
client = Sonzai()

Bring Your Own Key (BYOK)

BYOK lets you register your own LLM provider API keys with a project. Once set, upstream LLM calls for that project route through your key — token billing falls on your provider account, not Sonzai's. Keys are encrypted at rest server-side and are never returned by the API (only the key prefix and health metadata are exposed). Requires read:byok / write:byok scopes on the API key you use to call these endpoints.

# List all configured BYOK providers for a project
keys = client.byok.list("project-id")
for k in keys:
    print(f"{k.provider}: {k.health_status} (active: {k.is_active})")

# Store or replace a key (validated against the provider before saving)
key = client.byok.set("project-id", "openai", api_key="sk-...")
print(f"Stored prefix: {key.api_key_prefix}")

# Enable or disable without rotating
client.byok.set_active("project-id", "openai", is_active=False)

# Re-run the provider health check on a stored key
result = client.byok.test("project-id", "gemini")
print(result.health_status)  # "healthy" | "invalid" | "unknown"

# Remove a stored key (project falls back to platform billing)
client.byok.delete("project-id", "xai")

Supported providers: "openai" | "gemini" | "xai" | "openrouter". REST path: /api/v1/projects/{project_id}/byok-keys[/{provider}[/test]].

Usage

Chat (Streaming)

for event in client.agents.chat(
    "agent-id",
    messages=[{"role": "user", "content": "Tell me a story"}],
    stream=True,
):
    print(event.content, end="", flush=True)

Chat (Non-streaming)

response = client.agents.chat(
    "agent-id",
    messages=[{"role": "user", "content": "Hello!"}],
    user_id="user-123",
    session_id="session-456",  # optional, auto-created if omitted
)
print(response.content)
print(f"Tokens used: {response.usage.total_tokens}")

Chat (Advanced Options)

response = client.agents.chat(
    "agent-id",
    messages=[{"role": "user", "content": "Hello!"}],
    user_id="user-123",
    user_display_name="Alex",
    provider="openai",
    model="gpt-4o",
    language="en",
    timezone="America/New_York",
    compiled_system_prompt="You are a helpful assistant.",
    tool_capabilities={"web_search": True, "remember_name": True, "image_generation": False},
    tool_definitions=[
        {"name": "get_weather", "description": "Get current weather", "parameters": {"type": "object", "properties": {"city": {"type": "string"}}}},
    ],
    temperature=0.7,   # see Temperature note below
)

Temperature

temperature is optional and defaults to omitted from the wire payload — the AI service picks its own per-model default. The Platform automatically adapts or omits this value for providers whose models require it, so callers do not need to know provider-specific constraints: pass the value you want, and the Platform will silently reconcile it where necessary.

A temperature=0.0 request is preserved on the wire (deterministic output). Only temperature=None (the default) omits the field.

Streaming and cancellation

The streaming chat call (stream=True) honours the caller's asyncio-task cancellation by default — task.cancel() aborts the in-flight AI generation. That is usually what you want.

When it isn't: a NATS handler, Watermill subscriber, or short-lived HTTP request that returns to the client before the AI generation completes. Cancelling the caller would abort the LLM mid-stream, burning the upstream quota for no result. Use chat_detached / chat_stream_detached instead — the upstream call is shielded from caller cancellation via asyncio.shield(...), with a default 5-minute hard timeout cap and a watchdog that warns (or fires a callback) if the caller bails mid-call.

# Aggregate variant — returns ChatResponse when the AI finishes.
resp = await client.agents.chat_detached(
    "agent-id",
    messages=[{"role": "user", "content": "Plan my week."}],
    timeout_seconds=300.0,                # default DEFAULT_DETACHED_TIMEOUT_SECONDS = 300
    on_parent_cancel=lambda exc: metrics.inc("detached.parent_cancelled"),
)

# Streaming variant — drains the SSE stream into memory first, then yields.
async for event in client.agents.chat_stream_detached(
    "agent-id",
    messages=[{"role": "user", "content": "Plan my week."}],
):
    print(event.content, end="", flush=True)

The sync Sonzai.agents.chat_detached(...) / chat_stream_detached(...) methods exist for API parity but delegate to the regular chat(...) — Python's synchronous HTTP path has no caller-cancellation primitive, so sync blocking calls are effectively already detached.

See sonzai.DetachOptions and sonzai.DEFAULT_DETACHED_TIMEOUT_SECONDS for the tunables.

Chat (Async with Polling)

For chats that may run longer than your network can hold an SSE stream open (Cloudflare/LB cuts at ~100s), queue the request and poll for the result. Cancelling the poll locally does not cancel the server-side task — re-poll the same processing_id later if needed.

# Fire-and-forget — returns {"processing_id": "...", "status": "queued"}.
queued = client.agents.chat_async(
    "agent-id",
    messages=[{"role": "user", "content": "Plan my week."}],
    user_id="user-123",
    session_id="session-456",
    provider="openai",
    model="gpt-4o",
)
processing_id = queued["processing_id"]

# Poll until terminal. Recommended backoff: 1s → 2s → 4s, capped at 5s.
import time
delay = 1.0
while True:
    result = client.agents.poll_chat_result("agent-id", processing_id)
    # status: queued | running | complete | failed
    # while running, `response` carries partial assistant text and
    # `phase` / `tool` reflect the latest progressive-elaboration event.
    if result["status"] in ("complete", "failed"):
        break
    time.sleep(delay)
    delay = min(delay * 2, 5.0)

print(result["response"])
print(result.get("side_effects"))   # populated on terminal frame

# Convenience: queue + poll until terminal in one call.
result = client.agents.chat_async_blocking(
    "agent-id",
    messages=[{"role": "user", "content": "Plan my week."}],
    user_id="user-123",
    poll_interval_seconds=1.0,
    max_poll_interval_seconds=5.0,
    timeout_seconds=600.0,    # matches the server's CE_AGENT_CHAT_DEADLINE_MS
)

Memory

# Get memory tree
memory = client.agents.memory.list("agent-id", user_id="user-123")
for node in memory.nodes:
    print(f"{node.name} (importance: {node.importance})")

# Search memories
results = client.agents.memory.search("agent-id", query="favorite food")
for fact in results.results:
    print(f"{fact.content} (score: {fact.score})")

# Get memory timeline
timeline = client.agents.memory.timeline(
    "agent-id",
    user_id="user-123",
    start="2026-01-01",
    end="2026-03-01",
)

# Bulk create up to 1000 pre-formed facts in one request.
# source_type="manual" — no LLM extraction.
client.agents.memory.bulk_create_facts(
    "agent-id",
    user_id="user-123",
    facts=[
        {"content": "prefers espresso"},
        {"content": "based in Singapore", "fact_type": "location"},
    ],
)

# Single-call enriched context — fact retrieval runs query-conditioned
# (two-pass: entity-filtered + raw-text vector). recent_turns surfaces this
# session's raw messages before consolidation has run, so mid-session
# "remember what I just said" works immediately.
ctx = client.agents.get_context(
    "agent-id",
    user_id="user-123",
    query="what did we discuss earlier about espresso?",
)
for turn in ctx.recent_turns or []:
    print(f"[{turn.timestamp}] {turn.role}: {turn.content}")

Personality

personality = client.agents.personality.get("agent-id")
print(f"Name: {personality.profile.name}")
print(f"Openness: {personality.profile.big5.openness.score}")
print(f"Warmth: {personality.profile.dimensions.warmth}/10")

Sessions (real-time turn loop)

sessions.start() returns a Session handle bundling the identity tuple (agent_id, user_id, session_id, instance_id) plus provider/model defaults. The handle drives the per-turn loop with one fresh enriched context fetched per turn — so the LLM you call out to (OpenAI, Anthropic, Gemini, your own) sees up-to-date mood, recalled facts, and recent turns on every message.

# Start the session — returns a Session handle (not void).
session = client.agents.sessions.start(
    "agent-id",
    user_id="user-123",
    session_id="session-456",
    provider="gemini",                              # session-level default
    model="gemini-3.1-flash-lite",          # (per-turn override OK)
)

# Per-turn loop: fetch enriched context, hand it to your LLM, submit the turn.
ctx = session.context(query="what's the user about to say?")
# ... build your prompt with ctx, call your LLM, get assistant_reply ...

result = session.turn(
    messages=[
        {"role": "user", "content": "what did we talk about last week?"},
        {"role": "assistant", "content": assistant_reply},
    ],
    # Prefetch the *next* enriched context in the same round-trip
    # so the next user message renders without a second fetch.
    fetch_next_context={"query": "anticipated next user message"},
)
print(result.mood)              # sync mood update
print(result.extraction_id)     # async fact-extraction job id
print(result.next_context)      # populated when fetch_next_context is set

# Poll deferred extraction (memory write-back) when you need to know it landed.
status = session.status(result.extraction_id)
print(status.state)             # queued | running | done | failed

# End the session. wait=True forces the CE pipeline to run synchronously
# (use in benchmarks/tests that query memory immediately after).
session.end(total_messages=10, duration_seconds=300, wait=True)

Per-call provider/model on session.turn(...) and session.end(...) override the session defaults; omit them to fall through to the session default (or the server-side resolver).

Legacy void-style start/end

client.agents.sessions.end("agent-id", user_id=..., session_id=...) still works for callers that don't need the handle:

client.agents.sessions.end(
    "agent-id",
    user_id="user-123",
    session_id="session-456",
    total_messages=10,
    duration_seconds=300,
)

Agent Instances

# List instances
instances = client.agents.instances.list("agent-id")

# Create a new instance
instance = client.agents.instances.create("agent-id", name="Test Instance")
print(f"Created: {instance.instance_id}")

# Reset an instance
client.agents.instances.reset("agent-id", instance.instance_id)

# Delete an instance
client.agents.instances.delete("agent-id", instance.instance_id)

Notifications

# Get pending notifications
notifications = client.agents.notifications.list("agent-id", status="pending")
for n in notifications.notifications:
    print(f"[{n.check_type}] {n.generated_message}")

# Consume a notification
client.agents.notifications.consume("agent-id", n.message_id)

# Get notification history
history = client.agents.notifications.history("agent-id")

Capabilities (sync/async memory recall)

Supplementary memory recall can run synchronously (blocks context build until recall returns — every fact lands in the current turn) or asynchronously (races a deadline — slow hits spill to the next turn for lower first-token latency). Default is sync.

memory_mode is an agent-wide capability — set it once, every subsequent chat uses that mode until you change it.

# Read the current capabilities
caps = client.agents.get_capabilities("agent-id")
print(caps.memory_mode)  # "sync" or "async"

# Switch to async for lower first-token latency
client.agents.update_capabilities("agent-id", memory_mode="async")

# Switch back to sync
client.agents.update_capabilities("agent-id", memory_mode="sync")

# Other capabilities (all optional, PATCH-style — omitted fields are left unchanged)
client.agents.update_capabilities(
    "agent-id",
    memory_mode="async",
    knowledge_base=True,
    web_search=True,
    remember_name=True,
    image_generation=False,
    inventory=False,
)

You can also set memory_mode (and knowledge_base) at creation time via the tool_capabilities dict:

agent = client.agents.create(
    name="Luna",
    tool_capabilities={
        "web_search": True,
        "remember_name": True,
        "image_generation": False,
        "inventory": False,
        "knowledge_base": True,     # enable project-scoped KB search
        "memory_mode": "async",      # "sync" (default) or "async"
    },
)

Context Engine Data

# Mood
mood = client.agents.get_mood("agent-id", user_id="user-123")

# Relationships
relationships = client.agents.get_relationships("agent-id", user_id="user-123")

# Habits, Goals, Interests
habits = client.agents.get_habits("agent-id")
goals = client.agents.get_goals("agent-id")
interests = client.agents.get_interests("agent-id")

# Diary
diary = client.agents.get_diary("agent-id")

# Users
users = client.agents.get_users("agent-id")

Evaluation

# Evaluate an agent
result = client.agents.evaluate(
    "agent-id",
    messages=[
        {"role": "user", "content": "I'm feeling sad today"},
        {"role": "assistant", "content": "I'm sorry to hear that..."},
    ],
    template_id="template-uuid",
)
print(f"Score: {result.score}")
print(f"Feedback: {result.feedback}")

Simulation

# Run a simulation (streaming — launches run, then streams events)
for event in client.agents.simulate(
    "agent-id",
    user_persona={
        "name": "Alex",
        "background": "College student",
        "personality_traits": ["curious", "friendly"],
        "communication_style": "casual",
    },
    config={
        "max_sessions": 3,
        "max_turns_per_session": 10,
    },
):
    print(f"[{event.type}] {event.message}")

# Fire-and-forget (returns RunRef immediately)
ref = client.agents.simulate_async(
    "agent-id",
    user_persona={"name": "Alex", "background": "Student"},
    config={"max_sessions": 2},
)
print(f"Run started: {ref.run_id}")

# Reconnect to stream later (supports resuming via from_index)
for event in client.eval_runs.stream_events(ref.run_id, from_index=0):
    print(f"[{event.type}] {event.message}")

Run Eval (Simulation + Evaluation)

# Combined simulation + evaluation
for event in client.agents.run_eval(
    "agent-id",
    template_id="template-uuid",
    user_persona={"name": "Alex", "background": "Student"},
    simulation_config={"max_sessions": 2, "max_turns_per_session": 5},
):
    print(f"[{event.type}] {event.message}")

# Fire-and-forget
ref = client.agents.run_eval_async(
    "agent-id",
    template_id="template-uuid",
    simulation_config={"max_sessions": 2},
)
print(f"Run started: {ref.run_id}")

Re-evaluate (Eval Only)

# Re-evaluate an existing run with a different template
for event in client.agents.eval_only(
    "agent-id",
    template_id="new-template-uuid",
    source_run_id="existing-run-uuid",
):
    print(f"[{event.type}] {event.message}")

Custom States

# Create a custom state
state = client.agents.custom_states.create(
    "agent-id",
    key="player_level",
    value={"level": 15, "xp": 2400},
    scope="user",
    content_type="json",
    user_id="user-123",
)

# List states
states = client.agents.custom_states.list("agent-id", scope="global")

# Upsert by composite key (create or update)
state = client.agents.custom_states.upsert(
    "agent-id",
    key="player_level",
    value={"level": 16, "xp": 3000},
    scope="user",
    user_id="user-123",
)

# Get by composite key
state = client.agents.custom_states.get_by_key(
    "agent-id",
    key="player_level",
    scope="user",
    user_id="user-123",
)

# Delete by composite key
client.agents.custom_states.delete_by_key(
    "agent-id",
    key="player_level",
    scope="user",
    user_id="user-123",
)

Eval Templates

# List templates
templates = client.eval_templates.list()

# Create a template
template = client.eval_templates.create(
    name="Empathy Check",
    scoring_rubric="Evaluate emotional awareness and response quality",
    categories=[
        {"name": "Emotional Awareness", "weight": 0.5, "criteria": "..."},
        {"name": "Response Quality", "weight": 0.5, "criteria": "..."},
    ],
)

# Update a template
client.eval_templates.update(template.id, name="Updated Name")

# Delete a template
client.eval_templates.delete(template.id)

Eval Runs

# List eval runs
runs = client.eval_runs.list(agent_id="agent-id")

# Get a specific run
run = client.eval_runs.get("run-id")
print(f"Status: {run.status}, Turns: {run.total_turns}")

# Stream events from a running eval (reconnectable)
for event in client.eval_runs.stream_events("run-id"):
    print(f"[{event.type}] {event.message}")

# Delete a run
client.eval_runs.delete("run-id")

Async Support

Every method is also available as an async variant:

import asyncio
from sonzai import AsyncSonzai

async def main():
    async with AsyncSonzai(api_key="your-api-key") as client:
        # Non-streaming
        response = await client.agents.chat(
            "agent-id",
            messages=[{"role": "user", "content": "Hello!"}],
        )
        print(response.content)

        # Streaming
        async for event in await client.agents.chat(
            "agent-id",
            messages=[{"role": "user", "content": "Tell me a story"}],
            stream=True,
        ):
            print(event.content, end="", flush=True)

asyncio.run(main())

Configuration

client = Sonzai(
    api_key="sk-...",            # or SONZAI_API_KEY env var
    base_url="https://api.sonz.ai",  # or SONZAI_BASE_URL env var
    timeout=30.0,                # request timeout in seconds
    max_retries=2,               # retry count for failed requests
)

Error Handling

from sonzai import (
    Sonzai,
    AuthenticationError,
    NotFoundError,
    BadRequestError,
    RateLimitError,
    InternalServerError,
    SonzaiError,
)

try:
    response = client.agents.chat("agent-id", messages=[...])
except AuthenticationError:
    print("Invalid API key")
except NotFoundError:
    print("Agent not found")
except RateLimitError:
    print("Rate limit exceeded, try again later")
except SonzaiError as e:
    print(f"API error: {e}")

Development

# Clone the repo
git clone https://github.com/sonz-ai/sonzai-python.git
cd sonzai-python

# Install dev dependencies
pip install -e ".[dev]"

# Run tests
pytest

# Lint
ruff check src/

# Type check
mypy src/

License

MIT License - see LICENSE for details.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

1.7.6

Jun 23, 2026

1.7.5

Jun 23, 2026

1.7.4

Jun 23, 2026

1.7.3

Jun 20, 2026

1.7.2

Jun 18, 2026

1.7.1

Jun 17, 2026

1.6.0

May 22, 2026

1.5.6

May 12, 2026

1.5.5

May 12, 2026

1.5.4

May 12, 2026

1.5.3

May 8, 2026

1.5.2

May 7, 2026

1.5.0

May 7, 2026

1.4.5

May 7, 2026

1.4.4

Apr 24, 2026

1.4.3

Apr 23, 2026

1.4.2

Apr 23, 2026

1.4.0

Apr 22, 2026

1.3.0

Apr 21, 2026

1.2.5

Apr 21, 2026

1.2.2

Apr 19, 2026

1.2.1

Apr 17, 2026

1.2.0

Apr 17, 2026

1.1.4

Apr 17, 2026

1.1.3

Apr 16, 2026

1.1.2

Apr 16, 2026

1.1.1

Apr 16, 2026

1.1.0

Apr 12, 2026

1.0.13

Apr 11, 2026

1.0.11

Apr 8, 2026

1.0.9

Apr 8, 2026

1.0.8

Apr 7, 2026

1.0.7

Apr 7, 2026

1.0.6

Apr 7, 2026

1.0.3

Apr 1, 2026

1.0.2

Mar 31, 2026

1.0.1

Mar 31, 2026

1.0.0

Mar 31, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sonzai-1.7.6.tar.gz (25.7 MB view details)

Uploaded Jun 23, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

sonzai-1.7.6-py3-none-any.whl (274.9 kB view details)

Uploaded Jun 23, 2026 Python 3

File details

Details for the file sonzai-1.7.6.tar.gz.

File metadata

Download URL: sonzai-1.7.6.tar.gz
Upload date: Jun 23, 2026
Size: 25.7 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.8

File hashes

Hashes for sonzai-1.7.6.tar.gz
Algorithm	Hash digest
SHA256	`124d50fd89b96c5116a75d0980780bd58ae662e6b9a89a8b3561a09f0985942e`
MD5	`17f5cac54476e6dc61e1d9bd149d293e`
BLAKE2b-256	`8bb4b2381d936d223adf7c3b20edfd8ea10d0445cf20736c9723baf62fb69fe5`

See more details on using hashes here.

File details

Details for the file sonzai-1.7.6-py3-none-any.whl.

File metadata

Download URL: sonzai-1.7.6-py3-none-any.whl
Upload date: Jun 23, 2026
Size: 274.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.8

File hashes

Hashes for sonzai-1.7.6-py3-none-any.whl
Algorithm	Hash digest
SHA256	`26ecd6e16d6da5421847b337fa840281afc605ed0aef4ab603f2e6cfdce043c7`
MD5	`461e9cc47ca3134127f289839e51f3e7`
BLAKE2b-256	`fd9edc64962cf7724e6d96eb1fffe910cf92c428a8e6e22a40900b7f9cb41ae0`

See more details on using hashes here.

sonzai 1.7.6

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Sonzai Python SDK

Installation

Staying in sync with the production API

Benchmarks

LoCoMo — long-term conversational memory (mem0's home turf)

LongMemEval — retrieval (MemPalace's home turf)

SOTOPIA longitudinal — compounding across sessions

Quick Start

Authentication

Bring Your Own Key (BYOK)

Usage

Chat (Streaming)

Chat (Non-streaming)

Chat (Advanced Options)

Temperature

Streaming and cancellation

Chat (Async with Polling)

Memory

Personality

Sessions (real-time turn loop)

Legacy void-style start/end

Agent Instances

Notifications

Capabilities (sync/async memory recall)

Context Engine Data

Evaluation

Simulation

Run Eval (Simulation + Evaluation)

Re-evaluate (Eval Only)

Custom States

Eval Templates

Eval Runs

Async Support

Configuration

Error Handling

Development

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes