simple flask server to host OpenVoiceOS persona plugins as a service

These details have not been verified by PyPI

Project links

Project description

ovos-persona-server

A single HTTP server that exposes one OVOS Persona as eight concurrent API surfaces — so any LLM client (OpenAI SDK, LangChain, Ollama tools, Anthropic SDK, Google Gemini SDK, Cohere SDK, HuggingFace TGI client, AWS Bedrock client, or any A2A agent) can talk to your OVOS persona without changes.

What is a Persona?
Installation
Quick Start
API Surfaces
A2A Endpoint
Persona Config Examples
Streaming
Embeddings
Authentication
Troubleshooting

What is a Persona?

An OVOS Persona is a JSON file that chains together one or more solver plugins. Solvers are tried in order until one returns an answer. You can mix LLMs, knowledge bases, and fallback bots in a single persona — no GPU required for non-LLM setups.

{
  "name": "OldSchoolBot",
  "solvers": [
    "ovos-solver-wikipedia-plugin",
    "ovos-solver-ddg-plugin",
    "ovos-solver-plugin-wolfram-alpha",
    "ovos-solver-wordnet-plugin",
    "ovos-solver-rivescript-plugin",
    "ovos-solver-failure-plugin"
  ],
  "ovos-solver-plugin-wolfram-alpha": { "appid": "YOUR_API_KEY" }
}

Find solver plugins at github.com/OpenVoiceOS.

Installation

# Base server (no A2A)
pip install ovos-persona-server

# With A2A server support
pip install 'ovos-persona-server[a2a]'

With uv (recommended in OVOS workspaces):

uv pip install 'ovos-persona-server[a2a]'

Quick Start

# Start serving a persona on port 8337
ovos-persona-server --persona /path/to/my-persona.json

# Also expose it as an A2A agent
ovos-persona-server \
  --persona /path/to/my-persona.json \
  --a2a-base-url http://localhost:8337/a2a

The server binds to 0.0.0.0:8337 by default. Visit http://localhost:8337/docs for the interactive API reference (Swagger UI).

API Surfaces

Every API is served on a vendor-prefixed path so multiple clients can coexist without conflict.

API	Prefix	Key endpoints
OpenAI	`/openai/v1`	`POST /chat/completions`, `POST /completions`, `GET /models`, `POST /embeddings`
Ollama	`/ollama/api`	`POST /chat`, `POST /generate`, `GET /tags`, `POST /embeddings`
Anthropic	`/anthropic/v1`	`POST /messages`
Google Gemini	`/gemini/v1beta/models`	`POST /{model}:generateContent`, `POST /{model}:streamGenerateContent`
Cohere	`/cohere/v1`	`POST /chat`
HuggingFace TGI	`/tgi`	`POST /generate`, `POST /generate_stream`
AWS Bedrock	`/bedrock/model`	`POST /{model}/invoke`, `POST /{model}/invoke-with-response-stream`
A2A	`/a2a`	`GET /.well-known/agent.json`, `POST /`

Deprecated legacy paths

For backwards compatibility, /v1/... maps to /openai/v1/... and /api/... maps to /ollama/api/.... These paths send Deprecation and Link response headers and will be removed in a future major version. Migrate to the prefixed paths.

Quick test with curl

# OpenAI-compatible chat
curl -s http://localhost:8337/openai/v1/chat/completions \
  -H 'Content-Type: application/json' \
  -d '{"model":"","messages":[{"role":"user","content":"hello"}]}' \
  | python3 -m json.tool

# Ollama-compatible chat
curl -s http://localhost:8337/ollama/api/chat \
  -H 'Content-Type: application/json' \
  -d '{"model":"","messages":[{"role":"user","content":"hello"}]}'

A2A Endpoint

ovos-persona-server can expose your persona as a standard A2A agent server, enabling any A2A client to interact with it — including ovos-a2a-agent running on another OVOS instance.

Enable A2A

ovos-persona-server \
  --persona my-persona.json \
  --a2a-base-url http://myhost:8337/a2a

The --a2a-base-url flag:

Activates the A2A endpoint at /a2a.
Sets the url field in the Agent Card returned at GET /a2a/.well-known/agent.json.
Must be the publicly reachable URL of the /a2a mount — this is what A2A clients use to discover the server.

Verify

# Fetch the Agent Card
curl http://localhost:8337/a2a/.well-known/agent.json | python3 -m json.tool

# Send a message
curl -X POST http://localhost:8337/a2a/ \
  -H 'Content-Type: application/json' \
  -d '{
    "jsonrpc": "2.0",
    "id": "1",
    "method": "message/send",
    "params": {
      "message": {
        "role": "user",
        "parts": [{"kind": "text", "text": "hello"}]
      }
    }
  }'

Connecting ovos-a2a-agent to this server

On another OVOS instance:

{
  "name": "remote-persona",
  "chat_module": "ovos-a2a-agent",
  "ovos-a2a-agent": {
    "url": "http://myhost:8337/a2a"
  }
}

A2A streaming

The A2A endpoint supports message/stream. Persona sentence chunks are emitted as TaskArtifactUpdateEvent SSE events. Enable streaming on the client side (e.g. "streaming": true in ovos-a2a-agent config).

A2A without `a2a-sdk`

If a2a-sdk is not installed and --a2a-base-url is provided, the server starts normally and logs a warning. All other API surfaces continue to work.

Persona Config Examples

LLM persona (OpenAI-compatible backend)

{
  "name": "gpt-persona",
  "chat_module": "ovos-openai-plugin",
  "ovos-openai-plugin": {
    "api_key": "sk-...",
    "model": "gpt-4o-mini"
  }
}

Knowledge-base + LLM fallback

{
  "name": "smart-assistant",
  "solvers": [
    "ovos-solver-wikipedia-plugin",
    "ovos-solver-ddg-plugin",
    "ovos-solver-wordnet-plugin",
    "ovos-openai-plugin",
    "ovos-solver-failure-plugin"
  ],
  "ovos-openai-plugin": {
    "api_key": "sk-...",
    "model": "gpt-4o-mini"
  }
}

Rivescript chatbot (no GPU, no API key)

{
  "name": "rivescript-bot",
  "solvers": [
    "ovos-solver-rivescript-plugin",
    "ovos-solver-failure-plugin"
  ]
}

Streaming

All seven non-A2A APIs support SSE streaming where the upstream spec defines it. Pass "stream": true (OpenAI / Cohere / TGI) or the equivalent for each API. See docs/streaming.md for per-API details.

OPM Tool Plugins — MCP and UTCP exposure

Installed ToolBox plugins (OPM entry-point group opm.agents.toolbox) are automatically surfaced over two protocols when the server starts.

Installing the MCP extra

pip install ovos-persona-server[mcp]

Without the [mcp] extra only the UTCP endpoints are active.

UTCP — Universal Tool Calling Protocol

Two endpoints are added at /tools:

Method	Path	Description
`GET`	`/tools/manual`	Returns a UTCP manual JSON listing all tools
`POST`	`/tools/{name}`	Invoke a tool by name with a JSON body

Fetch the manual:

curl http://localhost:8337/tools/manual

Response shape:

{
  "utcp_version": "1.0",
  "tools": [
    {
      "name": "my_tool",
      "description": "Does something useful.",
      "tool_provider": {
        "type": "http",
        "method": "POST",
        "url": "http://localhost:8337/tools/my_tool",
        "content_type": "application/json"
      },
      "inputs": [
        {"name": "query", "type": "string", "required": true, "description": "Search query"}
      ],
      "output_schema": { ... }
    }
  ]
}

Invoke a tool:

curl -X POST http://localhost:8337/tools/my_tool \
     -H "Content-Type: application/json" \
     -d '{"query": "hello"}'

MCP — Model Context Protocol

When the [mcp] extra is installed, the server mounts an MCP SSE endpoint at /mcp. Each installed ToolBox tool is registered as an MCP tool with the name, description, and JSON Schema derived from its OPM definition.

Claude Desktop / MCP client config:

{
  "mcpServers": {
    "ovos-persona-tools": {
      "url": "http://localhost:8337/mcp/sse"
    }
  }
}

Standalone stdio MCP server (for clients that spawn a subprocess):

ovos-persona-tools-mcp

This runs the same tool set over the stdio MCP transport.

Writing a ToolBox plugin

Implement ToolBox from ovos_plugin_manager.templates.agent_tools and register it under the opm.agents.toolbox entry-point group:

# pyproject.toml
[project.entry-points."opm.agents.toolbox"]
my_toolbox = "my_package.toolbox:MyToolBox"

The server picks it up automatically on the next start.

Client side usage

The OpenAI and Ollama routers expose /embeddings endpoints. These require a solver plugin that implements get_embeddings(text). If no such solver is loaded the endpoint returns HTTP 501. See docs/embeddings.md.

Authentication

The server itself does not enforce authentication — deploy behind a reverse proxy (nginx, Caddy, Traefik) with TLS and auth if public exposure is required. For the A2A endpoint, A2A clients that require bearer tokens can be configured on the client side (api_key in ovos-a2a-agent config).

Troubleshooting

Failed to load persona (500 on startup) The persona JSON file was not found or is invalid. Check the --persona path and validate the JSON.

All requests return 500 Persona chat failed The underlying solver chain failed. Check solver plugin installation and their individual configs (API keys, model paths, etc.).

A2A endpoint not available after starting with --a2a-base-url a2a-sdk is not installed. Install it:

uv pip install 'ovos-persona-server[a2a]'

Then restart the server.

Embeddings return 501 No solver with get_embeddings() is loaded. Add an embeddings solver to the persona's solvers list.

Legacy /v1/ paths return responses with Deprecation header This is expected. Migrate to /openai/v1/ paths. See docs/deprecation.md.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.12.0a4 pre-release

Jun 13, 2026

0.12.0a3 pre-release

Jun 12, 2026

This version

0.12.0a2 pre-release

Jun 12, 2026

0.12.0a1 pre-release

Jun 12, 2026

0.11.0a1 pre-release

Jun 12, 2026

0.10.0a1 pre-release

Jun 12, 2026

0.9.0a1 pre-release

Jun 12, 2026

0.8.0a1 pre-release

Jun 12, 2026

0.7.0a1 pre-release

Jun 12, 2026

0.6.0a1 pre-release

Jun 12, 2026

0.5.2a2 pre-release

Jun 11, 2026

0.5.2a1 pre-release

Jun 11, 2026

0.5.1a3 pre-release

Jun 10, 2026

0.5.1a1 pre-release

Dec 18, 2025

0.5.0

Nov 8, 2025

0.4.1a1 pre-release

Oct 17, 2025

0.4.0

Apr 10, 2025

0.4.0a1 pre-release

Apr 10, 2025

0.3.2

Mar 5, 2025

0.3.2a1 pre-release

Mar 5, 2025

0.3.1

Mar 5, 2025

0.3.1a1 pre-release

Mar 5, 2025

0.3.0

Mar 5, 2025

0.3.0a1 pre-release

Mar 5, 2025

0.2.1

Mar 5, 2025

0.2.0

Jan 28, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ovos_persona_server-0.12.0a2.tar.gz (65.3 kB view details)

Uploaded Jun 12, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

ovos_persona_server-0.12.0a2-py3-none-any.whl (56.0 kB view details)

Uploaded Jun 12, 2026 Python 3

File details

Details for the file ovos_persona_server-0.12.0a2.tar.gz.

File metadata

Download URL: ovos_persona_server-0.12.0a2.tar.gz
Upload date: Jun 12, 2026
Size: 65.3 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for ovos_persona_server-0.12.0a2.tar.gz
Algorithm	Hash digest
SHA256	`a5b0d4defef808300d56cae7458055bc3b9a85cb1ee19479ea13d12bf593a235`
MD5	`b61469560810d82a002c26dd4cdb9801`
BLAKE2b-256	`e9c05b3658b15969a366b6c526c7a2bd4382f8fe8aa65c0cdb964cd9e4019c78`

See more details on using hashes here.

File details

Details for the file ovos_persona_server-0.12.0a2-py3-none-any.whl.

File metadata

Download URL: ovos_persona_server-0.12.0a2-py3-none-any.whl
Upload date: Jun 12, 2026
Size: 56.0 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for ovos_persona_server-0.12.0a2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`263e5fbf006a36291488d64b78a961e47e7a542011fd007361ff1b57db5d77c3`
MD5	`58dc716cc45f91da38d538671b685b31`
BLAKE2b-256	`5c07018b97cb21330ae6980260512ad5ecc3bafeb621df70d7e1b63350e67f86`

See more details on using hashes here.

ovos-persona-server 0.12.0a2

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Project description

ovos-persona-server

Table of Contents

What is a Persona?

Installation

Quick Start

API Surfaces

Deprecated legacy paths

Quick test with curl

A2A Endpoint

Enable A2A

Verify

Connecting ovos-a2a-agent to this server

A2A streaming

A2A without a2a-sdk

Persona Config Examples

LLM persona (OpenAI-compatible backend)

Knowledge-base + LLM fallback

Rivescript chatbot (no GPU, no API key)

Streaming

OPM Tool Plugins — MCP and UTCP exposure

Installing the MCP extra

UTCP — Universal Tool Calling Protocol

MCP — Model Context Protocol

Writing a ToolBox plugin

Client side usage

Authentication

Troubleshooting

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

A2A without `a2a-sdk`