
Text-to-speech for AI agents and developers. Compiler → Graph → Engine architecture.


日本語 | 中文 | Español | Français | हिन्दी | Italiano | Português (BR)


Give your AI agents a voice that feels real.

Python 3.10+ · MIT License

Part of MCP Tool Shop — practical developer tools that stay out of your way.


Voice Soundboard is a text-to-speech engine built for developers who need more than just a .mp3 file.

Most TTS libraries force a choice: high-level APIs that hide everything, or low-level toolkits that demand audio-engineering knowledge. Voice Soundboard gives you both.

  • Simple High-Level API: Just call engine.speak("Hello") and get audio.
  • Powerful Internals: Under the hood, we use a Compiler/Graph/Engine architecture that separates what is said (intent, emotion) from how it's rendered (backend, audio format).
  • Zero-Cost Abstractions: Emotions, styles, and SSML are compiled into a control graph, so the runtime engine stays fast and lightweight.

Quick Start

pip install voice-soundboard

from voice_soundboard import VoiceEngine

# Easy text-to-speech
engine = VoiceEngine()
result = engine.speak("Hello world! This is my AI voice.")
print(f"Saved to: {result.audio_path}")

Architecture

compile_request("text", emotion="happy")
        |
    ControlGraph (pure data)
        |
    engine.synthesize(graph)
        |
    PCM audio (numpy array)

The compiler transforms intent (text + emotion + style) into a ControlGraph.

The engine transforms the graph into audio. It knows nothing about emotions or styles.

This separation means:

  • Features are "free" at runtime (already baked into the graph)
  • Engine is tiny, fast, testable
  • Backends are swappable without touching feature logic
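The split can be illustrated with a minimal, self-contained sketch. These are plain dataclasses standing in for the real types; the field names and emotion table here are assumptions for illustration, not the library's actual schema:

```python
from dataclasses import dataclass

# Illustrative stand-in for ControlGraph: pure data, prosody already resolved.
@dataclass
class Graph:
    text: str
    rate: float = 1.0
    pitch: float = 0.0

# Compiler side: the only place that knows what "happy" means.
EMOTION_PROSODY = {"happy": (1.1, 2.0), "calm": (0.9, -1.0)}

def compile_request(text, emotion=None):
    rate, pitch = EMOTION_PROSODY.get(emotion, (1.0, 0.0))
    return Graph(text=text, rate=rate, pitch=pitch)

# Engine side: reads graph fields only; the word "emotion" never appears here.
def synthesize(graph):
    return f"pcm:{graph.text}@rate={graph.rate},pitch={graph.pitch}"
```

Swapping backends then means swapping `synthesize` implementations; the compiler and all feature logic stay untouched.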

Usage

Basic

from voice_soundboard import VoiceEngine

engine = VoiceEngine()

# Simple
result = engine.speak("Hello world!")

# With voice
result = engine.speak("Cheerio!", voice="bm_george")

# With preset
result = engine.speak("Breaking news!", preset="announcer")

# With emotion
result = engine.speak("I'm so happy!", emotion="excited")

# With natural language style
result = engine.speak("Good morning!", style="warmly and cheerfully")

Advanced: Direct Graph Manipulation

from voice_soundboard.compiler import compile_request
from voice_soundboard.engine import load_backend

# Compile once
graph = compile_request(
    "Hello world!",
    voice="af_bella",
    emotion="happy",
)

# Synthesize many times (or with different backends)
backend = load_backend("kokoro")
audio = backend.synthesize(graph)

Streaming

Streaming operates at two levels:

  1. Graph streaming: compile_stream() yields ControlGraphs as sentence boundaries are detected
  2. Audio streaming: StreamingSynthesizer chunks audio for real-time playback

Note: This is sentence-level streaming, not word-by-word incremental synthesis. The compiler waits for sentence boundaries before yielding graphs. True incremental synthesis (speculative execution with rollback) is architecturally supported but not yet implemented.

from voice_soundboard.compiler import compile_stream
from voice_soundboard.engine import load_backend
from voice_soundboard.runtime import StreamingSynthesizer

# For LLM output or real-time text
def text_chunks():
    yield "Hello, "
    yield "how are "
    yield "you today?"

backend = load_backend()
streamer = StreamingSynthesizer(backend)

for graph in compile_stream(text_chunks()):
    for audio_chunk in streamer.stream(graph):
        play(audio_chunk)  # play() is your audio sink (e.g. a sounddevice stream)
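The sentence-level buffering that compile_stream() performs can be sketched in plain Python. This is an illustration of the behavior described above, not the library's implementation:

```python
import re

def sentence_stream(chunks):
    """Buffer incoming text chunks; yield a sentence at each boundary."""
    buf = ""
    for chunk in chunks:
        buf += chunk
        # Emit every complete sentence ending in ., ! or ? seen so far.
        while (m := re.search(r"[.!?]", buf)):
            yield buf[: m.end()].strip()
            buf = buf[m.end():]
    if buf.strip():  # flush any trailing partial sentence
        yield buf.strip()
```

Feeding it the three chunks from the example above yields one graph-sized sentence ("Hello, how are you today?") only once the "?" arrives, which is exactly why this is sentence-level rather than word-by-word streaming.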

CLI

# Speak text
voice-soundboard speak "Hello world!"

# With options
voice-soundboard speak "Breaking news!" --preset announcer --speed 1.1

# List voices
voice-soundboard voices

# List presets
voice-soundboard presets

# List emotions
voice-soundboard emotions

Backends

| Backend    | Quality   | Speed          | Sample Rate | Install                                  |
|------------|-----------|----------------|-------------|------------------------------------------|
| Kokoro     | Excellent | Fast (GPU)     | 24000 Hz    | pip install voice-soundboard[kokoro]     |
| Piper      | Great     | Fast (CPU)     | 22050 Hz    | pip install voice-soundboard[piper]      |
| OpenAI     | Excellent | Cloud          | 24000 Hz    | pip install voice-soundboard[openai]     |
| Coqui      | Great     | Moderate (GPU) | 22050 Hz    | pip install voice-soundboard[coqui]      |
| ElevenLabs | Premium   | Cloud          | 44100 Hz    | pip install voice-soundboard[elevenlabs] |
| Azure      | Excellent | Cloud          | 24000 Hz    | pip install voice-soundboard[azure]      |
| Mock       | N/A       | Instant        | 24000 Hz    | (built-in, for testing)                  |

Kokoro Setup

pip install voice-soundboard[kokoro]

# Download models
mkdir models && cd models
curl -LO https://github.com/thewh1teagle/kokoro-onnx/releases/download/model-files-v1.0/kokoro-v1.0.onnx
curl -LO https://github.com/thewh1teagle/kokoro-onnx/releases/download/model-files-v1.0/voices-v1.0.bin

Piper Setup

pip install voice-soundboard[piper]

# Download a voice (example: en_US-lessac-medium)
python -m piper.download_voices en_US-lessac-medium

Piper features:

  • 30+ voices across multiple languages (English, German, French, Spanish)
  • Pure CPU - no GPU required
  • Speed control via length_scale (inverted: 0.8 = faster, 1.2 = slower)
  • Sample rate: 22050 Hz (backend-specific)
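Because length_scale is inverted relative to a speed multiplier, a small conversion helper makes the relationship explicit (a hypothetical helper for illustration, not part of the package):

```python
def speed_to_length_scale(speed: float) -> float:
    # length_scale stretches phoneme durations, so it is the reciprocal of
    # speed: speed 1.25 -> length_scale 0.8 (faster), speed 0.8 -> 1.25 (slower).
    if speed <= 0:
        raise ValueError("speed must be positive")
    return 1.0 / speed
```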

Voice mapping from Kokoro:

# These Kokoro voices have Piper equivalents
from voice_soundboard import VoiceEngine, Config  # import path for Config assumed

engine = VoiceEngine(Config(backend="piper"))
result = engine.speak("Hello!", voice="af_bella")  # maps to en_US-lessac-medium

Package Structure

voice_soundboard/
├── graph/          # ControlGraph, TokenEvent, SpeakerRef
├── compiler/       # Text -> Graph (all features live here)
│   ├── text.py     # Tokenization, normalization
│   ├── emotion.py  # Emotion -> prosody
│   ├── style.py    # Natural language style
│   └── compile.py  # Main entry point
├── engine/         # Graph -> PCM (no features, just synthesis)
│   └── backends/   # Kokoro, Piper, OpenAI, Coqui, ElevenLabs, Azure, Mock
├── runtime/        # Streaming, timeline, ducking
├── adapters/       # CLI, public API (thin wrappers)
├── streaming/      # Incremental word-by-word synthesis
├── conversation/   # Multi-speaker dialogue
├── cloning/        # Speaker embedding extraction
├── speakers/       # Speaker database
├── realtime/       # Low-latency streaming engine
├── plugins/        # Plugin architecture
├── quality/        # Voice quality metrics
├── formats/        # Audio format conversion, LUFS
├── debug/          # Graph visualization, profiler
├── testing/        # VoiceMock, AudioAssertions
└── accessibility/  # Screen reader integration, captions

Key invariant: engine/ never imports from compiler/.

Architecture Invariants

These rules are enforced in tests and must never be violated:

  1. Engine isolation: engine/ never imports from compiler/. The engine knows nothing about emotions, styles, or presets -- only ControlGraphs.

  2. Voice cloning boundary: Raw audio never reaches the engine. The compiler extracts speaker embeddings; the engine receives only embedding vectors via SpeakerRef.

  3. Graph stability: GRAPH_VERSION (currently 1) is bumped on breaking changes to ControlGraph. Backends can check this for compatibility.

from voice_soundboard.graph import GRAPH_VERSION, ControlGraph
assert GRAPH_VERSION == 1
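A test enforcing the engine-isolation invariant can be sketched with the standard library's ast module. This is illustrative; the package's actual enforcement test may differ:

```python
import ast
import pathlib

def forbidden_imports(pkg_dir: str, forbidden: str = "voice_soundboard.compiler"):
    """Return (file, module) pairs where code under pkg_dir imports compiler code."""
    hits = []
    for path in pathlib.Path(pkg_dir).rglob("*.py"):
        for node in ast.walk(ast.parse(path.read_text())):
            if isinstance(node, ast.Import):
                modules = [alias.name for alias in node.names]
            elif isinstance(node, ast.ImportFrom):
                modules = [node.module or ""]
            else:
                continue
            hits += [(path.name, m) for m in modules if m.startswith(forbidden)]
    return hits

# In a test suite:
# assert forbidden_imports("voice_soundboard/engine") == []
```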

Migration

The public API is unchanged across all major versions:

# This works in v1, v2, and v3
from voice_soundboard import VoiceEngine
engine = VoiceEngine()
result = engine.speak("Hello!", voice="af_bella", emotion="happy")

v2 → v3

v3 removes 11 speculative modules that shipped with zero test coverage (distributed, serverless, intelligence, analytics, monitoring, security, ambiance, scenes, spatial, mcp, v3-alpha). The public API is unchanged. If you imported removed internals, see CHANGELOG.md for details.

v1 → v2

| v1                | v2+                       |
|-------------------|---------------------------|
| engine.py         | adapters/api.py           |
| emotions.py       | compiler/emotion.py       |
| interpreter.py    | compiler/style.py         |
| engines/kokoro.py | engine/backends/kokoro.py |

Security & Data Scope

  • Data accessed: Reads text input for TTS synthesis. Processes audio through configured backends (Kokoro, Piper, or mock). Returns PCM audio as numpy arrays or WAV files.
  • Data NOT accessed: No network egress by default (backends are local). No telemetry, analytics, or tracking. No user data storage beyond transient audio buffers.
  • Permissions required: Read access to TTS model files. Optional write access for audio output.

See SECURITY.md for vulnerability reporting.

Scorecard

| Category            | Score |
|---------------------|-------|
| A. Security         | 10/10 |
| B. Error Handling   | 10/10 |
| C. Operator Docs    | 10/10 |
| D. Shipping Hygiene | 10/10 |
| E. Identity (soft)  | 10/10 |
| Overall             | 50/50 |

Evaluated with @mcptoolshop/shipcheck

License

MIT -- see LICENSE for details.


Built by MCP Tool Shop
