Give your AI agent a voice — an MCP server for text-to-speech

agent-fm

Give your AI agent a voice.

An MCP server that lets AI coding agents speak to you via text-to-speech. The agent decides when and what to say — like a colleague tapping your shoulder.

No cloud API. No API keys. Runs locally on CPU.

Quick Start

# Add to Claude Code
claude mcp add agent-fm -- uvx agent-fm

# Pre-download models + verify setup (~340MB, one-time)
uvx agent-fm warmup

That's it. Your agent can now talk to you.

Linux only

# sounddevice needs PortAudio
sudo apt install libportaudio2

macOS and Windows need zero system dependencies.

What It Does

agent-fm gives your AI agent a speak() tool. Instead of you polling the terminal to check whether the agent is done, the agent tells you:

"Hey, the auth refactor is done. All tests pass."
"Quick question — should I use Redis or in-memory caching here?"
"Heads up, there's a circular import in the payments module."

The agent decides when to speak based on instructions you can customize. You just work — it interrupts you only when it matters.

How It Works

MCP server exposes speak, list_voices, and set_voice tools
Kokoro TTS (82M params) generates speech locally — 54 voices, 9 languages
Audio queue plays messages through your speakers, one at a time
AGENTS.md teaches your agent when to speak and when to stay quiet
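
The "one at a time" behavior of the audio queue can be illustrated with a single worker thread draining a FIFO queue. This is a minimal sketch, not agent-fm's actual implementation: the `SpeechQueue` class and its `play` callback are hypothetical names, and a plain list stands in for real audio output.

```python
import queue
import threading


class SpeechQueue:
    """Play queued messages strictly one at a time on a background thread."""

    def __init__(self, play):
        # `play` is any blocking callable that renders one message.
        self._play = play
        self._pending = queue.Queue()
        self._worker = threading.Thread(target=self._run, daemon=True)
        self._worker.start()

    def say(self, message):
        """Enqueue a message and return immediately."""
        self._pending.put(message)

    def wait(self):
        """Block until every queued message has finished playing."""
        self._pending.join()

    def _run(self):
        # A single worker guarantees messages never overlap.
        while True:
            message = self._pending.get()
            try:
                self._play(message)
            finally:
                self._pending.task_done()


spoken = []
speech = SpeechQueue(play=spoken.append)  # stand-in for real audio output
speech.say("Done with the auth refactor.")
speech.say("Starting the full test suite.")
speech.wait()
print(spoken)
```

Because only one thread ever calls `play`, a second message waits until the first has finished, and the caller is never blocked by playback.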

Install

Claude Code

claude mcp add agent-fm -- uvx agent-fm

Cursor

Add to .cursor/mcp.json:

{
  "mcpServers": {
    "agent-fm": {
      "command": "uvx",
      "args": ["agent-fm"]
    }
  }
}

VS Code / Copilot

Add to .vscode/mcp.json:

{
  "servers": {
    "agent-fm": {
      "command": "uvx",
      "args": ["agent-fm"]
    }
  }
}

Cline / Windsurf

Use the same "command": "uvx", "args": ["agent-fm"] entry in your platform's MCP config.
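
The Cursor and VS Code snippets above differ only in their top-level key ("mcpServers" versus "servers"). A small sketch that generates either one; the `mcp_config` helper and the `TOP_LEVEL_KEY` table are illustrative names, not part of agent-fm.

```python
import json

# Top-level key each client expects, taken from the snippets above.
TOP_LEVEL_KEY = {"cursor": "mcpServers", "vscode": "servers"}


def mcp_config(client):
    """Return the agent-fm MCP config for `client` as a dict."""
    entry = {"command": "uvx", "args": ["agent-fm"]}
    return {TOP_LEVEL_KEY[client]: {"agent-fm": entry}}


print(json.dumps(mcp_config("cursor"), indent=2))
```

For other clients, check which top-level key their MCP config uses before reusing the entry.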

pip (alternative)

pip install agent-fm
python -m agent_fm

Tools

Tool	Description
`speak`	Speak a message aloud. Params: `message`, `urgency` (info/warning/critical), `voice`, `speed`
`list_voices`	List available voices, filterable by language
`set_voice`	Change the default voice and speed for the session
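
For readers curious what a `speak` call looks like on the wire, MCP clients invoke tools with a JSON-RPC `tools/call` request. The sketch below builds one using the parameter names from the table. The `speak_request` helper is hypothetical, and the default urgency, the optionality of `voice` and `speed`, and their value types are assumptions rather than documented behavior.

```python
import json

URGENCY_LEVELS = ("info", "warning", "critical")


def speak_request(message, urgency="info", voice=None, speed=None, request_id=1):
    """Build a JSON-RPC `tools/call` request for the `speak` tool."""
    if urgency not in URGENCY_LEVELS:
        raise ValueError(f"urgency must be one of {URGENCY_LEVELS}")
    arguments = {"message": message, "urgency": urgency}
    # Only send optional parameters the caller actually set.
    if voice is not None:
        arguments["voice"] = voice
    if speed is not None:
        arguments["speed"] = speed
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": "speak", "arguments": arguments},
    }


request = speak_request("All tests pass.", urgency="info", voice="af_heart")
print(json.dumps(request, indent=2))
```

In normal use the agent's MCP client builds this request for you; the sketch is only meant to show how the table's parameters map onto a call.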

Teaching Your Agent When to Speak

agent-fm ships with an AGENTS.md you can drop into your project or ~/.claude/CLAUDE.md:

## Voice (agent-fm)

You have a `speak` tool. Use it to talk aloud — the user may not be watching the screen.

When to speak:
- Finished a task: speak("Done with the auth refactor. All tests pass.")
- Need input: speak("Quick question — should I use Redis or an in-memory cache here?")
- Found a problem: speak("Heads up, there's a circular import in the payments module.")
- About to do something big: speak("Starting the full test suite, this'll take a minute.")

Don't speak for trivial ops, every step, or to repeat what's already on screen.
1-2 sentences max. Talk like a colleague, not a robot.

This is what makes agent-fm different from a simple TTS wrapper — it teaches the agent judgment about when to interrupt you.

Voices

54 voices across 9 languages:

Language	Voices
English (US)	20 (11 female, 9 male)
English (UK)	8 (4 female, 4 male)
Japanese	5
Mandarin	8
Hindi	4
Spanish	3
French	1
Italian	2
Portuguese	3

Default voice: am_fenrir. Change anytime:

Use the set_voice tool to switch to af_heart
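
Voice IDs such as am_fenrir and af_heart appear to follow Kokoro's naming scheme: the first letter encodes the language, the second the gender, and the rest is the voice name. The sketch below decodes an ID on that assumption. The prefix table reflects Kokoro's published voice names and is assumed to carry over to agent-fm unchanged; `describe_voice` is a hypothetical helper.

```python
# Prefix letters as used in Kokoro's voice names (assumed to apply here).
LANGUAGES = {
    "a": "English (US)", "b": "English (UK)", "j": "Japanese",
    "z": "Mandarin", "h": "Hindi", "e": "Spanish",
    "f": "French", "i": "Italian", "p": "Portuguese",
}
GENDERS = {"f": "female", "m": "male"}


def describe_voice(voice_id):
    """Split an ID such as 'am_fenrir' into (language, gender, name)."""
    prefix, _, name = voice_id.partition("_")
    if len(prefix) != 2 or not name:
        raise ValueError(f"unexpected voice id: {voice_id!r}")
    return LANGUAGES[prefix[0]], GENDERS[prefix[1]], name


print(describe_voice("am_fenrir"))
print(describe_voice("af_heart"))
```

The list_voices tool remains the authoritative source for which IDs are actually available.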

CLI

agent-fm              # Run MCP server (stdio transport)
agent-fm warmup       # Download models + test setup
agent-fm --version    # Show version
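
The three invocations above map onto a parser with one optional subcommand and a version flag. This is a sketch of how such a command line could be declared with argparse, not agent-fm's actual entry point; `build_parser` and `describe` are illustrative names.

```python
import argparse


def build_parser(version="0.1.0"):
    """Argument parser mirroring the three invocations listed above."""
    parser = argparse.ArgumentParser(prog="agent-fm")
    parser.add_argument("--version", action="version", version=f"agent-fm {version}")
    subcommands = parser.add_subparsers(dest="command")
    subcommands.add_parser("warmup", help="download models and test setup")
    return parser


def describe(argv):
    """Return which action a given argument list selects."""
    args = build_parser().parse_args(argv)
    return "warmup" if args.command == "warmup" else "serve"


print(describe([]))          # no subcommand: run the MCP server
print(describe(["warmup"]))
```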

System Requirements

Platform	System deps	Notes
macOS	None	Just works
Windows	None	Just works
Linux	`sudo apt install libportaudio2`	PortAudio for audio playback

Python 3.10-3.12 recommended. espeak-ng is bundled automatically via espeakng-loader.

Models auto-download to ~/.agent-fm/models/ on first use (~340MB).
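
To check whether the models have already been downloaded, you can measure the cache directory. A minimal sketch, assuming the models live as regular files under ~/.agent-fm/models/ as stated above; `cache_size_mb` is a hypothetical helper.

```python
from pathlib import Path

MODEL_DIR = Path.home() / ".agent-fm" / "models"


def cache_size_mb(directory=MODEL_DIR):
    """Total size of all files under `directory`, in megabytes (0.0 if absent)."""
    if not directory.is_dir():
        return 0.0
    total = sum(p.stat().st_size for p in directory.rglob("*") if p.is_file())
    return total / 1_000_000


print(f"{MODEL_DIR}: {cache_size_mb():.0f} MB")
```

A result near 340 MB suggests the download completed; 0 MB means the first run or `agent-fm warmup` has not happened yet.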

Uninstall

# 1. Remove from Claude Code
claude mcp remove agent-fm
claude mcp remove agent-fm -s user    # if added globally

# 2. Remove package
uv tool uninstall agent-fm            # if installed with `uv tool install`
uv cache clean agent-fm               # if only run via uvx (clears its cached files)
# or: pip uninstall agent-fm          # if installed with pip

# 3. Remove cached models (~340MB)
rm -rf ~/.agent-fm/

Built With

FastMCP — MCP server framework
Kokoro-82M — neural TTS model (ONNX, CPU)
sounddevice — cross-platform audio

License

MIT

Version 0.1.0, released Apr 20, 2026

Download files

Source Distribution

agent_fm-0.1.0.tar.gz (11.7 kB)

Uploaded Apr 20, 2026 Source

Built Distribution

agent_fm-0.1.0-py3-none-any.whl (13.6 kB)

Uploaded Apr 20, 2026 Python 3

File details

Details for the file agent_fm-0.1.0.tar.gz.

File metadata

Download URL: agent_fm-0.1.0.tar.gz
Upload date: Apr 20, 2026
Size: 11.7 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.7

File hashes

Hashes for agent_fm-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`55d81eabca118f7d8c47e6dff6bb1a3cfa9e44f5e4fc715c1b6289c5d3631c7d`
MD5	`80ca016e1a464e246359af01de581d90`
BLAKE2b-256	`2600caa12c5bb5226b352b5846f9f31e78418b3a822f7d1ed93f4bed1f491fe7`

File details

Details for the file agent_fm-0.1.0-py3-none-any.whl.

File metadata

Download URL: agent_fm-0.1.0-py3-none-any.whl
Upload date: Apr 20, 2026
Size: 13.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.7

File hashes

Hashes for agent_fm-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`cd173fecf2ca75ac2b98fa58159a920f52c3a63dad51cd3fa9c60e3dd037176c`
MD5	`c772cf541734b4c1ed6a0621188ac300`
BLAKE2b-256	`36fd35a8d0da7da616dd3e220248ff2dedae560b98976d91d0913f9aa5a52e59`
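
A downloaded file can be checked against the SHA256 digests listed above with the standard library. This is a sketch: `sha256_of` and `matches_published_hash` are illustrative helpers, and the local path you pass in is whatever location you saved the file to.

```python
import hashlib

# SHA256 digests copied from the file details above.
EXPECTED_SHA256 = {
    "agent_fm-0.1.0.tar.gz":
        "55d81eabca118f7d8c47e6dff6bb1a3cfa9e44f5e4fc715c1b6289c5d3631c7d",
    "agent_fm-0.1.0-py3-none-any.whl":
        "cd173fecf2ca75ac2b98fa58159a920f52c3a63dad51cd3fa9c60e3dd037176c",
}


def sha256_of(path, chunk_size=65536):
    """Hex SHA256 digest of the file at `path`, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path, filename):
    """True if the file's digest equals the published one for `filename`."""
    return sha256_of(path) == EXPECTED_SHA256[filename]
```

For example, `matches_published_hash("agent_fm-0.1.0.tar.gz", "agent_fm-0.1.0.tar.gz")` should return True for an unmodified download in the current directory.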
