Skip to main content

TTS MCP server and CLI for language learning (ElevenLabs, AWS Polly, OpenAI)

Project description

langlearn-tts

PyPI GitHub Tests Python

A Claude Desktop extension that gives Claude the ability to speak. Ask Claude to pronounce words, generate audio flashcards, or run a full language lesson with audio — in 70+ languages.

Quick Start

1. Get a TTS API key

You need an account with at least one text-to-speech provider:

  • ElevenLabs — highest quality, 5,000+ voices, 70+ languages. Free tier: 10K chars/month.
  • AWS Polly — best per-language pronunciation, 93 voices. Requires an AWS account (setup guide).
  • OpenAI TTS — easiest setup, 9 voices, 57 languages.

2. Install in Claude Desktop

Download the .mcpb Desktop Extension from the latest release and double-click to install. Claude Desktop will prompt you for your API key and an output directory.

3. Set up a tutor project (optional)

langlearn-tts ships with 28 tutor prompts — one for each combination of 7 languages and 4 levels. Setting up a project gives Claude a tutor persona that generates audio during lessons.

  1. In Claude Desktop, click Projects in the sidebar
  2. Click Create Project and name it (e.g., "German with Herr Schmidt")
  3. Open the project, click Set custom instructions
  4. Copy a prompt from the prompts directory and paste it into the Instructions field
  5. Start a new conversation within that project
Language High School 1st Year 2nd Year Advanced
German Herr Schmidt Professorin Weber Professor Hartmann Professor Becker
Spanish Profesora Elena Profesor Garcia Profesora Carmen Profesora Reyes
French Madame Moreau Professeur Laurent Professeur Dubois Professeur Beaumont
Russian Irina Petrovna Professor Dmitri Professor Natasha Professor Mikhail
Korean Kim-seonsaengnim Professor Park Professor Kim Professor Yoon
Japanese Tanaka-sensei Yamamoto-sensei Suzuki-sensei Mori-sensei
Chinese Laoshi Wang Professor Chen Professor Zhang Professor Wei

Each prompt is calibrated to the student's level, based on Mollick & Mollick's "Assigning AI" framework.

4. Try it out

In any Claude Desktop conversation, try:

"Say 'Guten Morgen' in German"

"Create an audio flashcard: 'good morning' in English, then 'Guten Morgen' in German"

"Synthesize these Spanish words as a merged audio file: hola, gracias, por favor, de nada"

"Generate pair flashcards for these German vocabulary words: strong/stark, house/Haus, book/Buch"

Audio plays automatically after each request. Files are saved to your output directory (~/langlearn-audio by default).

Features

  • Pronounce anything — ask Claude to say a word or phrase and hear it spoken aloud
  • Audio flashcards — Claude creates an MP3 with English first, then the target language, with a pause between them
  • Vocabulary lists — give Claude a list of words and get back individual or merged audio files
  • 70+ languages — German, Spanish, French, Russian, Korean, Japanese, Chinese, and many more
  • Tutor mode — 28 built-in tutor personas that teach with audio throughout the lesson
  • Multiple voices — each provider offers a range of voices; ask Claude to use a specific one by name
  • Adjustable speed — audio defaults to 90% speed so learners can hear pronunciation clearly

Troubleshooting

If something isn't working, ask Claude to run a health check:

"Run the doctor command to check if everything is set up correctly"

Logs are written to ~/.langlearn-tts/logs/langlearn-tts.log (never contains the text you synthesize). See PRIVACY.md for details.


Developer Reference

Everything below is for developers using the CLI, integrating with other MCP clients, or contributing to the project.

CLI Installation

Install uv (manages Python automatically), then:

uv tool install langlearn-tts

Install ffmpeg for audio stitching (pairs, merged batches):

# macOS (requires Homebrew — install from https://brew.sh if needed)
brew install ffmpeg

# Linux — see https://ffmpeg.org/download.html for your distro

# Windows
winget install --id Gyan.FFmpeg

Verify:

langlearn-tts doctor

Claude Desktop setup via CLI

langlearn-tts install

Writes to ~/Library/Application Support/Claude/claude_desktop_config.json (macOS). Options: --provider NAME, --output-dir PATH, --uvx-path PATH. Restart Claude Desktop after running.

Or add manually:

{
  "mcpServers": {
    "langlearn-tts": {
      "command": "/absolute/path/to/uvx",
      "args": ["--from", "langlearn-tts", "langlearn-tts-server"],
      "env": {
        "LANGLEARN_TTS_OUTPUT_DIR": "/absolute/path/to/output/directory"
      }
    }
  }
}

Claude Desktop does not inherit your shell environment. API keys must be literal values (env var references are not supported). Restart after editing.

Environment variables

Env var Required Description
LANGLEARN_TTS_PROVIDER No elevenlabs, polly (default when no API key), or openai
ELEVENLABS_API_KEY For ElevenLabs Your API key
OPENAI_API_KEY For OpenAI Your API key
LANGLEARN_TTS_OUTPUT_DIR No Output directory (default: ~/langlearn-audio)
LANGLEARN_TTS_MODEL No Model name. ElevenLabs: eleven_v3 (default). OpenAI: tts-1, tts-1-hd

For Polly, AWS credentials are read from ~/.aws/credentials.

CLI Usage

# Single synthesis
langlearn-tts synthesize "Guten Morgen" --voice daniel -o morning.mp3

# Custom speech rate (percentage, default 90)
langlearn-tts synthesize "Привет" --voice tatyana --rate 70 -o privet.mp3

# ElevenLabs with voice settings
langlearn-tts synthesize "Guten Morgen" --voice Rachel \
  --stability 0.5 --similarity 0.7 --style 0.3 --speaker-boost

# Pair: English + German stitched with a pause
langlearn-tts synthesize-pair "good morning" "Guten Morgen" \
  --voice1 joanna --voice2 daniel -o pair.mp3

# Batch from JSON file (["hello", "world", "good morning"])
langlearn-tts synthesize-batch words.json -d output/

# Batch merged into single file
langlearn-tts synthesize-batch words.json -d output/ --merge --pause 800

# Pair batch from JSON file ([["strong", "stark"], ["house", "Haus"]])
langlearn-tts synthesize-pair-batch pairs.json -d output/

# Browse AI tutor prompts
langlearn-tts prompt list
langlearn-tts prompt show german-high-school | pbcopy

Voices

ElevenLabs — 5,000+ voices. Any voice works with any language. You can also pass a voice ID directly (the 20-character string from the ElevenLabs dashboard). Voice settings: --stability, --similarity, --style (0.0–1.0), --speaker-boost (flag).

AWS Polly — 93 voices from the AWS Polly voice list. Each voice is trained for a specific language. Engine (neural, standard, generative, long-form) is selected automatically.

OpenAI TTS — 9 voices: alloy, ash, coral, echo, fable, onyx, nova, sage, shimmer. Default model: tts-1. Use --model tts-1-hd for higher quality.

All voice names are case-insensitive.

MCP Tools

Tool Description
synthesize Single text to MP3
synthesize_batch Multiple texts, optionally merged
synthesize_pair Two texts stitched with a pause
synthesize_pair_batch Multiple pairs, optionally merged

Each tool accepts auto_play (default: true) to play audio immediately after synthesis.

Other MCP clients

langlearn-tts works with any MCP client that supports stdio transport. Use the server command uvx --from langlearn-tts langlearn-tts-server with the environment variables above. Find your uvx path with which uvx — all paths must be absolute.

Development

git clone https://github.com/jmf-pobox/langlearn-tts-mcp.git
cd langlearn-tts-mcp
uv sync --all-extras

uv run pytest tests/ -v
uv run ruff check src/ tests/
uv run ruff format src/ tests/
uv run mypy src/ tests/
uv run pyright src/ tests/

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

langlearn_tts-0.6.2.tar.gz (57.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

langlearn_tts-0.6.2-py3-none-any.whl (102.8 kB view details)

Uploaded Python 3

File details

Details for the file langlearn_tts-0.6.2.tar.gz.

File metadata

  • Download URL: langlearn_tts-0.6.2.tar.gz
  • Upload date:
  • Size: 57.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.0

File hashes

Hashes for langlearn_tts-0.6.2.tar.gz
Algorithm Hash digest
SHA256 8ddd08c16f402af7002113bcf75a707e3a24984fbed82904e36026c004b81d4a
MD5 825d6b4a889625fb18b9a0d61699ca21
BLAKE2b-256 979a5eb48842db146a1a0b4f607954d1bf2864f0b6b21e95037cbeae61c60471

See more details on using hashes here.

File details

Details for the file langlearn_tts-0.6.2-py3-none-any.whl.

File metadata

  • Download URL: langlearn_tts-0.6.2-py3-none-any.whl
  • Upload date:
  • Size: 102.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.0

File hashes

Hashes for langlearn_tts-0.6.2-py3-none-any.whl
Algorithm Hash digest
SHA256 d0fc7f4565b8ed676d0597c186a0294a52632cdca4172ed2eabeae78f017ed54
MD5 64237386121cd77bfffde344e959992d
BLAKE2b-256 210c8bf32cbe25088042df601caa4c6732d2de47b8290a3a0d24f07fb3b30a40

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page