TTS MCP server and CLI for language learning (ElevenLabs, AWS Polly, OpenAI)

langlearn-tts

Generate audio flashcards and vocabulary drills from text. Ask Claude to synthesize words and phrases in any language, or batch-process entire vocabulary lists from the command line. Audio is slowed to 90% speed by default so learners can hear pronunciation clearly.

The pair mode is the core workflow: give it an English word and its translation, and it produces a single MP3 — [English audio] [pause] [target language audio] — ready for Anki, spaced repetition, or passive listening.

Available as both a Claude Desktop MCP server (ask Claude to generate audio in conversation) and a CLI with identical functionality. Supports ElevenLabs (highest quality, 70+ languages), AWS Polly (best per-language pronunciation, 93 voices), and OpenAI TTS (easiest setup, 9 voices).

Features

Single synthesis — convert text to MP3 in any supported language
Batch synthesis — synthesize multiple texts, optionally merged into one file
Pair synthesis — stitch two languages together: [English] [pause] [L2]
Pair batch — batch-process vocabulary lists as stitched pairs
Auto-play — MCP tools play audio immediately after synthesis
Configurable speech rate — default 90% for learner-friendly pacing
Three providers — ElevenLabs (5,000+ voices, 70+ languages), AWS Polly (93 voices, 41 languages), or OpenAI TTS (9 voices, 57 languages)
Voice settings — ElevenLabs stability, similarity, style, and speaker boost controls
Provider selection — auto-detects ElevenLabs when ELEVENLABS_API_KEY is set; falls back to Polly; use --provider to override

Quick Start

1. Install uv (Python package manager)

If you don't have uv yet:

# macOS / Linux
curl -LsSf https://astral.sh/uv/install.sh | sh

# Windows
powershell -ExecutionPolicy ByPass -c "irm https://astral.sh/uv/install.ps1 | iex"

uv manages Python versions automatically — you don't need to install Python separately.

2. Install langlearn-tts

uv tool install langlearn-tts

This installs the langlearn-tts CLI and langlearn-tts-server MCP server globally.

3. Install ffmpeg

Required for audio stitching (pairs, merged batches). Single synthesis works without it.

# macOS (requires Homebrew — install from https://brew.sh if needed)
brew install ffmpeg

# Ubuntu/Debian
sudo apt install ffmpeg

# Windows
winget install ffmpeg

4. Configure a TTS provider

Pick one provider. When ELEVENLABS_API_KEY is set, ElevenLabs is auto-detected as the default. Otherwise, Polly is the default.
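
The selection rule above can be sketched in a few lines of Python. This is an illustration of the documented behavior, not the package's actual code: the function name resolve_provider and its parameters are hypothetical, and the override argument stands in for a value passed with --provider.

```python
import os
from typing import Mapping, Optional

VALID_PROVIDERS = ("elevenlabs", "polly", "openai")


def resolve_provider(override: Optional[str] = None,
                     env: Optional[Mapping[str, str]] = None) -> str:
    """Pick a provider name following the rule described in the README.

    Hypothetical helper: an explicit override wins, then ElevenLabs if its
    API key is present and non-empty, otherwise Polly.
    """
    env = os.environ if env is None else env
    if override is not None:
        name = override.lower()
        if name not in VALID_PROVIDERS:
            raise ValueError(f"unknown provider: {override!r}")
        return name
    if env.get("ELEVENLABS_API_KEY"):
        return "elevenlabs"
    return "polly"
```

Note that under this rule setting OPENAI_API_KEY alone does not select OpenAI; the provider has to be named explicitly.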

Option A — ElevenLabs (premium, highest quality):

5,000+ voices, 70+ languages. The most expressive and natural-sounding TTS available. Supports voice settings (stability, similarity, style, speaker boost) for fine-tuning output. Default model: eleven_v3.

export ELEVENLABS_API_KEY=sk_...

Free tier: 10K chars/month. Paid plans start at $5/month.

Option B — AWS Polly (recommended, best pronunciation):

93 dedicated per-language voices across 41 languages, neural where available. Each voice is trained for a specific language, which yields natural pronunciation for language learning. Requires an AWS account with polly:SynthesizeSpeech and polly:DescribeVoices permissions.

Install the AWS CLI, then:

aws configure

Enter your Access Key ID, Secret Access Key, and region (e.g., us-east-1). Alternatively, set AWS_ACCESS_KEY_ID, AWS_SECRET_ACCESS_KEY, and AWS_DEFAULT_REGION environment variables.

Option C — OpenAI TTS (easiest setup):

9 multilingual voices, 57 languages. Simpler to configure (one API key), but uses a single multilingual model for all languages. Pronunciation quality varies by language.

export OPENAI_API_KEY=sk-...

Pricing: $15/1M characters (tts-1) or $30/1M (tts-1-hd).
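
To get a feel for what these rates mean for a vocabulary list, here is a rough estimator. It is only a sketch: estimate_cost_usd is a hypothetical helper, the prices are the two figures quoted above, and it assumes billing counts characters the same way Python's len() does, which may not match the provider's accounting exactly.

```python
from decimal import Decimal
from typing import Iterable

# Prices per one million characters, taken from the line above.
PRICE_PER_MILLION_USD = {
    "tts-1": Decimal("15"),
    "tts-1-hd": Decimal("30"),
}


def estimate_cost_usd(texts: Iterable[str], model: str = "tts-1") -> Decimal:
    """Estimate synthesis cost for a list of texts (assumption: len() == billed chars)."""
    total_chars = sum(len(text) for text in texts)
    return PRICE_PER_MILLION_USD[model] * total_chars / Decimal(1_000_000)


# 1,000 entries of 12 characters each is 12,000 characters.
drill = ["Guten Morgen"] * 1000
print(estimate_cost_usd(drill, "tts-1"))     # 0.18
print(estimate_cost_usd(drill, "tts-1-hd"))  # 0.36
```

At these rates a typical flashcard deck costs well under a dollar to synthesize with either model.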

5. Verify

langlearn-tts doctor

All required checks should show ✓. Fix any that show ✗ before continuing.

From source (development)

git clone https://github.com/jmf-pobox/langlearn-tts-mcp.git
cd langlearn-tts-mcp
uv sync --all-extras
uv run langlearn-tts --help

Claude Desktop Setup

Automatic (recommended)

langlearn-tts install

This registers the MCP server with Claude Desktop. Auto-detects ElevenLabs when ELEVENLABS_API_KEY is set, otherwise defaults to Polly. Use --provider to override.

Options:

--provider NAME — provider (elevenlabs, polly, or openai). Default: auto-detect
--output-dir PATH — custom audio output directory (default: ~/Claude-Audio)
--uvx-path PATH — override the uvx binary path

Restart Claude Desktop after running install.

Manual

Add to ~/Library/Application Support/Claude/claude_desktop_config.json (the macOS location):

{
  "mcpServers": {
    "langlearn-tts": {
      "command": "/absolute/path/to/uvx",
      "args": ["--from", "langlearn-tts", "langlearn-tts-server"],
      "env": {
        "LANGLEARN_TTS_OUTPUT_DIR": "/absolute/path/to/output/directory"
      }
    }
  }
}

Claude Desktop does not inherit your shell environment. All paths must be absolute, and API keys (like OPENAI_API_KEY) must be literal values in this file (env var references are not supported). Find your uvx path with which uvx.

Env var	Required	Description
`LANGLEARN_TTS_PROVIDER`	No	`elevenlabs`, `polly` (default when `ELEVENLABS_API_KEY` is not set), or `openai`
`ELEVENLABS_API_KEY`	For ElevenLabs	Your literal API key
`OPENAI_API_KEY`	For OpenAI	Your literal API key
`LANGLEARN_TTS_OUTPUT_DIR`	No	Output directory (default: `~/Claude-Audio`)
`LANGLEARN_TTS_MODEL`	No	Model name. ElevenLabs: `eleven_v3` (default). OpenAI: `tts-1`, `tts-1-hd`

For Polly, AWS credentials are read from ~/.aws/credentials. For ElevenLabs or OpenAI, add the provider name and the matching API key to the env dict as literal values.
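
Because every value in this file has to be literal and every path absolute, it can help to generate the entry rather than type it by hand. The sketch below builds the same structure as the manual example. build_config and render are hypothetical helper names, not part of langlearn-tts, and the code deliberately does not write to the real config file, since an existing file may already list other servers that should be merged rather than overwritten.

```python
import json
import os.path
from typing import Dict, Optional

SERVER_NAME = "langlearn-tts"


def build_config(uvx_path: str, output_dir: str,
                 extra_env: Optional[Dict[str, str]] = None) -> dict:
    """Build a claude_desktop_config.json fragment using literal values only."""
    # Claude Desktop does not inherit the shell environment, so reject
    # anything that is not an absolute path.
    for label, path in (("uvx_path", uvx_path), ("output_dir", output_dir)):
        if not os.path.isabs(path):
            raise ValueError(f"{label} must be an absolute path, got {path!r}")
    env = {"LANGLEARN_TTS_OUTPUT_DIR": output_dir}
    env.update(extra_env or {})  # e.g. provider name and literal API key
    return {
        "mcpServers": {
            SERVER_NAME: {
                "command": uvx_path,
                "args": ["--from", "langlearn-tts", "langlearn-tts-server"],
                "env": env,
            }
        }
    }


def render(config: dict) -> str:
    """Serialize the fragment as indented JSON for pasting into the file."""
    return json.dumps(config, indent=2)
```

A call such as render(build_config(shutil.which("uvx"), "/Users/me/Claude-Audio", {"LANGLEARN_TTS_PROVIDER": "openai", "OPENAI_API_KEY": "sk-..."})) produces text ready to paste; shutil.which is the standard-library counterpart of which uvx, and the output directory shown is only a placeholder.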

Restart Claude Desktop after editing the config.

AI Tutor Prompts

langlearn-tts ships with 28 ready-made AI tutor prompts — one for each combination of 7 languages and 4 levels. Paste a prompt into a Claude Desktop Project's Instructions field, and Claude becomes a language tutor that generates audio during lessons.

Browse prompts

# List all available prompts
langlearn-tts prompt list

# Print a prompt (pipe to clipboard with pbcopy on macOS)
langlearn-tts prompt show german-high-school | pbcopy

Set up a Claude Desktop Project

1. In Claude Desktop, click Projects in the sidebar
2. Click Create Project and name it (e.g., "German with Herr Schmidt")
3. Open the project and click Set custom instructions
4. Paste the prompt content into the Instructions field
5. Start a new conversation within that project

Using a Project keeps the tutor persona scoped to language learning. Other conversations are unaffected.

Available languages and levels

Language	High School	1st Year	2nd Year	Advanced
German	Herr Schmidt	Professorin Weber	Professor Hartmann	Professor Becker
Spanish	Profesora Elena	Profesor Garcia	Profesora Carmen	Profesora Reyes
French	Madame Moreau	Professeur Laurent	Professeur Dubois	Professeur Beaumont
Russian	Irina Petrovna	Professor Dmitri	Professor Natasha	Professor Mikhail
Korean	Kim-seonsaengnim	Professor Park	Professor Kim	Professor Yoon
Japanese	Tanaka-sensei	Yamamoto-sensei	Suzuki-sensei	Mori-sensei
Chinese	Laoshi Wang	Professor Chen	Professor Zhang	Professor Wei

Each prompt creates a tutor persona calibrated to the student's level, based on Mollick & Mollick's "Assigning AI" framework. Customize any prompt by adjusting student background, voice selection, speech rate, or focus areas.

Troubleshooting

langlearn-tts doctor

Checks Python version, active provider, ffmpeg, provider-specific credentials, uvx, Claude Desktop config, and output directory. Required checks must pass (exit code 1 on failure); optional checks show ○ markers.
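
Since a failed required check is reported through the exit code, the doctor command can gate a script or CI step. A minimal sketch follows, assuming the conventional exit code 0 on success (the text above only states 1 on failure); checks_pass is a hypothetical wrapper, not something the package provides.

```python
import subprocess
from typing import Sequence


def checks_pass(command: Sequence[str] = ("langlearn-tts", "doctor")) -> bool:
    """Run the diagnostic command and report whether it exited with code 0.

    Assumption: exit code 0 means all required checks passed.
    The command's own output is passed through to the terminal.
    """
    completed = subprocess.run(list(command), check=False)
    return completed.returncode == 0
```

A batch script could call checks_pass() first and stop early with a clear message instead of failing halfway through a vocabulary list.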

Voices

ElevenLabs (premium)

5,000+ voices. Any voice works with any language. Voice names are case-insensitive. You can also pass a voice ID directly (the 20-character alphanumeric string from the ElevenLabs dashboard).
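
If you keep voices in your own scripts or notes, a small check can tell the two forms apart. This sketch relies only on the description above (a 20-character alphanumeric string); how langlearn-tts itself distinguishes names from IDs is not documented here, and both the helper name and the sample ID are made up for illustration.

```python
import re

# Matches exactly 20 ASCII letters or digits, per the description above.
_VOICE_ID = re.compile(r"[A-Za-z0-9]{20}")


def looks_like_voice_id(value: str) -> bool:
    """Return True if the value has the shape of an ElevenLabs voice ID."""
    return _VOICE_ID.fullmatch(value) is not None


print(looks_like_voice_id("Rachel"))                # False
print(looks_like_voice_id("a1B2c3D4e5F6g7H8i9J0"))  # True (made-up sample)
```

The check is a heuristic: a voice name that happens to be 20 letters long would also match.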

Popular voices for language learning:

Voice	Description
Rachel	Calm, clear, American English
Drew	Warm, conversational
Clyde	Deep, authoritative
Domi	Expressive, young
Bella	Soft, gentle

Voice settings (ElevenLabs only):

Flag	Range	Description
`--stability`	0.0-1.0	Higher = more consistent, lower = more expressive
`--similarity`	0.0-1.0	Higher = closer to original voice, lower = more variation
`--style`	0.0-1.0	Higher = more expressive/dramatic
`--speaker-boost`	flag	Enhances voice clarity at the cost of generation speed

AWS Polly

Any voice from the AWS Polly voice list is supported. Voice names are case-insensitive. Each voice is trained for a specific language, which gives Polly the most native-sounding pronunciation of the three providers. The tool queries the Polly API for the voice list on first use and caches the result.

Common voices for language learning:

Voice	Language	Engine
joanna	English (US)	neural
matthew	English (US)	neural
daniel	German	neural
vicki	German (female)	neural
lucia	Spanish (European)	neural
lupe	Spanish (US)	neural
léa	French	neural
tatyana	Russian	standard
seoyeon	Korean	neural
takumi	Japanese	neural
zhiyu	Chinese (Mandarin)	neural

The engine (neural, standard, generative, long-form) is selected automatically — neural preferred when available.

OpenAI TTS

9 multilingual voices. Any voice works with any language — the model infers the language from the text. Voice names are case-insensitive. Pronunciation quality varies by language; major languages tend to be solid, less-resourced languages can be inconsistent.

Voice	Description
alloy	Neutral, balanced
ash	Warm, conversational
coral	Clear, expressive
echo	Smooth, authoritative
fable	Warm, British-accented
onyx	Deep, resonant
nova	Friendly, upbeat
sage	Calm, measured
shimmer	Light, gentle

Select the model with --model tts-1 (faster, cheaper) or --model tts-1-hd (higher quality).

CLI Usage

# Single synthesis
langlearn-tts synthesize "Guten Morgen" --voice daniel -o morning.mp3

# Custom speech rate (percentage, default 90)
langlearn-tts synthesize "Привет" --voice tatyana --rate 70 -o privet.mp3

# ElevenLabs with voice settings
langlearn-tts synthesize "Guten Morgen" --voice Rachel \
  --stability 0.5 --similarity 0.7 --style 0.3 --speaker-boost

# Pair: English + German stitched with a pause
langlearn-tts synthesize-pair "good morning" "Guten Morgen" \
  --voice1 joanna --voice2 daniel -o pair.mp3

# Batch from JSON file (["hello", "world", "good morning"])
langlearn-tts synthesize-batch words.json -d output/

# Batch merged into single file
langlearn-tts synthesize-batch words.json -d output/ --merge --pause 800

# Pair batch from JSON file ([["strong", "stark"], ["house", "Haus"]])
langlearn-tts synthesize-pair-batch pairs.json -d output/

# Browse AI tutor prompts
langlearn-tts prompt list
langlearn-tts prompt show german-high-school | pbcopy
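
The batch commands above read their input from JSON files whose shapes are shown in the comments: a flat array of strings for synthesize-batch, and an array of two-element arrays for synthesize-pair-batch. The sketch below writes both files from Python data. write_batch_inputs is a hypothetical helper, and writing UTF-8 with non-ASCII characters left unescaped is an assumption about what the CLI accepts.

```python
import json
from pathlib import Path
from typing import Iterable, Sequence, Tuple


def write_batch_inputs(words: Sequence[str],
                       pairs: Iterable[Tuple[str, str]],
                       directory: Path) -> Tuple[Path, Path]:
    """Write words.json and pairs.json in the shapes shown in the CLI examples."""
    pair_rows = []
    for pair in pairs:
        if len(pair) != 2:
            raise ValueError(f"each pair needs exactly two texts, got {pair!r}")
        pair_rows.append(list(pair))

    directory.mkdir(parents=True, exist_ok=True)
    words_path = directory / "words.json"
    pairs_path = directory / "pairs.json"
    # ensure_ascii=False keeps text such as "Привет" readable in the file.
    words_path.write_text(
        json.dumps(list(words), ensure_ascii=False, indent=2), encoding="utf-8")
    pairs_path.write_text(
        json.dumps(pair_rows, ensure_ascii=False, indent=2), encoding="utf-8")
    return words_path, pairs_path
```

The resulting files can then be passed to langlearn-tts synthesize-batch and langlearn-tts synthesize-pair-batch as in the examples above.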

MCP Tools

All four tools are available in Claude Desktop once the server is configured:

Tool	Description
`synthesize`	Single text to MP3
`synthesize_batch`	Multiple texts, optionally merged
`synthesize_pair`	Two texts stitched with a pause
`synthesize_pair_batch`	Multiple pairs, optionally merged

Each tool accepts auto_play (default: true) to play audio immediately after synthesis.

Development

# Install with dev dependencies
uv sync --all-extras

# Run tests
uv run pytest tests/ -v

# Linting and formatting
uv run ruff check src/ tests/
uv run ruff format src/ tests/

# Type checking
uv run mypy src/ tests/
uv run pyright src/ tests/

License

MIT
