TTS MCP server and CLI for language learning (ElevenLabs, AWS Polly, OpenAI)

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

These details have not been verified by PyPI

Project description

langlearn-tts

A Claude Desktop extension that gives Claude the ability to speak. Ask Claude to pronounce words, generate audio flashcards, or run a full language lesson with audio — in 70+ languages.

Status (2026-02-21)

Core CLI and MCP surfaces are working with ElevenLabs, OpenAI TTS, and AWS Polly.
Provider defaults and audio variants (word + example) are still being standardized.
Claude Desktop directory submission is pending external review.

Roadmap

See ROADMAP.md.

Quick Start

1. Get a TTS API key

You need an account with at least one text-to-speech provider:

ElevenLabs — best quality, 70+ languages, 5,000+ voices. Free tier: 10K chars/month. (Recommended)
OpenAI TTS — good quality, easiest setup, 57 languages, 9 voices.
AWS Polly — better quality, 41 languages, 100+ voices, difficult setup (setup guide).

2. Install in Claude Desktop

Download punt-langlearn-tts.mcpb and double-click to install. Claude Desktop will prompt you for your API key and an output directory.

3. Set up a tutor project (optional)

langlearn-tts ships with 28 tutor prompts — one for each combination of 7 languages and 4 levels. Setting up a project gives Claude a tutor persona that generates audio during lessons.

In Claude Desktop, click Projects in the sidebar
Click Create Project and name it (e.g., "German with Herr Schmidt")
Open the project, click Set custom instructions
Copy a prompt from the prompts directory and paste it into the Instructions field
Start a new conversation within that project

Language	High School	1st Year	2nd Year	Advanced
German	Herr Schmidt	Professorin Weber	Professor Hartmann	Professor Becker
Spanish	Profesora Elena	Profesor Garcia	Profesora Carmen	Profesora Reyes
French	Madame Moreau	Professeur Laurent	Professeur Dubois	Professeur Beaumont
Russian	Irina Petrovna	Professor Dmitri	Professor Natasha	Professor Mikhail
Korean	Kim-seonsaengnim	Professor Park	Professor Kim	Professor Yoon
Japanese	Tanaka-sensei	Yamamoto-sensei	Suzuki-sensei	Mori-sensei
Chinese	Laoshi Wang	Professor Chen	Professor Zhang	Professor Wei

Each prompt is calibrated to the student's level, based on Mollick & Mollick's "Assigning AI" framework.

4. Try it out

In any Claude Desktop conversation, try:

"Say 'Guten Morgen' in German"

"Create an audio flashcard: 'good morning' in English, then 'Guten Morgen' in German"

"Synthesize these Spanish words as a merged audio file: hola, gracias, por favor, de nada"

"Generate pair flashcards for these German vocabulary words: strong/stark, house/Haus, book/Buch"

Audio plays automatically after each request. Files are saved to your output directory (~/langlearn-audio by default).

Features

Pronounce anything — ask Claude to say a word or phrase and hear it spoken aloud
Audio flashcards — Claude creates an MP3 with English first, then the target language, with a pause between them
Vocabulary lists — give Claude a list of words and get back individual or merged audio files
70+ languages — German, Spanish, French, Russian, Korean, Japanese, Chinese, and many more
Tutor mode — 28 built-in tutor personas that teach with audio throughout the lesson
Multiple voices — each provider offers a range of voices; ask Claude to use a specific one by name
Adjustable speed — audio defaults to 90% speed so learners can hear pronunciation clearly

Troubleshooting

If something isn't working, ask Claude to run a health check:

"Run the doctor command to check if everything is set up correctly"

Logs are written to ~/.langlearn-tts/logs/langlearn-tts.log (never contains the text you synthesize). See PRIVACY.md for details.

Developer Reference

Everything below is for developers using the CLI, integrating with other MCP clients, or contributing to the project.

Claude Code / CLI

curl -fsSL https://raw.githubusercontent.com/punt-labs/langlearn-tts/fd8199e/install.sh | sh

The default provider is AWS Polly. To use a different provider:

LANGLEARN_TTS_PROVIDER=elevenlabs curl -fsSL https://raw.githubusercontent.com/punt-labs/langlearn-tts/fd8199e/install.sh | sh

Manual install (if you already have uv)

uv tool install punt-langlearn-tts
langlearn-tts install --provider polly
langlearn-tts doctor

Verify before running

curl -fsSL https://raw.githubusercontent.com/punt-labs/langlearn-tts/fd8199e/install.sh -o install.sh
shasum -a 256 install.sh
cat install.sh
sh install.sh

Install ffmpeg for audio stitching (pairs, merged batches):

# macOS (requires Homebrew — install from https://brew.sh if needed)
brew install ffmpeg

# Linux — see https://ffmpeg.org/download.html for your distro

# Windows
winget install --id Gyan.FFmpeg

Claude Desktop setup via CLI

langlearn-tts install

Writes to ~/Library/Application Support/Claude/claude_desktop_config.json (macOS). Options: --provider NAME, --output-dir PATH, --uvx-path PATH. Restart Claude Desktop after running.

Or add manually:

{
  "mcpServers": {
    "langlearn-tts": {
      "command": "/absolute/path/to/uvx",
      "args": ["--from", "punt-langlearn-tts", "langlearn-tts-server"],
      "env": {
        "LANGLEARN_TTS_OUTPUT_DIR": "/absolute/path/to/output/directory"
      }
    }
  }
}

Claude Desktop does not inherit your shell environment. API keys must be literal values (env var references are not supported). Restart after editing.

Environment variables

Env var	Required	Description
`LANGLEARN_TTS_PROVIDER`	No	`elevenlabs`, `polly` (default when no API key), or `openai`
`ELEVENLABS_API_KEY`	For ElevenLabs	Your API key
`OPENAI_API_KEY`	For OpenAI	Your API key
`LANGLEARN_TTS_OUTPUT_DIR`	No	Output directory (default: `~/langlearn-audio`)
`LANGLEARN_TTS_MODEL`	No	Model name. ElevenLabs: `eleven_v3` (default). OpenAI: `tts-1`, `tts-1-hd`

For Polly, AWS credentials are read from ~/.aws/credentials.

CLI Usage

# Single synthesis
langlearn-tts synthesize "Guten Morgen" --voice daniel -o morning.mp3

# Custom speech rate (percentage, default 90)
langlearn-tts synthesize "Привет" --voice tatyana --rate 70 -o privet.mp3

# ElevenLabs with voice settings
langlearn-tts synthesize "Guten Morgen" --voice Rachel \
  --stability 0.5 --similarity 0.7 --style 0.3 --speaker-boost

# Pair: English + German stitched with a pause
langlearn-tts synthesize-pair "good morning" "Guten Morgen" \
  --voice1 joanna --voice2 daniel -o pair.mp3

# Batch from JSON file (["hello", "world", "good morning"])
langlearn-tts synthesize-batch words.json -d output/

# Batch merged into single file
langlearn-tts synthesize-batch words.json -d output/ --merge --pause 800

# Pair batch from JSON file ([["strong", "stark"], ["house", "Haus"]])
langlearn-tts synthesize-pair-batch pairs.json -d output/

# Browse AI tutor prompts
langlearn-tts prompt list
langlearn-tts prompt show german-high-school | pbcopy

Voices

ElevenLabs — 5,000+ voices. Any voice works with any language. You can also pass a voice ID directly (the 20-character string from the ElevenLabs dashboard). Voice settings: --stability, --similarity, --style (0.0–1.0), --speaker-boost (flag).

AWS Polly — 93 voices from the AWS Polly voice list. Each voice is trained for a specific language. Engine (neural, standard, generative, long-form) is selected automatically.

OpenAI TTS — 9 voices: alloy, ash, coral, echo, fable, onyx, nova, sage, shimmer. Default model: tts-1. Use --model tts-1-hd for higher quality.

All voice names are case-insensitive.

MCP Tools

Tool	Description
`synthesize`	Single text to MP3
`synthesize_batch`	Multiple texts, optionally merged
`synthesize_pair`	Two texts stitched with a pause
`synthesize_pair_batch`	Multiple pairs, optionally merged

Each tool accepts auto_play (default: true) to play audio immediately after synthesis.

Other MCP clients

langlearn-tts works with any MCP client that supports stdio transport. Use the server command uvx --from punt-langlearn-tts langlearn-tts-server with the environment variables above. Find your uvx path with which uvx — all paths must be absolute.

Development

git clone https://github.com/punt-labs/langlearn-tts.git
cd langlearn-tts
uv sync --all-extras

uv run pytest tests/ -v
uv run ruff check src/ tests/
uv run ruff format src/ tests/
uv run mypy src/ tests/
uv run pyright src/ tests/

License

MIT

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

jmf-pobox

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.7.2

Mar 8, 2026

This version

0.7.1

Feb 28, 2026

0.6.4

Feb 25, 2026

0.6.3

Feb 23, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

punt_langlearn_tts-0.7.1.tar.gz (48.6 kB view details)

Uploaded Feb 28, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

punt_langlearn_tts-0.7.1-py3-none-any.whl (91.5 kB view details)

Uploaded Feb 28, 2026 Python 3

File details

Details for the file punt_langlearn_tts-0.7.1.tar.gz.

File metadata

Download URL: punt_langlearn_tts-0.7.1.tar.gz
Upload date: Feb 28, 2026
Size: 48.6 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for punt_langlearn_tts-0.7.1.tar.gz
Algorithm	Hash digest
SHA256	`7daf325c4e02186ff02d227abb58f9a7486c8b222ac7cd35650bce751d733642`
MD5	`cf4f6fd279cafa8e9cb084570756c789`
BLAKE2b-256	`dc150445a937c5de51185ac17c5c75b9bc394bc8856d174ee372ede8b50401e5`

See more details on using hashes here.

Provenance

The following attestation bundles were made for punt_langlearn_tts-0.7.1.tar.gz:

Publisher: release.yml on punt-labs/langlearn-tts

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: punt_langlearn_tts-0.7.1.tar.gz
- Subject digest: 7daf325c4e02186ff02d227abb58f9a7486c8b222ac7cd35650bce751d733642
- Sigstore transparency entry: 1004825909
- Sigstore integration time: Feb 28, 2026
Source repository:
- Permalink: punt-labs/langlearn-tts@c2db86341eeeb72be05f82022d3883068b2a5fc5
- Branch / Tag: refs/tags/v0.7.1
- Owner: https://github.com/punt-labs
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@c2db86341eeeb72be05f82022d3883068b2a5fc5
- Trigger Event: workflow_dispatch

File details

Details for the file punt_langlearn_tts-0.7.1-py3-none-any.whl.

File metadata

Download URL: punt_langlearn_tts-0.7.1-py3-none-any.whl
Upload date: Feb 28, 2026
Size: 91.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for punt_langlearn_tts-0.7.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`ac968be3b09aaec44d0c1a57d4b6a5e971b45ae40d8f0098491aafa21f203e2e`
MD5	`8a693c6d6f0f289550896bacd7f39ffa`
BLAKE2b-256	`f82a8ba73a45f6d22b5ca7cf79de05382d489c79943c237589a28cf2e1f76168`

See more details on using hashes here.

Provenance

The following attestation bundles were made for punt_langlearn_tts-0.7.1-py3-none-any.whl:

Publisher: release.yml on punt-labs/langlearn-tts

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: punt_langlearn_tts-0.7.1-py3-none-any.whl
- Subject digest: ac968be3b09aaec44d0c1a57d4b6a5e971b45ae40d8f0098491aafa21f203e2e
- Sigstore transparency entry: 1004825912
- Sigstore integration time: Feb 28, 2026
Source repository:
- Permalink: punt-labs/langlearn-tts@c2db86341eeeb72be05f82022d3883068b2a5fc5
- Branch / Tag: refs/tags/v0.7.1
- Owner: https://github.com/punt-labs
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@c2db86341eeeb72be05f82022d3883068b2a5fc5
- Trigger Event: workflow_dispatch

punt-langlearn-tts 0.7.1

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

langlearn-tts

Status (2026-02-21)

Roadmap

Quick Start

1. Get a TTS API key

2. Install in Claude Desktop

3. Set up a tutor project (optional)

4. Try it out

Features

Troubleshooting

Developer Reference

Claude Code / CLI

Claude Desktop setup via CLI

Environment variables

CLI Usage

Voices

MCP Tools

Other MCP clients

Development

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance