AWS Polly TTS MCP server and CLI for language learning

These details have not been verified by PyPI

Project links

Project description

langlearn-tts

Text-to-speech toolkit for language learning. Provides both an MCP server (for Claude Desktop) and a CLI with identical functionality.

Currently supports AWS Polly with ElevenLabs and OpenAI TTS backends planned. The goal: pick the TTS provider that fits your setup — no AWS account required once alternative backends ship.

Features

Single synthesis — convert text to MP3 in any supported language
Batch synthesis — synthesize multiple texts, optionally merged into one file
Pair synthesis — stitch two languages together: [English audio] [pause] [L2 audio]
Pair batch — batch process vocabulary lists as stitched pairs
Auto-play — MCP tools play audio immediately after synthesis via afplay
Configurable speech rate — default 90% speed for learner-friendly pacing
93 voices, 41 languages — any voice from the AWS Polly voice list works out of the box

Quick Start

1. Install uv (Python package manager)

If you don't have uv yet:

# macOS / Linux
curl -LsSf https://astral.sh/uv/install.sh | sh

# Windows
powershell -ExecutionPolicy ByPass -c "irm https://astral.sh/uv/install.ps1 | iex"

uv manages Python versions automatically — you don't need to install Python separately.

2. Install langlearn-tts

uv tool install langlearn-tts

This installs the langlearn-tts CLI and langlearn-tts-server MCP server globally.

3. Install ffmpeg

Required for audio stitching (pairs, merged batches). Single synthesis works without it.

# macOS
brew install ffmpeg

# Ubuntu/Debian
sudo apt install ffmpeg

# Windows
winget install ffmpeg

4. Configure AWS credentials

The tool uses AWS Polly, which requires an AWS account with polly:SynthesizeSpeech and polly:DescribeVoices permissions.

Option A — AWS CLI (recommended):

Install the AWS CLI, then:

aws configure

Enter your Access Key ID, Secret Access Key, and region (e.g., us-east-1).

Option B — Environment variables:

export AWS_ACCESS_KEY_ID=your-key
export AWS_SECRET_ACCESS_KEY=your-secret
export AWS_DEFAULT_REGION=us-east-1

Option C — Credentials file (~/.aws/credentials):

[default]
aws_access_key_id = your-key
aws_secret_access_key = your-secret
region = us-east-1

5. Verify

langlearn-tts doctor

All required checks should show ✓. Fix any that show ✗ before continuing.

From source (development)

git clone https://github.com/jmf-pobox/langlearn-tts-mcp.git
cd langlearn-tts-mcp
uv sync --all-extras
uv run langlearn-tts --help

Claude Desktop Setup

Automatic (recommended)

langlearn-tts install

This registers the MCP server with Claude Desktop. Options:

--output-dir PATH — custom audio output directory (default: ~/Claude-Audio)
--uvx-path PATH — override the uvx binary path

Restart Claude Desktop after running install.

Manual

Add to ~/Library/Application Support/Claude/claude_desktop_config.json:

{
  "mcpServers": {
    "langlearn-tts": {
      "command": "/absolute/path/to/uvx",
      "args": ["--from", "langlearn-tts", "langlearn-tts-server"],
      "env": {
        "POLLY_OUTPUT_DIR": "/absolute/path/to/output/directory"
      }
    }
  }
}

Claude Desktop does not inherit your shell PATH. All paths must be absolute. Find your uvx path with which uvx.

The POLLY_OUTPUT_DIR environment variable sets the default output directory. If unset, files are saved to ~/Claude-Audio/.

Restart Claude Desktop after editing the config.

Troubleshooting

langlearn-tts doctor

Checks Python version, ffmpeg, AWS credentials, Polly access, uvx, Claude Desktop config, and output directory. Required checks must pass (exit code 1 on failure); optional checks show ○ markers.

Voices

Any voice from the AWS Polly voice list is supported. Voice names are case-insensitive. The tool queries the Polly API on first use and caches the result.

Common voices for language learning:

Voice	Language	Engine
joanna	English (US)	neural
matthew	English (US)	neural
daniel	German	neural
vicki	German	neural
lucia	Spanish (European)	neural
lupe	Spanish (US)	neural
léa	French	neural
tatyana	Russian	standard
seoyeon	Korean	neural
takumi	Japanese	neural
zhiyu	Chinese (Mandarin)	neural

The engine (neural, standard, generative, long-form) is selected automatically — neural preferred when available.

CLI Usage

# Single synthesis
langlearn-tts synthesize "Guten Morgen" --voice daniel -o morning.mp3

# Custom speech rate (percentage, default 90)
langlearn-tts synthesize "Привет" --voice tatyana --rate 70 -o privet.mp3

# Pair: English + German stitched with a pause
langlearn-tts synthesize-pair "good morning" "Guten Morgen" \
  --voice1 joanna --voice2 daniel -o pair.mp3

# Batch from JSON file (["hello", "world", "good morning"])
langlearn-tts synthesize-batch words.json -d output/

# Batch merged into single file
langlearn-tts synthesize-batch words.json -d output/ --merge --pause 800

# Pair batch from JSON file ([["strong", "stark"], ["house", "Haus"]])
langlearn-tts synthesize-pair-batch pairs.json -d output/

MCP Tools

All four tools are available in Claude Desktop once the server is configured:

Tool	Description
`synthesize`	Single text to MP3
`synthesize_batch`	Multiple texts, optionally merged
`synthesize_pair`	Two texts stitched with a pause
`synthesize_pair_batch`	Multiple pairs, optionally merged

Each tool accepts auto_play (default: true) to play audio immediately after synthesis.

Roadmap

Provider Abstraction Layer

A TTSProvider protocol that decouples CLI/MCP tools from any specific backend. Enables --provider flag, provider auto-detection from API keys, and provider-specific doctor checks.

ElevenLabs Backend

Highest voice quality. 29+ languages, 5,000+ voices, voice cloning. Setup: pip install langlearn-tts[elevenlabs] + ELEVENLABS_API_KEY env var. Free tier: 10K chars/month.

OpenAI TTS Backend

Broadest adoption — most users already have an OpenAI key. 6 built-in voices, 50+ languages. Setup: pip install langlearn-tts[openai] + OPENAI_API_KEY env var. $15/1M chars (tts-1).

Development

# Install with dev dependencies
uv sync --all-extras

# Run tests
uv run pytest tests/ -v

# Linting and formatting
uv run ruff check src/ tests/
uv run ruff format src/ tests/

# Type checking
uv run mypy src/ tests/
uv run pyright src/ tests/

License

MIT

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.6.2

Feb 12, 2026

0.6.1

Feb 12, 2026

0.6.0

Feb 12, 2026

0.5.1

Feb 10, 2026

0.5.0

Feb 10, 2026

0.4.4

Feb 9, 2026

0.4.3

Feb 9, 2026

0.4.2

Feb 9, 2026

0.4.1

Feb 9, 2026

0.4.0

Feb 9, 2026

0.3.2

Feb 9, 2026

0.3.1

Feb 9, 2026

0.3.0

Feb 9, 2026

0.2.0

Feb 9, 2026

This version

0.1.2

Feb 8, 2026

0.1.1

Feb 8, 2026

0.1.0

Feb 8, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

langlearn_tts-0.1.2.tar.gz (13.0 kB view details)

Uploaded Feb 8, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

langlearn_tts-0.1.2-py3-none-any.whl (15.9 kB view details)

Uploaded Feb 8, 2026 Python 3

File details

Details for the file langlearn_tts-0.1.2.tar.gz.

File metadata

Download URL: langlearn_tts-0.1.2.tar.gz
Upload date: Feb 8, 2026
Size: 13.0 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for langlearn_tts-0.1.2.tar.gz
Algorithm	Hash digest
SHA256	`aa2736281af862666ff33c156f890fcf753198f6311c02b3ed67b5ff6fec31a2`
MD5	`9095ebb6f2c4460e6ed43a933acd99ca`
BLAKE2b-256	`e5f744bf36c3683e75e74f2201f3a97d4201e0db0483208b34cc4e5d758cdaaa`

See more details on using hashes here.

File details

Details for the file langlearn_tts-0.1.2-py3-none-any.whl.

File metadata

Download URL: langlearn_tts-0.1.2-py3-none-any.whl
Upload date: Feb 8, 2026
Size: 15.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for langlearn_tts-0.1.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`b8feac9da5b7c5306de42587110579c3722a1a65cb0689a1f8ade8310e5a568f`
MD5	`d9af78167932d9df8d0721280bd9538b`
BLAKE2b-256	`b8e384e18aee2e44b3e9754a2ca4b4605d56a035933176e5355b560615c9a69a`

See more details on using hashes here.

langlearn-tts 0.1.2

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

langlearn-tts

Features

Quick Start

1. Install uv (Python package manager)

2. Install langlearn-tts

3. Install ffmpeg

4. Configure AWS credentials

5. Verify

From source (development)

Claude Desktop Setup

Automatic (recommended)

Manual

Troubleshooting

Voices

CLI Usage

MCP Tools

Roadmap

Provider Abstraction Layer

ElevenLabs Backend

OpenAI TTS Backend

Development

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes