VoiceMode - Voice interaction capabilities for AI assistants (formerly voice-mcp)

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

mbailey

These details have not been verified by PyPI

Project links

Homepage

Project description

VoiceMode

Natural voice conversations with Claude Code (and other MCP capable agents)

VoiceMode enables natural voice conversations with Claude Code. Voice isn't about replacing typing - it's about being available when typing isn't.

Perfect for:

Walking to your next meeting
Cooking while debugging
Giving your eyes a break after hours of screen time
Holding a coffee (or a dog)
Any moment when your hands or eyes are busy

See It In Action

Quick Start

Requirements: Computer with microphone and speakers

Option 1: Claude Code Plugin (Recommended)

The fastest way for Claude Code users to get started:

# Add the VoiceMode marketplace
claude plugin marketplace add mbailey/voicemode

# Install VoiceMode plugin
claude plugin install voicemode@voicemode

## Install dependencies (CLI, Local Voice Services)

/voicemode:install

# Start talking!
/voicemode:converse

Option 2: Python installer package

Installs dependencies and the VoiceMode Python package.

# Install UV package manager (if needed)
curl -LsSf https://astral.sh/uv/install.sh | sh

# Run the installer (sets up dependencies and local voice services)
uvx voice-mode-install

# Add to Claude Code
claude mcp add --scope user voicemode -- uvx --refresh voice-mode

# Optional: Add OpenAI API key as fallback for local services
export OPENAI_API_KEY=your-openai-key

# Start a conversation
claude converse

For manual setup, see the Getting Started Guide.

Features

Natural conversations - speak naturally, hear responses immediately
Works offline - optional local voice services (Whisper STT, Kokoro TTS)
Low latency - fast enough to feel like a real conversation
Smart silence detection - stops recording when you stop speaking
Privacy options - run entirely locally or use cloud services

Compatibility

Platforms: Linux, macOS, Windows (WSL), NixOS Python: 3.10-3.14

Configuration

VoiceMode works out of the box. For customization:

# Set OpenAI API key (if using cloud services)
export OPENAI_API_KEY="your-key"

# Or configure via file
voicemode config edit

See the Configuration Guide for all options.

Remote Agent (Operator)

VoiceMode includes agent management for running headless Claude Code instances that can be woken remotely from the iOS app or web interface.

Quick Start

# Start the operator agent in a tmux session
voicemode agent start

# Check if it's running
voicemode agent status

# Send a message to the operator
voicemode agent send "Hello, please check my calendar"

# Stop the operator
voicemode agent stop

The Operator Concept

The operator is a headless Claude Code instance running in tmux that:

Listens for remote connections from voicemode.dev
Can be woken by the iOS app or web interface
Responds via voice using VoiceMode's TTS/STT capabilities

Think of it like a phone operator - always there to help when called.

Agent Commands

Command	Description
`voicemode agent start`	Start operator in tmux session
`voicemode agent stop`	Send Ctrl-C to stop Claude gracefully
`voicemode agent stop --kill`	Kill the tmux window
`voicemode agent status`	Show running/stopped status
`voicemode agent send "msg"`	Send message (auto-starts if needed)
`voicemode agent send --no-start "msg"`	Send message (fail if not running)

Agent Directory Structure

Agent configuration lives in ~/.voicemode/agents/:

~/.voicemode/agents/
├── voicemode.env       # Shared settings for all agents
├── AGENT.md            # AI entry point
├── CLAUDE.md           # Claude-specific instructions
├── SKILL.md            # Shared behavior
└── operator/           # Default agent
    ├── voicemode.env   # Operator-specific settings
    ├── AGENT.md
    ├── CLAUDE.md
    └── SKILL.md        # Operator behavior

Configuration (voicemode.env)

Agent-specific settings override base settings. Available options:

# Base settings (~/.voicemode/agents/voicemode.env)
VOICEMODE_VOICE=nova           # Default TTS voice
VOICEMODE_SPEED=1.0            # Speech rate

# Operator settings (~/.voicemode/agents/operator/voicemode.env)
VOICEMODE_AGENT_REMOTE=true    # Enable remote connections
VOICEMODE_AGENT_STARTUP_MESSAGE=  # Message sent on startup
VOICEMODE_AGENT_CLAUDE_ARGS=   # Extra args for Claude Code

Permissions Setup (Optional)

To use VoiceMode without permission prompts, add to ~/.claude/settings.json:

{
  "permissions": {
    "allow": [
      "mcp__voicemode__converse",
      "mcp__voicemode__service"
    ]
  }
}

See the Permissions Guide for more options.

Local Voice Services

For privacy or offline use, install local speech services:

Whisper.cpp - Local speech-to-text
Kokoro - Local text-to-speech with multiple voices

These provide the same API as OpenAI, so VoiceMode switches seamlessly between them.

Installation Details

System Dependencies by Platform

Ubuntu/Debian

sudo apt update
sudo apt install -y ffmpeg gcc libasound2-dev libasound2-plugins libportaudio2 portaudio19-dev pulseaudio pulseaudio-utils python3-dev

WSL2 users: The pulseaudio packages above are required for microphone access.

Fedora/RHEL

sudo dnf install alsa-lib-devel ffmpeg gcc portaudio portaudio-devel python3-devel

macOS

brew install ffmpeg node portaudio

NixOS

# Use development shell
nix develop github:mbailey/voicemode

# Or install system-wide
nix profile install github:mbailey/voicemode

Alternative Installation Methods

From source

git clone https://github.com/mbailey/voicemode.git
cd voicemode
uv tool install -e .

NixOS system-wide

# In /etc/nixos/configuration.nix
environment.systemPackages = [
  (builtins.getFlake "github:mbailey/voicemode").packages.${pkgs.system}.default
];

Troubleshooting

Problem	Solution
No microphone access	Check terminal/app permissions. WSL2 needs pulseaudio packages.
UV not found	Run `curl -LsSf https://astral.sh/uv/install.sh \| sh`
OpenAI API error	Verify `OPENAI_API_KEY` is set correctly
No audio output	Check system audio settings and available devices

Save Audio for Debugging

export VOICEMODE_SAVE_AUDIO=true
# Files saved to ~/.voicemode/audio/YYYY/MM/

Documentation

Getting Started - Full setup guide
Configuration - All environment variables
Whisper Setup - Local speech-to-text
Kokoro Setup - Local text-to-speech
Development Setup - Contributing guide

Full documentation: voice-mode.readthedocs.io

License

MIT - A Failmode Project

mcp-name: com.failmode/voicemode

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

mbailey

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

8.6.1

Apr 21, 2026

8.6.0

Apr 16, 2026

8.5.1

Mar 13, 2026

8.5.0

Mar 7, 2026

8.4.0

Mar 5, 2026

8.3.0

Feb 24, 2026

8.2.1

Feb 19, 2026

This version

8.2.0

Feb 13, 2026

8.1.0

Feb 2, 2026

8.0.8

Jan 28, 2026

8.0.7

Jan 28, 2026

8.0.6

Jan 28, 2026

8.0.5

Jan 28, 2026

8.0.4

Jan 28, 2026

8.0.3

Jan 28, 2026

8.0.2

Jan 24, 2026

8.0.1

Jan 24, 2026

8.0.0

Jan 24, 2026

7.4.2

Jan 16, 2026

7.4.1

Jan 16, 2026

7.4.0

Jan 6, 2026

7.3.0

Jan 6, 2026

7.2.0

Jan 5, 2026

7.1.2

Dec 26, 2025

7.1.1

Dec 25, 2025

7.1.0

Dec 24, 2025

7.0.1

Dec 2, 2025

7.0.0

Nov 26, 2025

6.2.0

Nov 24, 2025

6.1.1

Nov 11, 2025

6.1.0

Nov 10, 2025

6.0.5

Oct 26, 2025

6.0.4

Oct 26, 2025

6.0.3

Oct 26, 2025

6.0.2

Oct 26, 2025

6.0.1

Oct 19, 2025

6.0.0

Oct 15, 2025

5.1.9

Oct 13, 2025

5.1.8

Oct 12, 2025

5.1.7

Oct 12, 2025

5.1.6

Oct 12, 2025

5.1.5

Oct 12, 2025

5.1.4

Oct 12, 2025

5.1.3

Oct 12, 2025

5.1.2

Oct 12, 2025

5.1.1

Oct 12, 2025

5.1.0

Oct 11, 2025

5.0.3

Oct 4, 2025

5.0.2

Oct 4, 2025

5.0.1

Oct 3, 2025

5.0.0

Oct 3, 2025

4.8.0

Oct 3, 2025

4.7.1

Sep 22, 2025

4.7.0

Sep 22, 2025

4.6.0

Sep 21, 2025

4.5.0

Sep 17, 2025

4.4.0

Sep 10, 2025

4.3.2

Sep 2, 2025

4.3.1

Sep 2, 2025

4.3.0

Sep 22, 2025

4.2.0

Sep 2, 2025

4.1.0

Aug 31, 2025

4.0.1

Aug 31, 2025

3.34.3

Aug 26, 2025

2.34.2

Aug 26, 2025

2.34.1

Aug 26, 2025

2.34.0

Aug 26, 2025

2.33.4

Aug 25, 2025

2.33.3

Aug 25, 2025

2.33.2

Aug 25, 2025

2.33.0

Aug 25, 2025

2.32.0

Aug 24, 2025

2.31.0

Aug 24, 2025

2.30.0

Aug 24, 2025

2.29.0

Aug 24, 2025

2.28.3

Aug 24, 2025

2.28.2

Aug 24, 2025

2.28.1

Aug 24, 2025

2.28.0

Aug 23, 2025

2.27.0

Aug 20, 2025

2.26.0

Aug 18, 2025

2.25.1

Aug 17, 2025

2.25.0

Aug 17, 2025

2.24.0

Aug 16, 2025

2.23.0

Aug 16, 2025

2.22.3

Aug 16, 2025

2.22.2

Aug 16, 2025

2.22.1

Aug 16, 2025

2.22.0

Aug 16, 2025

2.21.1

Aug 12, 2025

2.21.0

Aug 12, 2025

2.20.1

Aug 11, 2025

2.20.0

Aug 10, 2025

2.19.0

Aug 9, 2025

2.18.0

Aug 9, 2025

2.17.3

Aug 6, 2025

2.17.2

Jul 28, 2025

2.17.1

Jul 28, 2025

2.17.0

Jul 28, 2025

2.16.0 yanked

Jul 27, 2025

Reason this release was yanked:

bug

2.15.0

Jul 22, 2025

2.14.0

Jul 20, 2025

2.13.0

Jul 14, 2025

2.12.0

Jul 6, 2025

2.11.0

Jul 5, 2025

2.10.0

Jul 5, 2025

2.9.0

Jul 3, 2025

2.8.0

Jul 3, 2025

2.7.1

Jul 2, 2025

2.7.0

Jul 2, 2025

2.6.0

Jun 29, 2025

2.5.1

Jun 27, 2025

2.4.1

Jun 25, 2025

2.4.0

Jun 24, 2025

2.3.0

Jun 23, 2025

2.2.0

Jun 22, 2025

2.1.3

Jun 20, 2025

2.1.1

Jun 20, 2025

2.1.0

Jun 20, 2025

2.0.3

Jun 19, 2025

0.1.26

Jun 17, 2025

0.1.25

Jun 17, 2025

0.1.24

Jun 17, 2025

0.1.22

Jun 17, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

voice_mode-8.2.0.tar.gz (1.1 MB view details)

Uploaded Feb 13, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

voice_mode-8.2.0-py3-none-any.whl (1.1 MB view details)

Uploaded Feb 13, 2026 Python 3

File details

Details for the file voice_mode-8.2.0.tar.gz.

File metadata

Download URL: voice_mode-8.2.0.tar.gz
Upload date: Feb 13, 2026
Size: 1.1 MB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for voice_mode-8.2.0.tar.gz
Algorithm	Hash digest
SHA256	`34fd9cfdf1755fef49967952bdad5b459f468d8a5b8233901515b58734224854`
MD5	`54b3dcc109ed7abcb3c8921b381e8603`
BLAKE2b-256	`28de4b87ab70a32d2afb9032deeb75d30591e44512c1940bac48b47d46195986`

See more details on using hashes here.

Provenance

The following attestation bundles were made for voice_mode-8.2.0.tar.gz:

Publisher: publish-pypi-and-mcp.yml on mbailey/voicemode

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: voice_mode-8.2.0.tar.gz
- Subject digest: 34fd9cfdf1755fef49967952bdad5b459f468d8a5b8233901515b58734224854
- Sigstore transparency entry: 951589672
- Sigstore integration time: Feb 13, 2026
Source repository:
- Permalink: mbailey/voicemode@d01140749649f82dff36a5f133ac3535e2b90f80
- Branch / Tag: refs/tags/v8.2.0
- Owner: https://github.com/mbailey
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish-pypi-and-mcp.yml@d01140749649f82dff36a5f133ac3535e2b90f80
- Trigger Event: push

File details

Details for the file voice_mode-8.2.0-py3-none-any.whl.

File metadata

Download URL: voice_mode-8.2.0-py3-none-any.whl
Upload date: Feb 13, 2026
Size: 1.1 MB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for voice_mode-8.2.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`3f4dbda7639bfbf32d05aac20a58ba976b9d1a3351931cf84e37ea44b6b66a5c`
MD5	`0c872b81e38c8e5713a09d3e90fc67d9`
BLAKE2b-256	`de4e81861bb688c6a055abac7fc806ec674e364d584ba753091a60f2f61cbab7`

See more details on using hashes here.

Provenance

The following attestation bundles were made for voice_mode-8.2.0-py3-none-any.whl:

Publisher: publish-pypi-and-mcp.yml on mbailey/voicemode

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: voice_mode-8.2.0-py3-none-any.whl
- Subject digest: 3f4dbda7639bfbf32d05aac20a58ba976b9d1a3351931cf84e37ea44b6b66a5c
- Sigstore transparency entry: 951589918
- Sigstore integration time: Feb 13, 2026
Source repository:
- Permalink: mbailey/voicemode@d01140749649f82dff36a5f133ac3535e2b90f80
- Branch / Tag: refs/tags/v8.2.0
- Owner: https://github.com/mbailey
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish-pypi-and-mcp.yml@d01140749649f82dff36a5f133ac3535e2b90f80
- Trigger Event: push

voice-mode 8.2.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

VoiceMode

See It In Action

Quick Start

Option 1: Claude Code Plugin (Recommended)

Option 2: Python installer package

Features

Compatibility

Configuration

Remote Agent (Operator)

Quick Start

The Operator Concept

Agent Commands

Agent Directory Structure

Configuration (voicemode.env)

Permissions Setup (Optional)

Local Voice Services

Installation Details

Ubuntu/Debian

Fedora/RHEL

macOS

NixOS

From source

NixOS system-wide

Troubleshooting

Save Audio for Debugging

Documentation

Links

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance