Skip to main content

Pocket TTS integration for Vision Agents - lightweight CPU-based text-to-speech

Project description

Pocket TTS Plugin

A lightweight Text-to-Speech (TTS) plugin for Vision Agents powered by Kyutai's Pocket TTS model. Runs efficiently on CPU with low latency (~200ms) and supports voice cloning.

Features

  • Runs on CPU - no GPU required
  • Small model size (100M parameters)
  • Low latency (~200ms to first audio)
  • Voice cloning support
  • Built-in voice selection

Installation

uv add "vision-agents[pocket]"
# or directly
uv add vision-agents-plugins-pocket

Usage

from vision_agents.plugins import pocket

# Create TTS with default voice
tts = pocket.TTS()

# Or specify a built-in voice
tts = pocket.TTS(voice="marius")

# Or use a custom voice for cloning
tts = pocket.TTS(voice="path/to/your/voice.wav")

Configuration

Parameter Description Values
voice Built-in voice name or path to custom wav file "alba" (default), "marius", "javert", "jean", "fantine", "cosette", "eponine", "azelma", or custom path

Built-in Voices

  • alba - Default voice
  • marius
  • javert
  • jean
  • fantine
  • cosette
  • eponine
  • azelma

Dependencies

  • pocket-tts>=0.1.0
  • PyTorch 2.5+

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vision_agents_plugins_pocket-0.4.5.tar.gz (3.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

vision_agents_plugins_pocket-0.4.5-py3-none-any.whl (8.7 kB view details)

Uploaded Python 3

File details

Details for the file vision_agents_plugins_pocket-0.4.5.tar.gz.

File metadata

  • Download URL: vision_agents_plugins_pocket-0.4.5.tar.gz
  • Upload date:
  • Size: 3.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.10.6 {"installer":{"name":"uv","version":"0.10.6","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for vision_agents_plugins_pocket-0.4.5.tar.gz
Algorithm Hash digest
SHA256 476b481be377d8b391e67ca74c080f8b61a22ccd97941707cc904223df6dae8b
MD5 3bc6916056e32a1c82b89a2dcc685985
BLAKE2b-256 84bfbed1eb02dd44f69a49fc4ca68da1a56e8a8ad76991596a90433b340a6cc7

See more details on using hashes here.

File details

Details for the file vision_agents_plugins_pocket-0.4.5-py3-none-any.whl.

File metadata

  • Download URL: vision_agents_plugins_pocket-0.4.5-py3-none-any.whl
  • Upload date:
  • Size: 8.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.10.6 {"installer":{"name":"uv","version":"0.10.6","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for vision_agents_plugins_pocket-0.4.5-py3-none-any.whl
Algorithm Hash digest
SHA256 e6410e6b85c196e97ad07e43f16ee7d6bf83a9afc4d0a7943a8db1e6a3a1320b
MD5 2ac081f12de484c1310dbbdbf182a153
BLAKE2b-256 457e13685285f9e5fef42c1fc006a1cd52ee9090c929135ecd2f54de43401f9c

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page