Cartesia TTS integration for Vision Agents

These details have not been verified by PyPI

Project links

Project description

Cartesia

Cartesia is a service that provides Speech-to-Text (STT) and Text-to-Speech (TTS) capabilities. It's designed for real-time voice applications, making it ideal for voice AI agents, transcription pipelines, and conversational interfaces.

The Cartesia plugin for the Stream Python AI SDK allows you to add TTS functionality to your project.

Installation

Install the Stream Cartesia plugin with uv:

uv add "vision-agents[cartesia]"
# or directly
uv add vision-agents-plugins-cartesia
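After installing, it can be useful to confirm which version of the plugin ended up in the environment. The following is a minimal sketch using only the standard library; the helper name `installed_version` is hypothetical, and it assumes the distribution is registered under the name used in the install command above.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(dist_name: str = "vision-agents-plugins-cartesia"):
    """Return the installed version string, or None if the distribution is missing."""
    try:
        return version(dist_name)
    except PackageNotFoundError:
        return None


# Prints the version if the plugin is installed, otherwise a short notice.
print(installed_version() or "plugin not installed in this environment")
```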

Examples

Read on for some key details and check out our Cartesia examples to see working code samples:

In tts.py, a simple bot greets users when they join a call.
In narrator-example.py, a well-prompted STT -> LLM -> TTS pipeline uses Cartesia's Sonic 3 model to narrate a creative story based on the user's input.

Initialisation

The Cartesia plugin for Stream exists in the form of the TTS class:

from vision_agents.plugins import cartesia

tts = cartesia.TTS()

To initialise without passing in the API key, make sure `CARTESIA_API_KEY` is available as an environment variable. You can do this either by defining it in a `.env` file that your application loads or by exporting it directly in your terminal.
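The key lookup described above can be made explicit so that a missing key fails early with a clear message. This is a sketch under stated assumptions: `resolve_cartesia_api_key` and `make_tts` are hypothetical helpers, not part of the plugin, and only `cartesia.TTS`, its `api_key` parameter, and the `CARTESIA_API_KEY` variable come from this page.

```python
import os


def resolve_cartesia_api_key(explicit_key=None, env=None):
    """Prefer an explicitly passed key, then fall back to CARTESIA_API_KEY."""
    env = os.environ if env is None else env
    key = explicit_key or env.get("CARTESIA_API_KEY")
    if not key:
        raise RuntimeError(
            "Set CARTESIA_API_KEY or pass api_key=... to cartesia.TTS()"
        )
    return key


def make_tts(explicit_key=None):
    """Hypothetical factory: build the TTS object with a key resolved up front."""
    # Imported lazily so this module can be loaded without the plugin installed.
    from vision_agents.plugins import cartesia

    return cartesia.TTS(api_key=resolve_cartesia_api_key(explicit_key))
```

Calling `make_tts()` with no argument relies on the environment variable; `make_tts("my-key")` bypasses it.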

Parameters

These are the parameters you can customise on the Cartesia `TTS` class:

Name	Type	Default	Description
`api_key`	`str` or `None`	`None`	Your Cartesia API key. If not provided, the plugin will look for the `CARTESIA_API_KEY` environment variable.
`model_id`	`str`	`"sonic-3"`	ID of the Cartesia TTS model to use. Defaults to Sonic 3.
`voice_id`	`str` or `None`	`"f9836c6e-a0bd-460e-9d3c-f7299fa60f94"`	ID of the voice to use for TTS responses.
`sample_rate`	`int`	`16000`	Sample rate (in Hz) used for audio processing.
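One way to keep these options together is a small settings object whose defaults mirror the table above. This is only a sketch: `CartesiaTTSSettings` and `make_tts` are hypothetical names, the `24000` override is an illustrative value rather than a recommendation, and the keyword names are taken from the table.

```python
from dataclasses import asdict, dataclass
from typing import Optional


@dataclass(frozen=True)
class CartesiaTTSSettings:
    """Defaults copied from the parameter table; not part of the plugin itself."""

    api_key: Optional[str] = None
    model_id: str = "sonic-3"
    voice_id: Optional[str] = "f9836c6e-a0bd-460e-9d3c-f7299fa60f94"
    sample_rate: int = 16000

    def as_kwargs(self) -> dict:
        """Return the settings as keyword arguments for cartesia.TTS()."""
        return asdict(self)


def make_tts(settings: CartesiaTTSSettings):
    """Hypothetical factory that forwards the settings to the plugin."""
    # Imported lazily so this module can be loaded without the plugin installed.
    from vision_agents.plugins import cartesia

    return cartesia.TTS(**settings.as_kwargs())


# Override only the sample rate; every other value keeps its documented default.
settings = CartesiaTTSSettings(sample_rate=24000)
print(settings.as_kwargs())
```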

Functionality

Send text to convert to speech

The `send()` method passes the given text to Cartesia for synthesis. The resulting audio is then played through the configured output track.

tts.send("Demo text you want AI voice to say")

Project details

Release history

Version	Release date
0.5.4	Apr 15, 2026
0.5.3	Apr 14, 2026
0.5.2	Apr 13, 2026
0.5.1	Apr 7, 2026 (this version)
0.5.0	Apr 1, 2026
0.4.7	Mar 27, 2026
0.4.6	Mar 26, 2026
0.4.5	Mar 25, 2026
0.4.4	Mar 23, 2026
0.4.3	Mar 11, 2026
0.4.2	Mar 10, 2026
0.4.1	Mar 4, 2026
0.4.0	Mar 3, 2026
0.3.8	Feb 24, 2026
0.3.7	Feb 23, 2026
0.3.6	Feb 13, 2026
0.3.5	Feb 10, 2026
0.3.4	Feb 6, 2026
0.3.3	Feb 4, 2026
0.3.2	Jan 27, 2026
0.3.1	Jan 21, 2026
0.3.0	Jan 20, 2026
0.2.10	Jan 14, 2026
0.2.9	Jan 9, 2026
0.2.8	Jan 8, 2026
0.2.7	Jan 6, 2026
0.2.6	Dec 16, 2025
0.2.5	Dec 12, 2025
0.2.4	Dec 12, 2025
0.2.3	Dec 7, 2025
0.2.2	Nov 29, 2025
0.2.1	Nov 21, 2025
0.2.0	Nov 14, 2025
0.1.14	Nov 11, 2025
0.1.13	Nov 3, 2025
0.1.12	Oct 31, 2025
0.1.11	Oct 28, 2025
0.1.9	Oct 22, 2025
0.1.8	Oct 22, 2025
0.1.7	Oct 21, 2025
0.1.6	Oct 16, 2025
0.1.5	Oct 9, 2025
0.1.3	Oct 9, 2025
0.1.0	Oct 9, 2025
0.0.18	Oct 8, 2025
0.0.17	Oct 8, 2025
0.0.12	Oct 8, 2025

Download files

Source Distribution

vision_agents_plugins_cartesia-0.5.1.tar.gz (4.0 kB)
Upload date: Apr 7, 2026
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.10.6

Algorithm	Hash digest
SHA256	`b6997cbe8829be029b8ce41ea1852bd4aa83e100b667d01a57350bf22b687903`
MD5	`6f5df4a88b6a4f581683482d04ada02d`
BLAKE2b-256	`3bb622bb2d1256a2306e76e4d54d80f35c1f6a4f0204db513de4a5acaa17e925`

Built Distribution

vision_agents_plugins_cartesia-0.5.1-py3-none-any.whl (8.6 kB)
Upload date: Apr 7, 2026
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.10.6

Algorithm	Hash digest
SHA256	`34ee228232adbdf024e890f998a3fdc3d50af925c33e77d4e9b814a6746b717e`
MD5	`6fd6f29d12f7663565b495c0cdf8fea2`
BLAKE2b-256	`e1b2acf433568bb43f68f19c067899f288a1a1f2db1d3ee6f65dd888bec23507`
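The SHA256 digests listed above can be used to check a manually downloaded file before installing it. The sketch below uses only the standard library; the helper names `sha256_of` and `matches_published_hash` are hypothetical, and it assumes the files were saved in the current directory under their published names.

```python
import hashlib
from pathlib import Path

# SHA256 digests copied from the file listings above.
EXPECTED_SHA256 = {
    "vision_agents_plugins_cartesia-0.5.1.tar.gz":
        "b6997cbe8829be029b8ce41ea1852bd4aa83e100b667d01a57350bf22b687903",
    "vision_agents_plugins_cartesia-0.5.1-py3-none-any.whl":
        "34ee228232adbdf024e890f998a3fdc3d50af925c33e77d4e9b814a6746b717e",
}


def sha256_of(path, chunk_size: int = 65536) -> str:
    """Stream the file in chunks and return its SHA256 hex digest."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path) -> bool:
    """True only if the file name is known and its digest matches the listing."""
    expected = EXPECTED_SHA256.get(Path(path).name)
    return expected is not None and sha256_of(path) == expected


# Check whichever of the two files are present; absent files are skipped.
for name in EXPECTED_SHA256:
    if Path(name).exists():
        print(name, "OK" if matches_published_hash(name) else "MISMATCH")
```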

vision-agents-plugins-cartesia 0.5.1
