# Pipecat TTS Cache: Zero-Latency Audio Synthesis

TTS caching integration for Pipecat to reduce API costs on repeated phrases.
Pipecat TTS Cache is a lightweight caching layer for the Pipecat ecosystem. It transparently wraps existing TTS services to eliminate API costs for repeated phrases and reduce response latency to <5ms.
See it in action: Watch the Demo Video
## 🚀 Key Features
- Ultra-Low Latency – Delivers cached audio in ~0.1ms (Memory) or ~1-5ms (Redis).
- Cost Reduction – Stop paying your TTS provider for common phrases like "Hello," "One moment," or "I didn't catch that."
- Universal Compatibility – Works as a Mixin with all Pipecat TTS services (Cartesia, ElevenLabs, Deepgram, Google, etc.).
- Smart Interruption – Automatically clears pending cache tasks and resets state when users interrupt the bot.
- Precision Alignment – Preserves word-level timestamps for perfect lip-syncing and subtitles, even on cached replays.
## 📦 Installation

```bash
# Standard installation (Memory backend only)
pip install pipecat-tts-cache

# Production installation (with Redis support)
pip install "pipecat-tts-cache[redis]"
```
## 🧩 Service Compatibility

The caching layer intelligently handles different TTS architectures to ensure smooth playback regardless of the provider.

| Service Type | Caching Strategy | Supported Providers (Examples) |
|---|---|---|
| `AudioContextWordTTS` | Batch caching: splits audio at word boundaries and caches individual sentences. | Cartesia, Rime |
| `WordTTSService` | Full caching with timestamps: caches the full response and preserves alignment data. | ElevenLabs, Hume |
| `TTSService` | Standard caching: caches the full audio response (no alignment data). | Google, OpenAI, Deepgram (HTTP) |
| `InterruptibleTTS` | Sentence caching: caches single-sentence responses only. | Sarvam, Deepgram (WebSocket) |
## 🛠️ Usage

### 1. Basic In-Memory Cache (Development)

The `MemoryCacheBackend` is perfect for local development or single-process bots. It uses an LRU (Least Recently Used) eviction policy.

```python
from pipecat_tts_cache import TTSCacheMixin, MemoryCacheBackend
from pipecat.services.google.tts import GoogleHttpTTSService

# 1. Create a cached class using the Mixin
class CachedGoogleTTS(TTSCacheMixin, GoogleHttpTTSService):
    pass

# 2. Initialize with the memory backend
tts = CachedGoogleTTS(
    voice_id="en-US-Chirp3-HD-Charon",
    cache_backend=MemoryCacheBackend(max_size=1000),
    cache_ttl=86400,  # Cache for 24 hours
)
```
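The LRU eviction policy described above can be sketched roughly as follows. This is a hypothetical illustration (the `LRUCache` class below is not the package's API, and `MemoryCacheBackend`'s real internals may differ): recently used entries stay, and the least recently used entry is evicted once `max_size` is exceeded.

```python
import time
from collections import OrderedDict


class LRUCache:
    """Minimal LRU cache with TTL, illustrating the eviction policy
    a memory backend might use. Illustrative only."""

    def __init__(self, max_size: int = 1000, ttl: float = 86400.0):
        self._store: "OrderedDict[str, tuple]" = OrderedDict()
        self.max_size = max_size
        self.ttl = ttl

    def get(self, key: str):
        entry = self._store.get(key)
        if entry is None:
            return None
        expires_at, audio = entry
        if time.monotonic() > expires_at:
            del self._store[key]  # drop expired entry
            return None
        self._store.move_to_end(key)  # mark as most recently used
        return audio

    def set(self, key: str, audio: bytes) -> None:
        self._store[key] = (time.monotonic() + self.ttl, audio)
        self._store.move_to_end(key)
        if len(self._store) > self.max_size:
            self._store.popitem(last=False)  # evict least recently used
```

Accessing an entry refreshes its position, so frequently replayed phrases ("Hello," "One moment") tend to stay cached.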
### 2. Distributed Redis Cache (Production)

For production deployments, use `RedisCacheBackend`. This allows the cache to persist across restarts and be shared among multiple bot instances.

```python
from pipecat_tts_cache.backends import RedisCacheBackend

tts = CachedGoogleTTS(
    voice_id="en-US-Chirp3-HD-Charon",
    cache_backend=RedisCacheBackend(
        redis_url="redis://localhost:6379/0",
        key_prefix="pipecat:tts:",
    ),
    cache_ttl=604800,  # Cache for 1 week
)
```
## 🧠 How It Works

The system uses a frame-interception architecture to integrate seamlessly with the Pipecat pipeline:

1. **Deterministic key generation**: Before requesting audio, a unique key is generated from the normalized text, voice ID, model, speed, and pitch. Sensitive data (API keys) is excluded.
2. **Cache check** (`run_tts`):
   - **Hit**: The system immediately pushes cached audio frames and timestamps to the pipeline.
   - **Miss**: The system calls the parent TTS service.
3. **Collection** (`push_frame`): As the parent service generates audio, the Mixin intercepts the frames, aggregates them, and stores them in the backend for future use.
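The deterministic key generation step can be sketched like this. The exact fields, normalization, and hash the library uses are assumptions here; `make_cache_key` is illustrative, not the package's API:

```python
import hashlib
import json


def make_cache_key(text: str, voice_id: str, model: str,
                   speed: float = 1.0, pitch: float = 0.0,
                   prefix: str = "pipecat:tts:") -> str:
    """Build a deterministic cache key from synthesis parameters.
    API keys and other secrets are deliberately excluded."""
    # Collapse whitespace and case-fold so trivially different
    # spellings of the same phrase share one cache entry.
    normalized = " ".join(text.split()).lower()
    payload = json.dumps(
        {"text": normalized, "voice": voice_id, "model": model,
         "speed": speed, "pitch": pitch},
        sort_keys=True,  # stable field order -> stable hash
    )
    return prefix + hashlib.sha256(payload.encode("utf-8")).hexdigest()
```

Because the key depends only on synthesis parameters, any bot instance that asks for the same phrase with the same voice settings hits the same entry.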
### Interruption Handling

When an `InterruptionFrame` is received, the cache mixin immediately:

- Clears all pending cache write tasks.
- Resets the internal batch state.
- Ensures no partial or cut-off audio is committed to the pipeline.
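A rough sketch of that interruption path, assuming pending cache writes are tracked as asyncio tasks (all names below are illustrative, not the mixin's real internals):

```python
import asyncio


class InterruptionSketch:
    """Illustrates clearing pending cache writes on interruption."""

    def __init__(self):
        self._pending_writes = set()  # in-flight cache write tasks
        self._batch = []              # audio frames collected so far

    def track_write(self, coro) -> None:
        """Start a cache write and remember it so it can be cancelled."""
        task = asyncio.ensure_future(coro)
        self._pending_writes.add(task)
        task.add_done_callback(self._pending_writes.discard)

    def on_interruption(self) -> None:
        """User barged in: drop unfinished writes and partial audio."""
        for task in self._pending_writes:
            task.cancel()
        self._pending_writes.clear()
        self._batch.clear()  # never commit cut-off audio
```

Cancelling before clearing ensures a half-collected utterance is never written to the backend, so a later cache hit can't replay truncated audio.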
## 📊 Management & Stats

You can monitor cache performance or clear entries programmatically.

```python
# Check performance
stats = await tts.get_cache_stats()
print(f"Hit Rate: {stats['hit_rate']:.1%}")
print(f"Total Saved Calls: {stats['hits']}")

# Maintenance
await tts.clear_cache()                      # Clear all entries
await tts.clear_cache(namespace="user_123")  # Clear a specific namespace
```
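The hit rate reported above is just `hits / (hits + misses)`. A minimal counter illustrating the math (an assumption about how the stats are derived, not the package's internals):

```python
class CacheStats:
    """Tracks hits and misses and reports the hit rate."""

    def __init__(self):
        self.hits = 0
        self.misses = 0

    def record(self, hit: bool) -> None:
        if hit:
            self.hits += 1
        else:
            self.misses += 1

    @property
    def hit_rate(self) -> float:
        total = self.hits + self.misses
        return self.hits / total if total else 0.0
```

Each hit is one TTS API call avoided, which is why `stats['hits']` doubles as a "total saved calls" figure.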
## ⚡ Performance
| Metric | Direct API | Memory Cache | Redis Cache |
|---|---|---|---|
| Latency | 200ms - 1500ms | ~0.1ms | ~2ms |
| Cost | $ per character | $0 | $0 |
| Consistency | Variable | Deterministic | Deterministic |
## Running the Example

### Prerequisites

```bash
# Install with example dependencies
pip install "pipecat-tts-cache[examples]"

# Optional: Install with Redis support
pip install "pipecat-tts-cache[examples,redis]"

# Set environment variables
export DEEPGRAM_API_KEY=your_key
export CARTESIA_API_KEY=your_key
export GOOGLE_API_KEY=your_key

# Optional: For the Redis backend
export USE_REDIS_CACHE=true
export REDIS_URL=redis://localhost:6379/0
```

### Option 1: Daily Bots (Recommended)

```bash
# Start the bot server
python examples/basic_caching.py --host 0.0.0.0 --port 7860

# Connect via Daily Bots or your Daily room
```

### Option 2: Local WebRTC

```bash
# Run with local WebRTC transport
python examples/basic_caching.py -t webrtc --host localhost --port 8765
```
## Compatibility

| Pipecat Version | Status |
|---|---|
| v0.0.91+ | ✅ Tested |
## 🛟 Getting help