LiveKit Agents Plugin for services from AWS

These details have not been verified by PyPI

Project links

Project description

AWS Plugin for LiveKit Agents

Complete AWS AI integration for LiveKit Agents, including Bedrock, Polly, Transcribe, and realtime speech-to-speech support for Amazon Nova Sonic

What's included:

RealtimeModel - Amazon Nova 2 Sonic and Nova Sonic 1.0 for speech-to-speech
LLM - Powered by Amazon Bedrock, defaults to Nova 2 Lite
STT - Powered by Amazon Transcribe
TTS - Powered by Amazon Polly

See https://docs.livekit.io/agents/integrations/aws/ for more information.

⚠️ Breaking Change

Default model changed to Nova 2 Sonic: RealtimeModel() now defaults to amazon.nova-2-sonic-v1:0 with modalities="mixed" (was amazon.nova-sonic-v1:0 with modalities="audio").

If you need the previous behavior, explicitly specify Nova Sonic 1.0:

model = aws.realtime.RealtimeModel.with_nova_sonic_1()
# or
model = aws.realtime.RealtimeModel(
    model="amazon.nova-sonic-v1:0",
    modalities="audio"
)

Installation

pip install livekit-plugins-aws

# For Nova Sonic realtime models
pip install livekit-plugins-aws[realtime]

Prerequisites

AWS Credentials

You'll need AWS credentials with access to Amazon Bedrock. Set them as environment variables:

export AWS_ACCESS_KEY_ID=<your-access-key>
export AWS_SECRET_ACCESS_KEY=<your-secret-key>
export AWS_DEFAULT_REGION=us-east-1  # or your preferred region

Getting Temporary Credentials from SSO (Local Testing)

If you use AWS SSO for authentication, get temporary credentials for local testing:

# Login to your SSO profile
aws sso login --profile your-profile-name

# Export credentials from your SSO session
eval $(aws configure export-credentials --profile your-profile-name --format env)

# Verify credentials are set
aws sts get-caller-identity

Alternatively, add this to your shell profile for automatic credential export:

# Add to ~/.bashrc or ~/.zshrc
function aws-creds() {
    eval $(aws configure export-credentials --profile $1 --format env)
}

# Usage: aws-creds your-profile-name

Quick Start Example

The realtime_joke_teller.py example demonstrates both realtime and pipeline modes:

Demonstrates Both Modes

Realtime mode: Nova 2 Sonic for end-to-end speech-to-speech
Pipeline mode: Amazon Transcribe + Nova 2 Lite + Amazon Polly

Demonstrates Nova 2 Sonic Capabilities

Text prompting: Agent greets users first using generate_reply()
Multilingual support: Automatic language detection and response in 7 languages
Multiple voices: 18 expressive voices across languages
Function calling: Weather lookup, web search, and joke telling

Setup

Install dependencies:

pip install livekit-plugins-aws[realtime] \
            livekit-plugins-silero \
            jokeapi \
            duckduckgo-search \
            python-weather \
            python-dotenv

Copy the example locally:

curl -O https://raw.githubusercontent.com/livekit/agents/main/examples/voice_agents/realtime_joke_teller.py

Set up environment variables:

# Create .env file
echo "AWS_DEFAULT_REGION=us-east-1" > .env
# Add your AWS credentials (see Prerequisites above)

(Optional) Run local LiveKit server:

For testing without LiveKit Cloud, run a local server:

# Install LiveKit server
brew install livekit  # macOS
# or download from https://github.com/livekit/livekit/releases

# Run in dev mode
livekit-server --dev

Add to your .env file:

LIVEKIT_URL=wss://127.0.0.1:7880
LIVEKIT_API_KEY=devkey
LIVEKIT_API_SECRET=secret

See self-hosting documentation for more details.

Running the Example

Realtime Mode (Nova 2 Sonic) - Recommended for testing:

python realtime_joke_teller.py console

This runs locally using your computer's speakers and microphone. Use a headset to prevent echo.

Multilingual Support: Nova 2 Sonic automatically detects and responds in your language. Just start speaking in your preferred language (English, French, Italian, German, Spanish, Portuguese, or Hindi) and Nova 2 Sonic will respond in the same language!

Pipeline Mode (Transcribe + Nova Lite + Polly):

python realtime_joke_teller.py console --mode pipeline

Dev Mode (connect to LiveKit room for remote testing):

python realtime_joke_teller.py dev
# or
python realtime_joke_teller.py dev --mode pipeline

Try asking:

"What's the weather in Seattle?"
"Tell me a programming joke"
"Search for information about my favorite movie, Short Circuit"

Features

Nova 2 Sonic Capabilities

Amazon Nova 2 Sonic is a unified speech-to-speech foundation model that delivers:

Realtime bidirectional streaming - Low-latency, natural conversations
Multilingual support - English, French, Italian, German, Spanish, Portuguese, and Hindi
Automatic language mirroring - Responds in the user's spoken language
Polyglot voices - Matthew and Tiffany can seamlessly switch between languages within a single conversation, ideal for multilingual applications
18 expressive voices - Multiple voices per language with natural prosody
Function calling - Built-in tool use and agentic workflows
Interruption handling - Graceful handling without losing context
Noise robustness - Works in real-world environments
Text input support - Programmatic text prompting

Model Selection

from livekit.plugins import aws

# Nova 2 Sonic (audio + text input, latest)
model = aws.realtime.RealtimeModel.with_nova_sonic_2()

# Nova Sonic 1.0 (audio-only, original model)
model = aws.realtime.RealtimeModel.with_nova_sonic_1()

Voice Selection

Voices are specified as lowercase strings. Import SONIC1_VOICES or SONIC2_VOICES type hints for IDE autocomplete.

from livekit.plugins.aws.experimental.realtime import SONIC2_VOICES

model = aws.realtime.RealtimeModel.with_nova_sonic_2(
    voice="carolina"  # Portuguese, feminine
)

Nova 2 Sonic Voice IDs (18 voices)

See official documentation for most up-to-date list and IDs.

English (US): tiffany (polyglot), matthew (polyglot)
English (UK): amy
English (Australia): olivia
English (India): kiara, arjun
French: ambre, florian
Italian: beatrice, lorenzo
German: tina, lennart
Spanish (US): lupe, carlos
Portuguese (Brazil): carolina, leo
Hindi: kiara, arjun

Note: Tiffany abd Matthew in Nova 2 Sonic support polyglot mode, seamlessly switching between languages within a single conversation.

Nova Sonic 1.0 Voice IDs (11 voices)

See official documentation for most up-to-date list and IDs.

English (US): tiffany, matthew
English (UK): amy
French: ambre, florian
Italian: beatrice, lorenzo
German: greta, lennart
Spanish: lupe, carlos

Text Prompting with `generate_reply()`

Nova 2 Sonic supports programmatic text input. This can be used to trigger agent responses or to mix speech and text input within a UI in the same conversation:

class Assistant(Agent):
    async def on_enter(self):
        # Make the agent speak first with a greeting
        await self.session.generate_reply(
            instructions="Greet the user and introduce your capabilities"
        )

`instructions` vs `user_input`

The generate_reply() method accepts two parameters with different behaviors:

instructions - System-level commands (recommended):

await session.generate_reply(
    instructions="Greet the user warmly and ask how you can help"
)

Sent as a system prompt/command to the model
Triggers immediate generation
Does not appear in conversation history as user message
Use for: Agent-initiated speech, prompting specific behaviors

user_input - Simulated user messages:

await session.generate_reply(
    user_input="Hello, I need help with my account"
)

Sent as interactive USER role content
Added to Nova's conversation context
Triggers generation as if user spoke
Use for: Testing, simulating user input, programmatic conversations

When to use each:

Agent greetings: Use instructions - agent should speak without user input
Guided responses: Use instructions - direct the agent's next action
Simulated conversations: Use user_input - test multi-turn dialogs
Programmatic user input: Use user_input - inject text as if user spoke

Turn-Taking Sensitivity

Control how quickly the agent responds to pauses:

model = aws.realtime.RealtimeModel.with_nova_sonic_2(
    turn_detection="MEDIUM"  # HIGH, MEDIUM (default), LOW
)

HIGH: Fastest response time, optimized for latency. May interrupt slower speakers
MEDIUM: Balanced approach with moderate response time. Reduces false positives while maintaining responsiveness (recommended)
LOW: Slowest response time with maximum patience, better for hesitant speakers

Complete Example

from livekit import agents
from livekit.agents import Agent, AgentSession
from livekit.plugins import aws
from dotenv import load_dotenv


load_dotenv()

class Assistant(Agent):
    def __init__(self):
        super().__init__(
            instructions="You are a helpful voice assistant powered by Amazon Nova 2 Sonic."
        )
    
    async def on_enter(self):
        await self.session.generate_reply(
            instructions="Greet the user and offer assistance"
        )

server = agents.AgentServer()

@server.rtc_session()
async def entrypoint(ctx: agents.JobContext):
    await ctx.connect()
    
    session = AgentSession(
        llm=aws.realtime.RealtimeModel.with_nova_sonic_2(
            voice="matthew",
            turn_detection="MEDIUM",
            tool_choice="auto"
        )
    )
    
    await session.start(room=ctx.room, agent=Assistant())

if __name__ == "__main__":
    agents.cli.run_app(server)

Pipeline Mode (STT + LLM + TTS)

For more control over individual components, use pipeline mode:

from livekit.plugins import aws, silero

session = AgentSession(
    stt=aws.STT(),                    # Amazon Transcribe
    llm=aws.LLM(),                    # Nova 2 Lite (default)
    tts=aws.TTS(),                    # Amazon Polly
    vad=silero.VAD.load(),
)

Nova 2 Lite

Amazon Nova 2 Lite is a fast, cost-effective reasoning model optimized for everyday AI workloads:

Lightning-fast processing - Very low latency for real-time conversations
Cost-effective - Industry-leading price-performance
Multimodal inputs - Text, image, and video (documentation)
1 million token context window - Handle long conversations and complex context (source)
Agentic workflows - RAG systems, function calling, tool use
Fine-tuning support - Customize for your specific use case

Ideal for pipeline mode where you need fast, accurate LLM responses in voice applications.

Resources

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

1.5.5

Apr 20, 2026

1.5.4

Apr 16, 2026

1.5.3

Apr 15, 2026

1.5.2

Apr 8, 2026

1.5.1

Mar 23, 2026

1.5.0

Mar 19, 2026

1.5.0rc2 pre-release

Mar 6, 2026

1.5.0rc1 pre-release

Feb 13, 2026

1.4.6

Mar 16, 2026

1.4.5

Mar 11, 2026

1.4.4

Mar 3, 2026

1.4.3

Feb 23, 2026

This version

1.4.2

Feb 17, 2026

1.4.1

Feb 6, 2026

1.4.0

Feb 6, 2026

1.4.0rc2 pre-release

Jan 23, 2026

1.4.0rc1 pre-release

Dec 23, 2025

1.3.12

Jan 21, 2026

1.3.11

Jan 14, 2026

1.3.10

Dec 23, 2025

1.3.9

Dec 19, 2025

1.3.8

Dec 17, 2025

1.3.7

Dec 16, 2025

1.3.6

Dec 3, 2025

1.3.5

Nov 25, 2025

1.3.4

Nov 24, 2025

1.3.3

Nov 19, 2025

1.3.2

Nov 17, 2025

1.3.1

Nov 17, 2025

1.3.0rc2 pre-release

Nov 15, 2025

1.3.0rc1 pre-release

Nov 6, 2025

1.2.18

Nov 5, 2025

1.2.17

Oct 29, 2025

1.2.16

Oct 27, 2025

1.2.15

Oct 15, 2025

1.2.14

Oct 1, 2025

1.2.13

Oct 1, 2025

1.2.12

Sep 29, 2025

1.2.11

Sep 18, 2025

1.2.10

Sep 18, 2025

1.2.9

Sep 15, 2025

1.2.8

Sep 2, 2025

1.2.7

Aug 28, 2025

1.2.6

Aug 18, 2025

1.2.5

Aug 10, 2025

1.2.4

Aug 7, 2025

1.2.3

Aug 4, 2025

1.2.2

Jul 24, 2025

1.2.1

Jul 17, 2025

1.2.0

Jul 17, 2025

1.1.7

Jul 15, 2025

1.1.6

Jul 10, 2025

1.1.5

Jun 30, 2025

1.1.4

Jun 25, 2025

1.1.3

Jun 21, 2025

1.1.2

Jun 20, 2025

1.1.1

Jun 10, 2025

1.1.0

Jun 10, 2025

1.0.23

May 29, 2025

1.0.22

May 17, 2025

1.0.21

May 15, 2025

1.0.20

May 8, 2025

1.0.19

May 3, 2025

1.0.18

May 1, 2025

1.0.17

Apr 24, 2025

1.0.16

Apr 22, 2025

1.0.15

Apr 22, 2025

1.0.14

Apr 22, 2025

1.0.13

Apr 15, 2025

1.0.12

Apr 15, 2025

1.0.11

Apr 10, 2025

1.0.10

Apr 10, 2025

1.0.2

Apr 10, 2025

1.0.1

Apr 10, 2025

1.0.0

Apr 10, 2025

1.0.0rc9 pre-release

Apr 7, 2025

1.0.0rc8 pre-release

Apr 7, 2025

1.0.0rc7 pre-release

Apr 7, 2025

1.0.0rc6 pre-release

Apr 3, 2025

1.0.0rc5 pre-release

Apr 3, 2025

1.0.0rc4 pre-release

Mar 29, 2025

1.0.0rc3 pre-release

Mar 27, 2025

1.0.0rc2 pre-release

Mar 27, 2025

1.0.0rc1 pre-release

Mar 26, 2025

1.0.0.dev5 pre-release

Mar 19, 2025

1.0.0.dev4 pre-release

Mar 19, 2025

0.1.2

May 19, 2025

0.1.1

Mar 6, 2025

0.1.0

Feb 28, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

livekit_plugins_aws-1.4.2.tar.gz (42.9 kB view details)

Uploaded Feb 17, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

livekit_plugins_aws-1.4.2-py3-none-any.whl (48.6 kB view details)

Uploaded Feb 17, 2026 Python 3

File details

Details for the file livekit_plugins_aws-1.4.2.tar.gz.

File metadata

Download URL: livekit_plugins_aws-1.4.2.tar.gz
Upload date: Feb 17, 2026
Size: 42.9 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.8.12

File hashes

Hashes for livekit_plugins_aws-1.4.2.tar.gz
Algorithm	Hash digest
SHA256	`f165d84aedc22181bcef482521f70592b94ef0b080297b48ba9f7db3150dbb81`
MD5	`aa130b6f0a313f4ee6ffb3cc8f9b846f`
BLAKE2b-256	`27b20375d70132ae6ac3e960a08bcadcc24cf9616e058a9d7de6ab1e8f60f040`

See more details on using hashes here.

File details

Details for the file livekit_plugins_aws-1.4.2-py3-none-any.whl.

File metadata

Download URL: livekit_plugins_aws-1.4.2-py3-none-any.whl
Upload date: Feb 17, 2026
Size: 48.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.8.12

File hashes

Hashes for livekit_plugins_aws-1.4.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`d7ac81645089b957dc193696ef2e40d579949b44940d7b8f5298d50cb0026538`
MD5	`91546f08d401cfa0638127f1bf4ee86c`
BLAKE2b-256	`b71402895f04f5ba26e4fa05f2467c3626983646aa85d580cd0e1842e75f3131`

See more details on using hashes here.

livekit-plugins-aws 1.4.2

Navigation

Verified details

Owner

Unverified details

Project links

Meta

Classifiers

Project description

AWS Plugin for LiveKit Agents

⚠️ Breaking Change

Installation

Prerequisites

AWS Credentials

Getting Temporary Credentials from SSO (Local Testing)

Quick Start Example

Demonstrates Both Modes

Demonstrates Nova 2 Sonic Capabilities

Setup

Running the Example

Features

Nova 2 Sonic Capabilities

Model Selection

Voice Selection

Nova 2 Sonic Voice IDs (18 voices)

Nova Sonic 1.0 Voice IDs (11 voices)

Text Prompting with generate_reply()

instructions vs user_input

Turn-Taking Sensitivity

Complete Example

Pipeline Mode (STT + LLM + TTS)

Nova 2 Lite

Resources

Project details

Verified details

Owner

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

Text Prompting with `generate_reply()`

`instructions` vs `user_input`