Agent Framework plugin for Baseten

These details have been verified by PyPI

Project links

Source

Owner

LiveKit

GitHub Statistics

These details have not been verified by PyPI

Project links

Project description

Baseten plugin for LiveKit Agents

Support for Baseten-hosted models in LiveKit Agents, including STT (Speech-to-Text), TTS (Text-to-Speech), and LLM (Large Language Model) integrations.

Installation

pip install livekit-plugins-baseten

Pre-requisites

You'll need an API key from Baseten. It can be set as an environment variable: BASETEN_API_KEY

You also need to deploy a model to Baseten and will need your model endpoint to configure the plugin.

STT (Speech-to-Text)

The STT plugin connects to Baseten's Whisper Streaming WebSocket endpoint for real-time transcription. It works with both truss and chain deployments.

Recommended model

Whisper v3 Turbo – WebSocket

Endpoint URL formats

Deployment type	URL pattern
Truss	`wss://model-{model_id}.api.baseten.co/environments/production/websocket`
Chain	`wss://chain-{chain_id}.api.baseten.co/environments/production/websocket`

Basic usage

You can specify the endpoint in three ways:

from livekit.plugins import baseten

# 1. Using a truss model ID (recommended for truss deployments)
stt = baseten.STT(
    api_key="your-baseten-api-key",  # or set BASETEN_API_KEY env var
    model_id="your-model-id",
    language="en",
)

# 2. Using a chain ID (recommended for chain deployments)
stt = baseten.STT(
    api_key="your-baseten-api-key",
    chain_id="your-chain-id",
    language="en",
)

# 3. Using a full endpoint URL (for custom routing or deployment URLs)
stt = baseten.STT(
    api_key="your-baseten-api-key",
    model_endpoint="wss://model-{model_id}.api.baseten.co/environments/production/websocket",
    language="en",
)

Configuration options

Parameter	Default	Description
`api_key`	`BASETEN_API_KEY` env var	Baseten API key
`model_endpoint`	`BASETEN_MODEL_ENDPOINT` env var	Full WebSocket URL (takes priority over `model_id`/`chain_id`)
`model_id`	—	Baseten truss model ID; auto-constructs the endpoint URL
`chain_id`	—	Baseten chain ID; auto-constructs the endpoint URL
`language`	`"en"`	BCP-47 language code (use `"auto"` for auto-detection)
`encoding`	`"pcm_s16le"`	Audio encoding (`pcm_s16le` or `pcm_mulaw`)
`sample_rate`	`16000`	Audio sample rate in Hz
`enable_partial_transcripts`	`True`	Emit interim transcripts while the speaker is talking
`partial_transcript_interval_s`	`1.0`	Interval (seconds) between partial transcript updates
`final_transcript_max_duration_s`	`30`	Max seconds of audio before forcing a final transcript
`show_word_timestamps`	`True`	Include word-level timestamps in results
`vad_threshold`	`0.5`	Server-side VAD speech probability threshold (0.0–1.0)
`vad_min_silence_duration_ms`	`300`	Minimum silence (ms) to mark end of speech
`vad_speech_pad_ms`	`30`	Padding (ms) added around detected speech

Full voice pipeline example

import os
from livekit import agents
from livekit.agents import AgentSession, Agent, RoomInputOptions, inference
from livekit.plugins import baseten, openai, noise_cancellation
from livekit.agents.inference import AudioTurnDetector

BASETEN_API_KEY = os.getenv("BASETEN_API_KEY")
whisper_model_id = "your-whisper-model-id"  # or use chain_id for chain deployments
orpheus_model_id = "your-orpheus-model-id"


class Assistant(Agent):
    def __init__(self) -> None:
        super().__init__(instructions="You are a helpful voice AI assistant.")


async def entrypoint(ctx: agents.JobContext):
    session = AgentSession(
        stt=baseten.STT(
            api_key=BASETEN_API_KEY,
            model_id=whisper_model_id,  # or chain_id="your-chain-id"
            language="en",
            enable_partial_transcripts=True,
        ),
        llm=openai.LLM(
            api_key=BASETEN_API_KEY,
            base_url="https://inference.baseten.co/v1",
            model="openai/gpt-oss-120b",
        ),
        tts=baseten.TTS(
            api_key=BASETEN_API_KEY,
            model_endpoint=(
                f"https://model-{orpheus_model_id}"
                ".api.baseten.co/environments/production/predict"
            ),
        ),
        vad=inference.VAD(),
        turn_detection=AudioTurnDetector(),
    )

    await session.start(
        room=ctx.room,
        agent=Assistant(),
        room_input_options=RoomInputOptions(
            noise_cancellation=noise_cancellation.BVC(),
        ),
    )

    await session.generate_reply(
        instructions="Greet the user and offer your assistance."
    )


if __name__ == "__main__":
    agents.cli.run_app(agents.WorkerOptions(entrypoint_fnc=entrypoint))

TTS (Text-to-Speech)

The TTS plugin calls Baseten-hosted TTS models (e.g. Orpheus 3B) over HTTP.

tts = baseten.TTS(
    api_key="your-baseten-api-key",
    model_endpoint="https://model-{model_id}.api.baseten.co/environments/production/predict",
    voice="tara",
    language="en",
)

LLM (Large Language Model)

The LLM plugin wraps Baseten's OpenAI-compatible inference endpoint.

llm = baseten.LLM(
    api_key="your-baseten-api-key",
    model="openai/gpt-oss-120b",
)

Documentation

Project details

These details have been verified by PyPI

Project links

Source

Owner

LiveKit

GitHub Statistics

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

1.6.0rc2 pre-release

May 29, 2026

1.6.0rc1 pre-release

May 27, 2026

1.5.15

May 29, 2026

1.5.14

May 27, 2026

1.5.13

May 25, 2026

1.5.12

May 21, 2026

1.5.11

May 19, 2026

1.5.10

May 18, 2026

1.5.9

May 13, 2026

1.5.8

May 5, 2026

1.5.7

Apr 30, 2026

1.5.6

Apr 22, 2026

1.5.5

Apr 20, 2026

1.5.4

Apr 16, 2026

1.5.3

Apr 15, 2026

1.5.2

Apr 8, 2026

1.5.1

Mar 23, 2026

1.5.0

Mar 19, 2026

1.5.0rc2 pre-release

Mar 6, 2026

1.5.0rc1 pre-release

Feb 13, 2026

1.4.6

Mar 16, 2026

1.4.5

Mar 11, 2026

1.4.4

Mar 3, 2026

1.4.3

Feb 23, 2026

1.4.2

Feb 17, 2026

1.4.1

Feb 6, 2026

1.4.0

Feb 6, 2026

1.4.0rc2 pre-release

Jan 23, 2026

1.4.0rc1 pre-release

Dec 23, 2025

1.3.12

Jan 21, 2026

1.3.11

Jan 14, 2026

1.3.10

Dec 23, 2025

1.3.9

Dec 19, 2025

1.3.8

Dec 17, 2025

1.3.7

Dec 16, 2025

1.3.6

Dec 3, 2025

1.3.5

Nov 25, 2025

1.3.4

Nov 24, 2025

1.3.3

Nov 19, 2025

1.3.2

Nov 17, 2025

1.3.1

Nov 17, 2025

1.3.0rc2 pre-release

Nov 15, 2025

1.3.0rc1 pre-release

Nov 6, 2025

1.2.18

Nov 5, 2025

1.2.17

Oct 29, 2025

1.2.16

Oct 27, 2025

1.2.15

Oct 15, 2025

1.2.14

Oct 1, 2025

1.2.13

Oct 1, 2025

1.2.12

Sep 29, 2025

1.2.11

Sep 18, 2025

1.2.9

Sep 15, 2025

1.2.8

Sep 2, 2025

1.2.7

Aug 28, 2025

1.2.6

Aug 18, 2025

1.2.5

Aug 10, 2025

1.2.4

Aug 7, 2025

1.2.3

Aug 4, 2025

1.2.2

Jul 24, 2025

1.2.1

Jul 17, 2025

1.2.0

Jul 17, 2025

1.1.7

Jul 15, 2025

1.1.6

Jul 10, 2025

1.1.5

Jun 30, 2025

1.1.4

Jun 25, 2025

1.1.3

Jun 21, 2025

1.1.2

Jun 20, 2025

1.1.1

Jun 10, 2025

1.1.0

Jun 10, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

livekit_plugins_baseten-1.6.0rc2.tar.gz (12.4 kB view details)

Uploaded May 29, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

livekit_plugins_baseten-1.6.0rc2-py3-none-any.whl (15.2 kB view details)

Uploaded May 29, 2026 Python 3

File details

Details for the file livekit_plugins_baseten-1.6.0rc2.tar.gz.

File metadata

Download URL: livekit_plugins_baseten-1.6.0rc2.tar.gz
Upload date: May 29, 2026
Size: 12.4 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.13

File hashes

Hashes for livekit_plugins_baseten-1.6.0rc2.tar.gz
Algorithm	Hash digest
SHA256	`346561920c74c30e6ec5ab33d1f88e665cb3ce867ec257e6fea980388cd68803`
MD5	`14913631fe82271807d5bb5f0390c7e0`
BLAKE2b-256	`7bb6ac0f5b0de570dfe9ecb71ef0010baf6805a4388f933274d87ea1c8a99a59`

See more details on using hashes here.

Provenance

The following attestation bundles were made for livekit_plugins_baseten-1.6.0rc2.tar.gz:

Publisher: publish.yml on livekit/agents

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: livekit_plugins_baseten-1.6.0rc2.tar.gz
- Subject digest: 346561920c74c30e6ec5ab33d1f88e665cb3ce867ec257e6fea980388cd68803
- Sigstore transparency entry: 1671822875
- Sigstore integration time: May 29, 2026
Source repository:
- Permalink: livekit/agents@080216a7dba4010d5a068f5eb150c34829505624
- Branch / Tag: refs/heads/feat/AGT-2520-multimodal-EOU
- Owner: https://github.com/livekit
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@080216a7dba4010d5a068f5eb150c34829505624
- Trigger Event: pull_request

File details

Details for the file livekit_plugins_baseten-1.6.0rc2-py3-none-any.whl.

File metadata

Download URL: livekit_plugins_baseten-1.6.0rc2-py3-none-any.whl
Upload date: May 29, 2026
Size: 15.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.13

File hashes

Hashes for livekit_plugins_baseten-1.6.0rc2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`37a40e424907d0fe2e3ad5e2f932ce94899eb31fd8b0a87a11345ff4afafb946`
MD5	`1ffe2d8435f4e6c4ea0c2de2cfd7e7eb`
BLAKE2b-256	`5bf5e4f1c20cd3870c13714a05096cc535d5b666a7dcf07703a687526e878301`

See more details on using hashes here.

Provenance

The following attestation bundles were made for livekit_plugins_baseten-1.6.0rc2-py3-none-any.whl:

Publisher: publish.yml on livekit/agents

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: livekit_plugins_baseten-1.6.0rc2-py3-none-any.whl
- Subject digest: 37a40e424907d0fe2e3ad5e2f932ce94899eb31fd8b0a87a11345ff4afafb946
- Sigstore transparency entry: 1671822903
- Sigstore integration time: May 29, 2026
Source repository:
- Permalink: livekit/agents@080216a7dba4010d5a068f5eb150c34829505624
- Branch / Tag: refs/heads/feat/AGT-2520-multimodal-EOU
- Owner: https://github.com/livekit
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@080216a7dba4010d5a068f5eb150c34829505624
- Trigger Event: pull_request

livekit-plugins-baseten 1.6.0rc2

Navigation

Verified details

Project links

Owner

GitHub Statistics

Unverified details

Project links

Meta

Classifiers

Project description

Baseten plugin for LiveKit Agents

Installation

Pre-requisites

STT (Speech-to-Text)

Recommended model

Endpoint URL formats

Basic usage

Configuration options

Full voice pipeline example

TTS (Text-to-Speech)

LLM (Large Language Model)

Documentation

Project details

Verified details

Project links

Owner

GitHub Statistics

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance