Skip to main content

A modular voice agent with swappable STT/TTS/LLM backends

Project description

fastrtc-voice-agent

A modular voice agent built on FastRTC

Installation

pip install fastrtc-voice-agent

Please install your desired STT and LLM with (for example) :

pip install "fastrtc-voice-agent[ollama]"

or for all optional dependancies :

pip install "fastrtc-voice-agent[all]"

CLI Usage Example

For default config :

fastrtc-voice-agent --run

Please refere to the help for custom config :

fastrtc-voice-agent --help

Python Usage Example

from fastrtc import ReplyOnPause, Stream
from voice_agent import create_agent, AgentConfig, STTConfig, TTSConfig, LLMConfig

config = AgentConfig(
    system_prompt="You are a helpful voice assistant.",
    stt=STTConfig(backend="faster_whisper", model_size="small"),
    tts=TTSConfig(backend="edge", voice="en-US-AvaMultilingualNeural"),
    llm=LLMConfig(backend="ollama", model="llama3.2:3b"),
)

agent = create_agent(config)

stream = Stream(
    ReplyOnPause(agent.create_fastrtc_handler()),
    modality="audio",
    mode="send-receive",
)

stream.ui.launch()

Note

To use Anthropic API (may be OpenAI or else later) please copy .env.example as .env file and fill it with your API KEY and the desired model

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

fastrtc_voice_agent-0.1.4.tar.gz (10.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

fastrtc_voice_agent-0.1.4-py3-none-any.whl (15.0 kB view details)

Uploaded Python 3

File details

Details for the file fastrtc_voice_agent-0.1.4.tar.gz.

File metadata

  • Download URL: fastrtc_voice_agent-0.1.4.tar.gz
  • Upload date:
  • Size: 10.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.11

File hashes

Hashes for fastrtc_voice_agent-0.1.4.tar.gz
Algorithm Hash digest
SHA256 808bc3a860e3244a165d14255d67ee87f8624f3bb8944c7cbf82f479fbca25f2
MD5 d533bff503d36d73628e688d8ee2dbcc
BLAKE2b-256 05ce1d1b43d39ff08a7afe68fae9059d3608e1ce6e01cba400f839b0b8604590

See more details on using hashes here.

File details

Details for the file fastrtc_voice_agent-0.1.4-py3-none-any.whl.

File metadata

File hashes

Hashes for fastrtc_voice_agent-0.1.4-py3-none-any.whl
Algorithm Hash digest
SHA256 d532c76d249eff3122fb97666e367af28165a1d0ed22c8cd871f9d2b93f557fd
MD5 0d7b60988e448c1034f7d53a672353da
BLAKE2b-256 4c0dba3c6ab6805dbe8c4edebeb280ef88089856d9e632b4a098126f2aec68eb

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page