Mistral Voxtral STT integration for Vision Agents

Mistral Voxtral STT Plugin

Mistral Voxtral realtime speech-to-text integration for Vision Agents.

Features

  • Real-time speech recognition via WebSocket streaming
  • Low-latency transcription using Voxtral models
  • Automatic language detection
  • Partial transcript streaming for responsive UX
  • Sentence-level final transcripts (triggered by ., ?, !)

Installation

uv add "vision-agents[mistral]"
# or directly
uv add vision-agents-plugins-mistral

Usage

from vision_agents.core import Agent, Runner, User
from vision_agents.core.agents import AgentLauncher
from vision_agents.plugins import deepgram, gemini, getstream, mistral


async def create_agent(**kwargs) -> Agent:
    return Agent(
        edge=getstream.Edge(),
        agent_user=User(name="Assistant", id="agent"),
        instructions="You're a helpful voice AI assistant. Keep replies short and conversational.",
        stt=mistral.STT(),
        tts=deepgram.TTS(),
        llm=gemini.LLM(),
    )


async def join_call(agent: Agent, call_type: str, call_id: str, **kwargs) -> None:
    call = await agent.create_call(call_type, call_id)

    async with agent.join(call):
        await agent.run()


if __name__ == "__main__":
    Runner(AgentLauncher(create_agent=create_agent, join_call=join_call)).cli()

Run with:

uv run plugins/mistral/example/mistral_stt_example.py run

Turn Detection

Mistral Voxtral STT does not include built-in turn detection (turn_detection=False). You'll need to pair it with an external turn detection plugin.

Configuration

Parameter    Description                                               Default
api_key      Mistral API key                                           MISTRAL_API_KEY env var
model        Model identifier                                          voxtral-mini-transcribe-realtime-2602
sample_rate  Audio sample rate (Hz): 8000, 16000, 22050, 44100, 48000  16000
client       Pre-configured Mistral client                             None
Events

The plugin emits standard STT events:

  • STTTranscriptEvent: Final transcript (emitted at sentence boundaries or stream end)
  • STTPartialTranscriptEvent: Partial word/delta as transcription streams
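
To make the relationship between the two event types concrete, here is a rough sketch of how partial deltas could be folded into sentence-level finals. It is an illustration under assumptions, not the plugin's code: `SentenceBuffer` is a hypothetical helper, and the only behavior taken from this README is that finals are triggered by `.`, `?`, or `!` and by the end of the stream.

```python
from typing import List, Optional

# Sentence-ending characters named in the feature list.
SENTENCE_END = (".", "?", "!")


class SentenceBuffer:
    """Hypothetical helper: turns streamed text deltas into final sentences."""

    def __init__(self) -> None:
        self._pending = ""

    def feed(self, delta: str) -> List[str]:
        """Append one partial delta; return the sentences it completed."""
        finals: List[str] = []
        for ch in delta:
            self._pending += ch
            if ch in SENTENCE_END:
                sentence = self._pending.strip()
                # Skip fragments that are punctuation only (e.g. the tail of "...").
                if any(c.isalnum() for c in sentence):
                    finals.append(sentence)
                self._pending = ""
        return finals

    def flush(self) -> Optional[str]:
        """At stream end, return any leftover text as a last final transcript."""
        rest = self._pending.strip()
        self._pending = ""
        return rest or None
```

In this sketch each delta would correspond to an STTPartialTranscriptEvent, and each string returned by `feed` or `flush` to an STTTranscriptEvent. Splitting on bare punctuation is a simplification: a number such as "3.14" would be cut at the decimal point.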

Dependencies

  • mistralai[realtime]>=1.12.0
  • vision-agents
