Mistral Voxtral STT integration for Vision Agents
Mistral Voxtral STT Plugin
Mistral Voxtral realtime speech-to-text integration for Vision Agents.
Features
- Real-time speech recognition via WebSocket streaming
- Low-latency transcription using Voxtral models
- Automatic language detection
- Partial transcript streaming for responsive UX
- Sentence-level final transcripts (triggered by ., ?, or !)
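
The last two features work together: partial deltas stream as they arrive, and a final transcript is emitted once sentence-ending punctuation appears. The sketch below illustrates that buffering idea only. `SentenceBuffer` is a hypothetical helper, not part of the plugin, and the plugin's actual logic may differ (for instance, this naive version also splits on the dot in "3.14").

```python
from typing import List, Optional

SENTENCE_ENDINGS = (".", "?", "!")


class SentenceBuffer:
    """Hypothetical helper: accumulates text deltas and returns complete sentences."""

    def __init__(self) -> None:
        self._pending = ""

    def push(self, delta: str) -> List[str]:
        """Append a partial delta and return any sentences it completes."""
        self._pending += delta
        finals: List[str] = []
        start = 0
        for i, ch in enumerate(self._pending):
            if ch in SENTENCE_ENDINGS:
                sentence = self._pending[start : i + 1].strip()
                if sentence:
                    finals.append(sentence)
                start = i + 1
        # Keep only the unfinished tail for the next delta.
        self._pending = self._pending[start:]
        return finals

    def flush(self) -> Optional[str]:
        """Return whatever text remains at stream end, or None if empty."""
        rest = self._pending.strip()
        self._pending = ""
        return rest or None
```

Feeding the deltas "Hello" and " world. How" yields no sentence on the first call and "Hello world." on the second; calling `flush()` at stream end returns the unfinished remainder.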
Installation
uv add vision-agents[mistral]
Usage
from vision_agents.core import Agent, Runner, User
from vision_agents.core.agents import AgentLauncher
from vision_agents.plugins import deepgram, gemini, getstream, mistral


async def create_agent(**kwargs) -> Agent:
    # Mistral Voxtral handles speech-to-text; Deepgram provides TTS and Gemini the LLM.
    return Agent(
        edge=getstream.Edge(),
        agent_user=User(name="Assistant", id="agent"),
        instructions="You're a helpful voice AI assistant. Keep replies short and conversational.",
        stt=mistral.STT(),
        tts=deepgram.TTS(),
        llm=gemini.LLM(),
    )


async def join_call(agent: Agent, call_type: str, call_id: str, **kwargs) -> None:
    # Register the agent user, create the call, then run until the call ends.
    await agent.create_user()
    call = await agent.create_call(call_type, call_id)
    async with agent.join(call):
        await agent.run()


if __name__ == "__main__":
    Runner(AgentLauncher(create_agent=create_agent, join_call=join_call)).cli()
Run with:
uv run plugins/mistral/example/mistral_stt_example.py run
Turn Detection
Mistral Voxtral STT does not include built-in turn detection (turn_detection=False). You'll need to pair it with an external turn detection plugin.
Configuration
| Parameter | Description | Default |
|---|---|---|
| api_key | Mistral API key | MISTRAL_API_KEY env var |
| model | Model identifier | voxtral-mini-transcribe-realtime-2602 |
| sample_rate | Audio sample rate (Hz): 8000, 16000, 22050, 44100, 48000 | 16000 |
| client | Pre-configured Mistral client | None |
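
To make the defaults in the table concrete, here is a minimal sketch of how such parameters could be resolved and validated. `STTSettings` and `resolve_settings` are hypothetical names introduced for illustration; they are not the plugin's API, and whether the plugin rejects unsupported sample rates this way is an assumption.

```python
import os
from dataclasses import dataclass
from typing import Optional

# Values taken from the configuration table above.
SUPPORTED_SAMPLE_RATES = (8000, 16000, 22050, 44100, 48000)
DEFAULT_MODEL = "voxtral-mini-transcribe-realtime-2602"


@dataclass(frozen=True)
class STTSettings:
    """Hypothetical container mirroring the documented parameters."""

    api_key: str
    model: str
    sample_rate: int


def resolve_settings(
    api_key: Optional[str] = None,
    model: str = DEFAULT_MODEL,
    sample_rate: int = 16000,
) -> STTSettings:
    """Fall back to MISTRAL_API_KEY and check the sample rate (assumed behavior)."""
    key = api_key or os.environ.get("MISTRAL_API_KEY")
    if not key:
        raise ValueError("Pass api_key or set the MISTRAL_API_KEY environment variable")
    if sample_rate not in SUPPORTED_SAMPLE_RATES:
        raise ValueError(f"Unsupported sample rate: {sample_rate}")
    return STTSettings(api_key=key, model=model, sample_rate=sample_rate)
```

An explicit `api_key` argument takes precedence over the environment variable in this sketch, which matches the usual convention for SDK clients.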
Events
The plugin emits standard STT events:
- STTTranscriptEvent: Final transcript (emitted at sentence boundaries or stream end)
- STTPartialTranscriptEvent: Partial word/delta as transcription streams
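
One way a consumer might combine the two event types is to show partial deltas as an in-progress caption and replace them with the final text once it arrives. The sketch below uses stand-in dataclasses rather than the real event classes; the `text` field and the `CaptionState` helper are assumptions made for illustration, so check the Vision Agents event definitions for the actual attributes.

```python
from dataclasses import dataclass, field
from typing import List, Union


@dataclass
class PartialTranscript:
    """Stand-in for STTPartialTranscriptEvent (the field name is an assumption)."""

    text: str


@dataclass
class FinalTranscript:
    """Stand-in for STTTranscriptEvent (the field name is an assumption)."""

    text: str


@dataclass
class CaptionState:
    """Hypothetical live-caption state driven by transcript events."""

    committed: List[str] = field(default_factory=list)
    in_progress: str = ""

    def apply(self, event: Union[PartialTranscript, FinalTranscript]) -> None:
        if isinstance(event, PartialTranscript):
            # Partials are deltas, so append them to the current draft.
            self.in_progress += event.text
        elif isinstance(event, FinalTranscript):
            # A final transcript supersedes the draft built from partials.
            self.committed.append(event.text)
            self.in_progress = ""

    def render(self) -> str:
        draft = self.in_progress.strip()
        return " ".join(self.committed + ([draft] if draft else []))
```

Treating the final transcript as authoritative keeps the caption stable even if the accumulated partials differ slightly from the final text.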
Dependencies
- mistralai[realtime]>=1.12.0
- vision-agents