Qwen Omni plugin for vision agents

Qwen Realtime Plugin for Vision Agents

Qwen3 Realtime LLM integration for the Vision Agents framework. It provides native audio output and built-in speech recognition over a WebSocket-based realtime connection.

Features

Native audio output: No TTS service needed - audio comes directly from the model
Built-in STT: Integrated speech-to-text using gummy-realtime-v1 - no external STT service required
Server-side VAD: Automatic turn detection with configurable silence thresholds
Video understanding: Optional video frame support for multimodal interactions
Real-time streaming: WebSocket-based bidirectional communication for low-latency responses
Interruption handling: Automatic cancellation when user starts speaking
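
The built-in STT and server-side VAD features above are typically configured through a session message sent over the WebSocket. The following is a minimal sketch of what such a message might look like, not the plugin's actual implementation. The event type `session.update`, the field names, and the `threshold` and `silence_duration_ms` values are assumptions modeled on the realtime session schema; only `gummy-realtime-v1` and the `Cherry` voice come from this README. `build_session_update` is a hypothetical helper.

```python
import json


def build_session_update(
    voice: str = "Cherry",
    stt_model: str = "gummy-realtime-v1",
    vad_threshold: float = 0.5,       # assumed value, not taken from the plugin
    silence_duration_ms: int = 800,   # assumed value, not taken from the plugin
) -> str:
    """Serialize a hypothetical session configuration event as JSON."""
    event = {
        "type": "session.update",  # assumed event name
        "session": {
            "modalities": ["text", "audio"],
            "voice": voice,
            # Built-in STT: transcription runs server-side with this model.
            "input_audio_transcription": {"model": stt_model},
            # Server-side VAD: the server decides when the user's turn ends.
            "turn_detection": {
                "type": "server_vad",
                "threshold": vad_threshold,
                "silence_duration_ms": silence_duration_ms,
            },
        },
    }
    return json.dumps(event)


payload = json.loads(build_session_update(silence_duration_ms=500))
print(payload["session"]["turn_detection"])
```

A shorter silence threshold makes the model respond sooner after the user stops talking, at the cost of cutting in during natural pauses.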

Installation

uv add "vision-agents[qwen]"

Usage

from vision_agents.core import User, Agent
from vision_agents.plugins import getstream, qwen

agent = Agent(
    edge=getstream.Edge(),
    agent_user=User(name="Qwen Assistant"),
    instructions="Be helpful and friendly",
    llm=qwen.Realtime(
        model="qwen3-omni-flash-realtime",
        voice="Cherry",
        fps=1,
    ),
    # No STT or TTS needed - Qwen Realtime provides both
)

Configuration

Parameter	Description	Default	Accepted Values
`model`	Qwen Realtime model identifier	`"qwen3-omni-flash-realtime"`	Model name string
`api_key`	DashScope API key	`None` (from env)	String or `None`
`base_url`	WebSocket API base URL	`"wss://dashscope-intl.aliyuncs.com/api-ws/v1/realtime"`	URL string
`voice`	Voice for audio output	`"Cherry"`	Voice name string
`fps`	Video frames per second	`1`	Integer
`include_video`	Include video frames in requests	`False`	Boolean
`video_width`	Video frame width	`1280`	Integer
`video_height`	Video frame height	`720`	Integer
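
To keep these settings in one place, the table can be mirrored in a small dataclass whose fields are later unpacked into `qwen.Realtime`. This is a sketch only: `RealtimeConfig` is a hypothetical helper, and the validation rules (positive integers, WebSocket URL scheme) are assumptions rather than documented plugin behavior. The field names and defaults are copied from the table.

```python
from dataclasses import asdict, dataclass
from typing import Optional


@dataclass(frozen=True)
class RealtimeConfig:
    """Hypothetical container mirroring the configuration table."""

    model: str = "qwen3-omni-flash-realtime"
    api_key: Optional[str] = None  # None means: read from the environment
    base_url: str = "wss://dashscope-intl.aliyuncs.com/api-ws/v1/realtime"
    voice: str = "Cherry"
    fps: int = 1
    include_video: bool = False
    video_width: int = 1280
    video_height: int = 720

    def __post_init__(self) -> None:
        # Assumed sanity checks; the plugin may validate differently.
        if not self.base_url.startswith(("ws://", "wss://")):
            raise ValueError("base_url must be a WebSocket URL")
        for name in ("fps", "video_width", "video_height"):
            if getattr(self, name) <= 0:
                raise ValueError(f"{name} must be a positive integer")

    def as_kwargs(self) -> dict:
        """Return the settings as keyword arguments."""
        return asdict(self)


config = RealtimeConfig(include_video=True, video_width=640, video_height=360)
kwargs = config.as_kwargs()
# With the plugin installed, these could be passed on as:
# llm = qwen.Realtime(**kwargs)
print(kwargs["include_video"], kwargs["video_width"], kwargs["video_height"])
```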

Environment Variables

Set DASHSCOPE_API_KEY in your environment or .env file:

DASHSCOPE_API_KEY=your_dashscope_api_key_here
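
Because `api_key` defaults to `None` and falls back to the environment, it can help to fail early with a clear message when the key is missing. The sketch below shows one way to do that before constructing the agent. `resolve_api_key` is a hypothetical helper, and the plugin's own lookup logic may differ.

```python
import os
from typing import Mapping, Optional


def resolve_api_key(
    explicit: Optional[str] = None,
    env: Mapping[str, str] = os.environ,
) -> str:
    """Return the explicit key if given, else DASHSCOPE_API_KEY from `env`."""
    key = explicit or env.get("DASHSCOPE_API_KEY")
    if not key:
        raise RuntimeError(
            "Set DASHSCOPE_API_KEY in the environment or pass api_key explicitly"
        )
    return key


# An explicit key takes precedence over the environment.
print(resolve_api_key("sk-explicit", env={"DASHSCOPE_API_KEY": "sk-from-env"}))
```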

Example

See plugins/qwen/example/qwen_realtime_example.py for a complete working example.

Dependencies

vision-agents
websockets
aiortc
av

This version: 0.3.3 (Feb 4, 2026)

vision-agents-plugins-qwen 0.3.3
