
A Python client for Qwen and DeepSeek models with AutoGen, supporting structured outputs and function calling.


QwenOpenAIChatCompletionClient

A Python client library for interacting with Qwen3 and DeepSeek models via an OpenAI-compatible API, built on top of AutoGen. This client provides structured output support, function calling, and comprehensive model configuration for building agentic AI applications.

Installation

You can install the package using either of the following methods:

  • From PyPI (recommended):
pip install qwen3-autogen-client
  • From source (for development):
pip install -e .

Or install dependencies directly:

pip install -r requirements.txt

Build a Wheel File

To build a wheel file for distribution, run the following script from the project root:

./build_wheel.sh

This will generate a .whl file in the dist/ directory.

Attribution

This project is based on the excellent work from:

Features

  • Multi-Model Support: Qwen3, Qwen2.5, and DeepSeek models
  • Structured Output: Pydantic model-based JSON schema enforcement for reliable AI responses
  • Function Calling: Full support for tool usage and function calling capabilities
  • Async Support: Both streaming and non-streaming async operations
  • Token Management: Intelligent token counting and remaining token calculation
  • Comprehensive Logging: Built-in logging for debugging and monitoring
  • AutoGen Integration: Seamless integration with AutoGen's agent framework

Supported Models

Qwen3 Models

  • Qwen3-32B (32K context)
  • Qwen3-14B (32K context)
  • Qwen3-8B (32K context)
  • Qwen3-4B (32K context)
  • Qwen3-1.7B (32K context)
  • qwen-max (32K context)
  • qwen-max-latest (128K context)
  • qwen-plus (128K context)
  • qwen-plus-latest (128K context)
  • qwen-turbo (1M context)
  • qwen-turbo-latest (1M context)
  • qwen3-235b-a22b (128K context)
  • qwen3-30b-a3b (128K context)

Qwen2.5 Models

  • Qwen2.5-Omni-7B (32K context, vision)
  • Qwen2.5-Omni-3B (32K context, vision)
  • Qwen2.5-VL-32B-Instruct (32K context, vision)
  • Qwen2.5-VL-7B-Instruct (32K context, vision)

DeepSeek Models

  • deepseek-chat (64K context, function calling supported)
  • deepseek-reasoner (64K context, reasoning mode)

Quick Start

Basic Usage

from qwen3_autogen_client import QwenOpenAIChatCompletionClient
from autogen_core.models import UserMessage

# Initialize the client
client = QwenOpenAIChatCompletionClient(
    model="qwen-max-latest",
    base_url="https://dashscope.aliyuncs.com/compatible-mode/v1",
    api_key="your_api_key_here"
)

# Create a simple completion
messages = [UserMessage(content="Hello, how are you?")]
result = await client.create(messages=messages)
print(result.content)
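
The await call above assumes you are already inside a running event loop (for example, a notebook). In a standalone script, a minimal sketch is to wrap the call in a coroutine and drive it with asyncio.run:

import asyncio

async def main() -> None:
    # Reuses the client and messages defined above
    result = await client.create(messages=messages)
    print(result.content)

asyncio.run(main())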

Structured Output with Pydantic

from pydantic import BaseModel
from typing import List

class TaskList(BaseModel):
    tasks: List[str]
    priority: str

# Get structured output
messages = [UserMessage(content="Create a task list for planning a vacation")]
result = await client.create(
    messages=messages,
    json_output=TaskList
)
# Result will be automatically formatted according to TaskList schema
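
The returned content is JSON that conforms to the TaskList schema. A minimal sketch of turning it back into a Pydantic object, assuming result.content holds that JSON string:

# Validate the JSON string against the TaskList schema (Pydantic v2 API)
task_list = TaskList.model_validate_json(result.content)
print(task_list.priority)
for task in task_list.tasks:
    print("-", task)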

Streaming Response

async for chunk in client.create_stream(messages=messages):
    if isinstance(chunk, str):
        print(chunk, end="")
    else:
        # Final result
        print(f"\nFinal result: {chunk}")

Function Calling

from autogen_core.tools import FunctionTool

# Define a tool
def get_weather(location: str) -> str:
    return f"The weather in {location} is sunny."

tool = FunctionTool(get_weather, name="get_weather", description="Get weather for a location")

# Use with function calling
result = await client.create(
    messages=[UserMessage(content="What's the weather in Tokyo?")],
    tools=[tool]
)
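
When the model decides to call a tool, the result's content is a list of function calls rather than plain text. A sketch of inspecting it, assuming the entries follow autogen_core's FunctionCall fields (id, name, arguments):

if isinstance(result.content, str):
    # The model answered directly without calling a tool
    print(result.content)
else:
    # One entry per requested tool call; arguments is a JSON string
    for call in result.content:
        print(call.name, call.arguments)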

Usage Example: Simple Agent and Function Calling

Below is an example demonstrating how to use a simple agent with function calling capability:

import os
import sys
import asyncio
import logging
from datetime import datetime
from qwen3_autogen_client import QwenOpenAIChatCompletionClient
from autogen_agentchat.agents import AssistantAgent

logging.basicConfig(level=logging.INFO)
logger = logging.getLogger(__name__)

async def get_time() -> str:
    """Returns the current server time in YYYY-MM-DD HH:MM:SS format."""
    return datetime.now().strftime("%Y-%m-%d %H:%M:%S")

def print_result(result):
    for msg in result.messages:
        if hasattr(msg, 'content'):
            print(f"Message: {msg.content}")
        elif hasattr(msg, 'tool_calls'):
            print(f"Tool Calls: {msg.tool_calls}")
        else:
            print(f"Unknown message type: {msg}")

async def main():
    logger.info("Starting Qwen client script")
    logger.info("Checking required environment variables")

    # 1. Instantiate your LLM client (here using Local Qwen Model)
    logger.info("Instantiating QwenOpenAIChatCompletionClient")
    model_client = QwenOpenAIChatCompletionClient(model=os.getenv("MODEL_NAME"), base_url=os.getenv("OPENAI_API_BASE"))
    logger.info(f"Client instantiated: {model_client}")

    # 2. Create an assistant agent
    agent = AssistantAgent("assistant", model_client=model_client, tools=[get_time])
    logger.info(f"Agent instantiated: {agent}")
    # 3. Run the agent on a simple knowledge task
    logger.info("Running agent")
    result = await agent.run(task="What is most common language in the world?/no_think")
    logger.info(f"Result: {result}")
    print_result(result)

    # 4. Run the agent on a more complex task
    logger.info("Running agent on a task that requires function calling")
    result = await agent.run(task="What is the current server time?, Is it afternoon?")
    logger.info(f"Result: returned {len(result.messages)} messages")
    print_result(result)
    return 0

if __name__ == "__main__":
    exit_code = asyncio.run(main())
    sys.exit(exit_code)

Sample Output

2025-06-08 17:18:22,855 - Instantiating QwenOpenAIChatCompletionClient
2025-06-08 17:18:22,915 - Initialized QwenOpenAIChatCompletionClient with model: Qwen3 4B and base URL: [MASKED_URL]
2025-06-08 17:18:22,915 - Client instantiated: <qwen3_autogen_client.qwen_client.QwenOpenAIChatCompletionClient object at 0x117ab51d0>
2025-06-08 17:18:22,916 - Agent instantiated: <autogen_agentchat.agents._assistant_agent.AssistantAgent object at 0x117b33190>
2025-06-08 17:18:22,916 - Running agent
2025-06-08 17:18:44,330 - HTTP Request: POST [MASKED_URL] "HTTP/1.1 200 OK"
2025-06-08 17:18:44,338 - { ... "model": "Qwen3 4B", ... }
2025-06-08 17:18:44,339 - Result: messages=[TextMessage(source='user', ...), TextMessage(source='assistant', ...)] stop_reason=None
2025-06-08 17:18:44,339 - Running agent on a task that requires function calling
Message: What is most common language in the world?/no_think
Message: The most common language in the world is Mandarin Chinese. It is spoken by approximately 1.3 billion people, making it the most spoken language globally. However, if we consider the number of native speakers, Spanish is the most spoken language.
2025-06-08 17:18:52,078 - HTTP Request: POST [MASKED_URL] "HTTP/1.1 200 OK"
2025-06-08 17:18:52,082 - { ... "model": "Qwen3 4B", ... }
2025-06-08 17:18:52,083 - {"type": "ToolCall", "tool_name": "get_time", "arguments": {}, "result": "2025-06-08 17:18:52", "agent_id": null}
2025-06-08 17:18:52,083 - Result: returned 4 messages
Message: What is the current server time?, Is it afternoon?
Message: [FunctionCall(id='1rX3Ta4NkgrJsgzUZrijIxVbxKRNGaDD', arguments='{}', name='get_time')]
Message: [FunctionExecutionResult(content='2025-06-08 17:18:52', name='get_time', call_id='1rX3Ta4NkgrJsgzUZrijIxVbxKRNGaDD', is_error=False)]
Message: 2025-06-08 17:18:52

Configuration

Environment Variables

Create a .env file in your project root:

OPENAI_API_KEY=your_api_key_here
OPENAI_API_BASE=https://dashscope.aliyuncs.com/compatible-mode/v1
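
A minimal sketch of loading these variables and passing them to the client, assuming python-dotenv is installed:

import os
from dotenv import load_dotenv
from qwen3_autogen_client import QwenOpenAIChatCompletionClient

load_dotenv()  # Reads .env from the current working directory
client = QwenOpenAIChatCompletionClient(
    model="qwen-max-latest",
    base_url=os.environ["OPENAI_API_BASE"],
    api_key=os.environ["OPENAI_API_KEY"],
)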

Direct Configuration

client = QwenOpenAIChatCompletionClient(
    model="qwen-max-latest",
    base_url="https://your-api-endpoint.com",
    api_key="your_api_key",
    # Additional OpenAI client parameters
    timeout=30.0,
    max_retries=3
)

API Reference

QwenOpenAIChatCompletionClient

__init__(model: str, base_url: str, **kwargs)

Initialize the client.

Parameters:

  • model (str): The model name to use (required)
  • base_url (str): Base URL for the API endpoint (required)
  • **kwargs: Additional parameters passed to the underlying OpenAI client

Example:

client = QwenOpenAIChatCompletionClient(
    model="qwen-max-latest",
    base_url="https://dashscope.aliyuncs.com/compatible-mode/v1",
    api_key="your_key"
)

async create(messages, *, tools=[], json_output=None, extra_create_args={}, cancellation_token=None) -> CreateResult

Create a completion.

Parameters:

  • messages: Sequence of LLMMessage objects
  • tools: Optional sequence of Tool or ToolSchema objects
  • json_output: Optional bool or Pydantic BaseModel class for structured output
  • extra_create_args: Additional arguments for the API call (see the example after this list)
  • cancellation_token: Optional cancellation token
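
For example, extra_create_args is forwarded to the underlying chat completion request, so standard OpenAI parameters can be tuned per call. A sketch (temperature and max_tokens are standard OpenAI parameters, not client-specific options):

result = await client.create(
    messages=messages,
    extra_create_args={"temperature": 0.2, "max_tokens": 512},
)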

async create_stream(...) -> AsyncGenerator[Union[str, CreateResult], None]

Create a streaming completion with the same parameters as create().

remaining_tokens(messages, *, tools=[]) -> int

Calculate remaining tokens available for the conversation.
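
A sketch of using it as a guard before a long exchange (the threshold is arbitrary and depends on your use case):

left = client.remaining_tokens(messages, tools=[tool])
if left < 1000:
    print(f"Only {left} tokens left in the context window")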

Model Capabilities

Model               Context Window
qwen-max            32K
qwen-max-latest     128K
qwen-plus           128K
qwen-turbo          1M
deepseek-chat       64K
deepseek-reasoner   64K

License

This project is licensed under the MIT License - see the LICENSE file for details.
