MLX integration for the Agent Framework
Project description
Agent Framework MLX
This library provides an Apple MLX LM backend for the Agent Framework. It allows you to run Language Models locally on macOS with Apple Silicon using the mlx-lm library.
Features
- Local Inference: Run models directly on your Mac using Apple Silicon
- Tool Calling: Native support for agentic tool invocation.
- Observability: Built-in OpenTelemetry tracing, metrics, and token usage tracking.
- Streaming Support: Full support for streaming responses.
- Configurable Generation: Fine-tune generation parameters like temperature, top-p, repetition penalty, and more.
- Message Preprocessing: Hook into the pipeline to modify messages before they are converted to prompts.
- Agent Framework Integration: Seamlessly plugs into the Agent Framework's
BaseChatClientinterface.
Installation
Ensure you have Python 3.9+ and are running on macOS with Apple Silicon.
pip install agent-framework-mlx
Or install from source:
git clone https://github.com/filipw/agent-framework-mlx.git
cd agent-framework-mlx
pip install -e .
Usage
Basic Example
import asyncio
from agent_framework import ChatMessage, Role, ChatOptions
from agent_framework_mlx import MLXChatClient, MLXGenerationConfig
# Initialize the client
client = MLXChatClient(
model_path="mlx-community/Phi-4-mini-instruct-4bit",
generation_config=MLXGenerationConfig(
temp=0.7,
max_tokens=500
)
)
# Create messages
messages = [
ChatMessage(role=Role.SYSTEM, text="You are a helpful assistant."),
ChatMessage(role=Role.USER, text="Why is the sky blue?")
]
# Get response
response = await client.get_response(messages=messages, chat_options=ChatOptions())
print(response.text)
Agent with Tools
You can easily create agents capable of calling tools (Python functions) using the client:
from typing import Annotated
def calculate_bmi(weight: float, height: float) -> str:
"""Calculates BMI."""
return f"{weight / (height ** 2):.2f}"
# Create an agent with the tool
agent = client.as_agent(
name="HealthAssistant",
instructions="You are a helpful assistant.",
tools=[calculate_bmi]
)
response = await agent.run("Calculate BMI for 70kg and 1.75m")
print(response)
Workflow Integration
You can use the client as backbone for Agent Framework agents when building agentic workflows:
from agent_framework import ChatAgent
# notice the client constructed in the previous example now backs the local agent
local_agent = client.as_agent(
name="Local_MLX",
instructions="You are a helpful assistant."
)
remote_agent = ChatAgent(
name="Cloud_LLM",
instructions="You are a fallback expert. The previous assistant was unsure. Provide a complete answer.",
chat_client=azure_client
)
builder = WorkflowBuilder()
builder.set_start_executor(local_agent)
builder.add_edge(
source=local_agent,
target=remote_agent,
condition=should_fallback_to_cloud
)
workflow = builder.build()
Streaming
async for update in client.get_streaming_response(messages=messages, chat_options=ChatOptions()):
print(update.text, end="", flush=True)
Configuration
You can configure the client using environment variables or a .env file. Using environment variables like MLX_MODEL_PATH allows you to omit arguments in code.
export MLX_MODEL_PATH="mlx-community/Phi-4-mini-instruct-4bit"
# No arguments needed if env vars are set
client = MLXChatClient()
Advanced Configuration
You can configure generation parameters globally via MLXGenerationConfig or per-request via ChatOptions.
config = MLXGenerationConfig(
temp=0.7,
top_p=0.9,
repetition_penalty=1.1,
seed=42
)
Message Preprocessing
You can intercept and modify messages before they are sent to the model. This is useful for injecting instructions or formatting content.
def inject_instruction(messages):
if messages:
messages[-1]["content"] += "\nIMPORTANT: Be concise."
return messages
client = MLXChatClient(
model_path="...",
message_preprocessor=inject_instruction
)
Requirements
- macOS
- Apple Silicon
- Python 3.9+
License
This project is licensed under the MIT License. See the LICENSE file for details.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file agent_framework_mlx-0.4.0.tar.gz.
File metadata
- Download URL: agent_framework_mlx-0.4.0.tar.gz
- Upload date:
- Size: 9.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
263e47c74b44053e06c2f6bedc67d0b72126bfff591189bde782bc143a388748
|
|
| MD5 |
9f776069d7d674a96e362003d891bd9d
|
|
| BLAKE2b-256 |
c56de7256dff29ece495cf00850026dfc08af1eedb5bc787c35a4bb023c57974
|
Provenance
The following attestation bundles were made for agent_framework_mlx-0.4.0.tar.gz:
Publisher:
publish.yml on filipw/agent-framework-mlx
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
agent_framework_mlx-0.4.0.tar.gz -
Subject digest:
263e47c74b44053e06c2f6bedc67d0b72126bfff591189bde782bc143a388748 - Sigstore transparency entry: 856096595
- Sigstore integration time:
-
Permalink:
filipw/agent-framework-mlx@7fe4aecd209ba0c54c0b5fe6a9085cfdfb4f9663 -
Branch / Tag:
refs/tags/v0.4.0 - Owner: https://github.com/filipw
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@7fe4aecd209ba0c54c0b5fe6a9085cfdfb4f9663 -
Trigger Event:
release
-
Statement type:
File details
Details for the file agent_framework_mlx-0.4.0-py3-none-any.whl.
File metadata
- Download URL: agent_framework_mlx-0.4.0-py3-none-any.whl
- Upload date:
- Size: 7.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
e8581a0f251850e9b72c4fa773a09c88a3dde7bfdb0155fb0a7c8791ea3f9b36
|
|
| MD5 |
494977e885be40da82a1cc46ebb137d8
|
|
| BLAKE2b-256 |
44cf02228d7363f12d45314c40c3530510e21762d30fa766ce42f78d3a830c07
|
Provenance
The following attestation bundles were made for agent_framework_mlx-0.4.0-py3-none-any.whl:
Publisher:
publish.yml on filipw/agent-framework-mlx
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
agent_framework_mlx-0.4.0-py3-none-any.whl -
Subject digest:
e8581a0f251850e9b72c4fa773a09c88a3dde7bfdb0155fb0a7c8791ea3f9b36 - Sigstore transparency entry: 856096630
- Sigstore integration time:
-
Permalink:
filipw/agent-framework-mlx@7fe4aecd209ba0c54c0b5fe6a9085cfdfb4f9663 -
Branch / Tag:
refs/tags/v0.4.0 - Owner: https://github.com/filipw
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@7fe4aecd209ba0c54c0b5fe6a9085cfdfb4f9663 -
Trigger Event:
release
-
Statement type: