Use MLX in Strands Agents

These details have not been verified by PyPI

Project links

Project description

strands-mlx

Running Strands Agents locally on Apple Silicon - inference, fine-tuning, vision in Python

MLX provider for Strands Agents with LoRA training pipelines.

Requirements: Python ≤3.13, macOS/Linux

# Text models
pip install strands-mlx

# With vision/audio/video
pip install "strands-mlx[vision]"

# Environment setup
uv venv --python 3.13 && source .venv/bin/activate

# Install dependencies
uv pip install strands-agents strands-mlx strands-agents-tools

Quick Start

agent.py

from strands import Agent
from strands_mlx import MLXModel
from strands_tools import calculator

model = MLXModel(model_id="mlx-community/Qwen3-1.7B-4bit")
agent = Agent(model=model, tools=[calculator])

agent("What is 29 * 42?")

# Run with uv
uv run agent.py

Architecture

graph LR
    A[Agent Conversations] -->|MLXSessionManager| B[Training Data JSONL]
    B -->|dataset_splitter| C[train/valid/test]
    C -->|mlx_trainer| D[LoRA Adapter]
    D -->|MLXModel| E[Domain Expert Agent]
    E -.->|Continuous Learning| A
    
    style A fill:#e1f5ff
    style E fill:#d4edda
    style D fill:#fff3cd

The complete training cycle: Agents collect their own training data → fine-tune themselves → become domain experts → continue learning.

Train Your Own Model

4 steps: Collect → Split → Train → Use

1. Collect Training Data

from strands import Agent
from strands_tools import calculator
from strands_mlx import MLXSessionManager, dataset_splitter, mlx_trainer

session = MLXSessionManager(session_id="my_training", storage_dir="./dataset")
agent = Agent(model=model, tools=[calculator, dataset_splitter, mlx_trainer], session_manager=session)

# Have conversations - auto-saved to JSONL
agent("Teach me about quantum computing")
agent("Calculate 15 * 7")

# Saved to: ./dataset/my_training.jsonl

2. Split Dataset

agent.tool.dataset_splitter(
    input_path="./dataset/my_training.jsonl"
)
# Creates train.jsonl, valid.jsonl, test.jsonl (80/10/10 split)

3. Train with LoRA

agent.tool.mlx_trainer(
    action="train",
    config={
        "model": "mlx-community/Qwen3-1.7B-4bit",
        "data": "./dataset/my_training",
        "adapter_path": "./adapter",
        "iters": 200,
        "learning_rate": 1e-5,
        "batch_size": 1
    }
)

4. Use Trained Model

from strands import Agent
from strands_mlx import MLXModel

trained = MLXModel("mlx-community/Qwen3-1.7B-4bit", adapter_path="./adapter")
agent = Agent(model=trained)

agent("Explain quantum computing")  # Uses trained knowledge!

Vision Models

from strands_mlx import MLXVisionModel

model = MLXVisionModel(model_id="mlx-community/Qwen2-VL-2B-Instruct-4bit")
agent = Agent(model=model)

agent("Describe: <image>photo.jpg</image>")
agent("Transcribe: <audio>speech.wav</audio>")
agent("What happens: <video>clip.mp4</video>")

Training Tools

Tool	Purpose
`mlx_trainer`	Background LoRA training
`dataset_splitter`	Split JSONL → train/valid/test
`validate_training_data`	Check format & token counts
`mlx_invoke`	Runtime model switching
`mlx_vision_invoke`	Vision as a tool

Advanced Training

YAML config file:

model: mlx-community/Qwen3-1.7B-4bit
data: ./training_data
iters: 1000
learning_rate: 1e-5
lora_parameters:
  rank: 8
  scale: 16.0
lr_schedule:
  name: cosine_decay
  warmup: 100
optimizer: adamw

Use config:

agent.tool.mlx_trainer(action="train", config="./lora_config.yaml")

Popular Models

Text:

mlx-community/Qwen3-1.7B-4bit (recommended)
mlx-community/Qwen3-4B-4bit
mlx-community/Llama-3.2-1B-4bit
mlx-community/gemma-2-2b-it-4bit

Vision:

mlx-community/Qwen2-VL-2B-Instruct-4bit (recommended)
mlx-community/Qwen2-Audio-7B-Instruct (audio)
mlx-community/llava-v1.6-mistral-7b-4bit

Community models at mlx-community

Troubleshooting

Out of memory:

config = {
    "grad_checkpoint": True,
    "batch_size": 1,
    "max_seq_length": 1024
}

Model degraded:

config = {
    "iters": 200,  # Lower for small datasets
    "learning_rate": 1e-5  # Conservative
}

Resources

Citation

@software{strands_mlx2025,
  author = {Cagatay Cali},
  title = {strands-mlx: MLX Model Provider for Strands Agents},
  year = {2025},
  url = {https://github.com/cagataycali/strands-mlx}
}

Apache 2 License | Built with MLX, MLX-LM, and Strands Agents

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.2.6

Nov 3, 2025

0.2.5

Nov 3, 2025

This version

0.2.4

Nov 2, 2025

0.2.1

Nov 2, 2025

0.2.0

Nov 1, 2025

0.1.1

Oct 31, 2025

0.1.0

Oct 31, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

strands_mlx-0.2.4.tar.gz (41.3 kB view details)

Uploaded Nov 2, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

strands_mlx-0.2.4-py3-none-any.whl (43.5 kB view details)

Uploaded Nov 2, 2025 Python 3

File details

Details for the file strands_mlx-0.2.4.tar.gz.

File metadata

Download URL: strands_mlx-0.2.4.tar.gz
Upload date: Nov 2, 2025
Size: 41.3 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.8.22

File hashes

Hashes for strands_mlx-0.2.4.tar.gz
Algorithm	Hash digest
SHA256	`d84a9b1970726aa7e4a94e0e6cd83efe6b03014f182296db3ee0b43b4bbcd533`
MD5	`5e72ff69a1531be2226a0739e4199479`
BLAKE2b-256	`b409f3c0f4b40759893a7d576541b55ccc40fd063047d1e4e1b82562dd4eafdf`

See more details on using hashes here.

File details

Details for the file strands_mlx-0.2.4-py3-none-any.whl.

File metadata

Download URL: strands_mlx-0.2.4-py3-none-any.whl
Upload date: Nov 2, 2025
Size: 43.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.8.22

File hashes

Hashes for strands_mlx-0.2.4-py3-none-any.whl
Algorithm	Hash digest
SHA256	`2500676a93efa577fc61a7605778ce524753fa369118d2914ac3c233a491024a`
MD5	`f47fa37f44f0f10a1337b219a22012de`
BLAKE2b-256	`224e353e47713fe2e7f678df1c2cc4753b9262df4a4a4c01c4bdffcda593e98b`

See more details on using hashes here.

strands-mlx 0.2.4

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

strands-mlx

Quick Start

Architecture

Train Your Own Model

1. Collect Training Data

2. Split Dataset

3. Train with LoRA

4. Use Trained Model

Vision Models

Training Tools

Advanced Training

Popular Models

Troubleshooting

Resources

Citation

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes