Coding Agent for Mac

These details have not been verified by PyPI

Project links

Project description

mlx-code

A lightweight coding agent built on Apple's MLX framework.

https://github.com/user-attachments/assets/0569d101-8d0a-4e67-9e82-fce84a5ef3f0

Features

Composable by design: Agent, Tool, and the REPL are separate pieces you can import and wire together however you like
Swappable backends: point the harness at the local MLX server, a remote provider, or any OpenAI-compatible endpoint without changing anything else
Git worktree isolation: every session gets a fresh worktree so the agent can't silently corrupt your working tree
9 built-in tools: Read, Write, Edit, Bash, Grep, Find, Ls, Skill, Agent
Interactive REPL commands: /clear, /history, /tools, /branch, /abort

Quick Start

pip install mlx-code
mlc

Command Line

`mlc`: local server + harness

Starts the MLX inference server and launches a harness against it.

# Default: local MLX server + built-in REPL harness
mlc

# Use a different harness (routes traffic through the local server)
mlc --leash claude
mlc --leash gemini
mlc --leash codex

# Server only, no harness
mlc --leash none

# Specify a model
mlc --model mlx-community/Qwen3.5-4B-OptiQ-4bit

# Restrict the tools available to the agent
mlc --tools Read Write Bash

# Custom system prompt
mlc --system "You are a helpful assistant."

# Load skills from a directory (scans recursively for SKILL.md files)
mlc --skill ./my-skills

# Resume a previous session from a git commit hash
mlc --resume <commit-hash>

# Because `mlc` reads from stdin when it isn't a TTY, it composes naturally with shell pipes:
echo "Here's the solution you proposed: <excerpt>$(mlc -p "write code for a chrome extension to play youtube x5 speed")</excerpt> Now argue against it. What are the edge cases this doesn't handle? What assumptions did you make that might not hold in a production system? What would you change if you knew this code would be read by a senior engineer in a security audit?" | mlc

`mlc-run`: harness only

Runs the agent harness against an already-running server or a remote provider.

# Connect to a local server at 127.0.0.1:8000 (default)
mlc-run

# Remote providers
mlc-run --api claude
mlc-run --api gemini
mlc-run --api deepseek --model deepseek-v4-pro
mlc-run --api codex

# Custom endpoint
echo "explain lsp.py" | mlc-run -a deepseek | cat - PLAN.md | mlc-run --url http://localhost:9000

Using as a Library

Import the pieces you need to build background workers, scheduled jobs, or event-triggered handlers.

Spawn an agent from Python

import asyncio
from mlx_code.repl import Agent

async def main():
    agent = Agent(system="You are a concise technical writer.")
    await agent.run("Summarise all *.py files changed in the last 7 days. Save to digest.md.")

asyncio.run(main())

Multi-agent pipeline

import asyncio
from mlx_code.repl import Agent

async def main():
    researcher = Agent(system="You are a research assistant.")
    await researcher.run("Research PBFT consensus. Save a structured summary to kb/draft.md.")

    reviewer = Agent(system="You are a critical reviewer.")
    await reviewer.run(
        "Read kb/draft.md. Write a one-paragraph critique to kb/critique.md. "
        "Use only information in that file."
    )

asyncio.run(main())

Parallel workers with `asyncio.gather`

import asyncio
from mlx_code.repl import Agent

async def main():
    topics = ["history", "algorithms", "industry_usage"]
    agents = [Agent() for _ in topics]
    await asyncio.gather(*[
        a.run(f"Research the {t} of Byzantine Fault Tolerance. Save to kb/{t}.md.")
        for a, t in zip(agents, topics)
    ])
    reducer = Agent()
    await reducer.run("Read all files in kb/. Synthesise into final_report.md.")

asyncio.run(main())

Resume a session from a git commit

mlx-code stores the full conversation as JSON in each commit message, so you can restore both the workspace state and the agent's memory from any checkpoint.

import asyncio
from mlx_code.gits import resume_worktree
from mlx_code.repl import Agent, repl

async def main():
    gwt, messages = resume_worktree(".", "abc1234")
    agent = Agent(ctx={"gwt": gwt})
    agent.messages = messages
    await repl(agent)

asyncio.run(main())

Custom tools

Subclass Tool, define a Pydantic schema, and pass the class at instantiation.

from mlx_code.tools import Tool
from mlx_code.repl import Agent
from pydantic import BaseModel, Field

class QueryParams(BaseModel):
    query: str = Field(description="SQL query to run")

class LiveDBTool(Tool):
    name = "QueryDB"
    description = "Execute a query against the dev database"
    parameters = QueryParams

    async def execute(self, params: QueryParams, signal=None) -> dict:
        result = run_query(params.query)   # your logic here
        return {"content": [{"type": "text", "text": result}], "is_error": False}

agent = Agent(extra_tool_classes=[LiveDBTool], tool_names=["QueryDB"])

Credits

Built on mlx and mlx-lm. Inspired by Mario Zechner's pi.

License

Apache License 2.0: see LICENSE for details.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.0.26

Jun 14, 2026

0.0.25

Jun 13, 2026

0.0.24

Jun 13, 2026

0.0.23

Jun 13, 2026

0.0.21

Jun 13, 2026

0.0.20

Jun 13, 2026

This version

0.0.19

Jun 13, 2026

0.0.18

Jun 12, 2026

0.0.17

Jun 12, 2026

0.0.16

Jun 12, 2026

0.0.15

Jun 12, 2026

0.0.14

Jun 12, 2026

0.0.13

Jun 12, 2026

0.0.12

Jun 12, 2026

0.0.11

Jun 6, 2026

0.0.10

May 30, 2026

0.0.9

May 24, 2026

0.0.8

May 10, 2026

0.0.7

May 9, 2026

0.0.6

May 5, 2026

0.0.5

May 5, 2026

0.0.4

May 4, 2026

0.0.3

May 4, 2026

0.0.2

May 3, 2026

0.0.2a6 pre-release

Apr 5, 2026

0.0.2a5 pre-release

Apr 4, 2026

0.0.2a3 pre-release

Apr 4, 2026

0.0.2a2 pre-release

Mar 31, 2026

0.0.2a1 pre-release

Mar 28, 2026

0.0.2a0 pre-release

Mar 25, 2026

0.0.1

Mar 21, 2026

0.0.1a3 pre-release

Mar 13, 2026

0.0.1a2 pre-release

Mar 13, 2026

0.0.1a1 pre-release

Mar 13, 2026

0.0.1a0 pre-release

Mar 8, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mlx_code-0.0.19.tar.gz (77.4 kB view details)

Uploaded Jun 13, 2026 Source

File details

Details for the file mlx_code-0.0.19.tar.gz.

File metadata

Download URL: mlx_code-0.0.19.tar.gz
Upload date: Jun 13, 2026
Size: 77.4 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.8

File hashes

Hashes for mlx_code-0.0.19.tar.gz
Algorithm	Hash digest
SHA256	`e1b4225893af382c41bd8fad93b0c25af68be4e295ddf9c6728bc26dd3d134c7`
MD5	`0c34a848bc21cfad91744885fbd5adcb`
BLAKE2b-256	`d208b8fcb7fb87ecdaa1d00aeee3fbf44843e8345fd3a222a2d4d80d2437f429`

See more details on using hashes here.

mlx-code 0.0.19

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Project description

mlx-code

Features

Quick Start

Command Line

`mlc`: local server + harness

`mlc-run`: harness only

Using as a Library

Spawn an agent from Python

Multi-agent pipeline

Parallel workers with `asyncio.gather`

Resume a session from a git commit

Custom tools

Credits

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

File details

File metadata

File hashes

mlx-code 0.0.19

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Project description

mlx-code

Features

Quick Start

Command Line

mlc: local server + harness

mlc-run: harness only

Using as a Library

Spawn an agent from Python

Multi-agent pipeline

Parallel workers with asyncio.gather

Resume a session from a git commit

Custom tools

Credits

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

File details

File metadata

File hashes

`mlc`: local server + harness

`mlc-run`: harness only

Parallel workers with `asyncio.gather`