
Binsmith

A bash-native AI agent that builds its own toolkit.

An AI agent that works by running shell commands and writing scripts. Tools it creates persist across sessions.

The idea

Most AI agents are stateless. They solve problems, then forget everything. Binsmith takes a different approach: when it does something useful, it writes a script. That script goes into a persistent toolkit.

Ask Binsmith to fetch a webpage, and it writes fetch-url. Ask it to convert HTML to markdown, and it writes html2md. A week later, when you ask for a daily briefing, it composes them:

$ brief

# News
- AI lab announces new model...
- Tech company acquires startup...

# Weather
San Francisco: 62°F, partly cloudy

# Your todos
- [ ] Review PR #42
- [ ] Write README improvements

That brief command didn't exist until you needed it. Now it does, and it builds on tools that already existed.
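A sketch of what such an agent-written brief script might look like, assuming news, weather, and todo already live on the workspace PATH (they are stubbed here so the sketch runs standalone; the real generated script would simply call them):

```shell
#!/bin/sh
# Hypothetical sketch of an agent-written `brief` tool.
# In a real workspace, news/weather/todo would be earlier tools on the
# PATH; they are stubbed here so the sketch runs on its own.
news()    { echo "- AI lab announces new model..."; }
weather() { echo "San Francisco: 62°F, partly cloudy"; }
todo()    { echo "- [ ] Review PR #42"; }

brief() {
  printf '# News\n';       news
  printf '\n# Weather\n';  weather
  printf '\n# Your todos\n'; todo
}

brief
```

The point is not the formatting but the composition: brief adds no logic of its own beyond stitching together tools that already exist.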

Binsmith uses a server/client architecture. The TUI and web UI are interchangeable clients — they talk to the same server, share the same threads, and see the same toolkit.

Requirements

  • uv — used both to run Binsmith and by the scripts it creates
  • Python 3.14+ (uv will install this automatically)
  • An API key for at least one LLM provider (Gemini, Anthropic, or OpenAI)

Quick start

uvx binsmith

Or clone and run locally:

uv sync
uv run binsmith

This starts the TUI, which auto-starts a local server.

Set GEMINI_API_KEY for the default model (Gemini Flash), or ANTHROPIC_API_KEY / OPENAI_API_KEY for alternatives. See Models.

CLI

binsmith                 # Run the TUI (default)
binsmith tui             # Run the TUI explicitly
binsmith server          # Run the API server

binsmith tui

--server <url>           Binsmith server base URL (default: http://localhost:8000)
--no-autostart           Do not auto-start a local server if missing
--server-workspace       Workspace mode for auto-started server: local | central

binsmith server

--host <host>            Host interface to bind (default: 127.0.0.1)
--port <port>            Port to bind (default: 8000)
--reload                 Enable auto-reload
--local-workspace        Use a project-local .binsmith workspace on the server

What the agent builds

After a few days of use, a toolkit might look like:

~/.binsmith/workspace/bin/
  fetch-url     # Fetch a URL, handle retries, extract text
  html2md       # Convert HTML to clean markdown
  news          # Top stories from news sources
  weather       # Weather for a location
  todo          # Manage a simple todo list
  brief         # Daily briefing (composes news, weather, todo)
  code-map      # Map out a codebase structure
  code-ref      # Find references to a symbol

Each tool is a self-contained script that works for both you and Binsmith. Python scripts use inline script metadata (PEP 723), so dependencies are declared in the file itself: just run the script and uv handles the rest. No virtualenv, no pip install.
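For instance, since the agent only has bash, it might create a Python tool with a heredoc like this (the tool name and body are illustrative, and a temp directory stands in for the workspace bin/; the `# /// script` block is the PEP 723 inline metadata that uv reads):

```shell
bin=$(mktemp -d)   # stand-in for ~/.binsmith/workspace/bin
cat > "$bin/fetch-url" <<'EOF'
#!/usr/bin/env -S uv run --script
# /// script
# requires-python = ">=3.14"
# dependencies = ["httpx"]
# ///
import sys
import httpx

print(httpx.get(sys.argv[1]).text)
EOF
chmod +x "$bin/fetch-url"
```

On first run, uv resolves httpx from the metadata block and executes the script in an ephemeral environment, which is what makes each tool portable as a single file.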

Run todo add "buy milk" yourself, or let Binsmith do it — same interface, same tool.

Tools are symlinked to ~/.local/bin (or $BINSMITH_GLOBAL_BIN) so they're available everywhere, not just inside Binsmith. They pipe into each other and compose naturally.
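The symlinking step can be sketched as follows, with temp directories standing in for the real workspace bin/ and ~/.local/bin:

```shell
workspace=$(mktemp -d)   # stand-in for ~/.binsmith/workspace/bin
global=$(mktemp -d)      # stand-in for ~/.local/bin
printf '#!/bin/sh\necho hello from todo\n' > "$workspace/todo"
chmod +x "$workspace/todo"
ln -s "$workspace/todo" "$global/todo"   # the symlink Binsmith would create
PATH="$global:$PATH" todo                # -> hello from todo
```

Because the link points back into the workspace, editing the script in place updates the globally visible command too.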

How it works

Binsmith has one tool: bash. It runs commands in your project directory with the workspace bin/ on the PATH. The workspace persists:

.binsmith/
  workspace/
    bin/      # Scripts the agent creates
    data/     # Persistent data
    tmp/      # Scratch space
  binsmith.db # Conversation history

On each run, the agent sees its current toolkit and is prompted to use existing tools before writing one-off commands.
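How that toolkit view is assembled isn't specified here; one plausible sketch, assuming (purely as an illustration) that each script carries a one-line description comment on its second line:

```shell
bin=$(mktemp -d)   # stand-in for the workspace bin/
printf '#!/bin/sh\n# Fetch a URL and extract text\n' > "$bin/fetch-url"
printf '#!/bin/sh\n# Convert HTML to markdown\n'     > "$bin/html2md"

# Build a "name - description" inventory line per tool for the prompt.
inventory=$(for f in "$bin"/*; do
  printf '%s - %s\n' "$(basename "$f")" "$(sed -n '2s/^# //p' "$f")"
done)
echo "$inventory"
```

Whatever the actual format, the effect is the same: the prompt changes as the toolkit grows, so the agent's capabilities compound across sessions.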

Architecture

┌──────────────────────────────────────────┐
│              Clients                     │
│        TUI  /  Web UI  /  (API)          │
└─────────────────┬────────────────────────┘
                  │ HTTP + AG-UI streaming
                  ▼
┌──────────────────────────────────────────┐
│          Binsmith Server                 │
│   FastAPI · SQLite · Session management  │
└─────────────────┬────────────────────────┘
                  │ pydantic-ai
                  ▼
┌──────────────────────────────────────────┐
│               Agent                      │
│   Dynamic prompt · bash tool · Toolkit   │
└─────────────────┬────────────────────────┘
                  │ subprocess
                  ▼
┌──────────────────────────────────────────┐
│            File System                   │
│   Scripts as files · Git-friendly        │
└──────────────────────────────────────────┘


Models

Default: google-gla:gemini-3-flash-preview

export GEMINI_API_KEY=...     # Google
export ANTHROPIC_API_KEY=...  # Anthropic
export OPENAI_API_KEY=...     # OpenAI

Switch models in the TUI with /model set <name> or via the web UI sidebar. Run /model list to see available models.

Web UI

cd frontend
npm install
npm run dev

Connects to http://localhost:8000 by default. Override with VITE_BINSMITH_SERVER_URL in frontend/.env.local.

TUI commands

/help                     Show help
/threads                  List threads
/thread <id>              Switch to a thread
/thread new [id]          Create a new thread
/thread delete <id>       Delete a thread
/clear                    Clear current thread
/model                    Show current model
/model list [filter]      List models
/model set <name>         Set model
/model default            Reset to default
/quit                     Exit

Configuration

Variable                  Default                             Description
BINSMITH_MODEL            google-gla:gemini-3-flash-preview   Default model
BINSMITH_WORKSPACE_MODE   local                               local (per-project) or central (~/.binsmith)
BINSMITH_SERVER_URL       http://localhost:8000               Server URL for clients
BINSMITH_LOGFIRE          0                                   Enable Logfire telemetry

Running the server directly

binsmith server
# or
uvicorn binsmith.server.asgi:app --reload --port 8000

The TUI auto-starts a server if needed, so this is mainly for development or running a shared server.
