primer-ai

Microagents framework

Project description

Primer - orchestrate fleets of small, context-optimized agents

An unopinionated, batteries-included agent-orchestration platform built around one bet: a small model given a clean, purpose-built context can rival a much larger one.

Quickstart · Features · How it works · Docs · Contributing

Why Primer

A language model spreads a fixed budget of attention across every token in its context at once. Keep the context tight and the few tokens that matter get most of that attention; bloat it with stale history, unused tool definitions, and irrelevant background and the signal thins out. Primer's core bet is that you often do not need the biggest model if you give the model you have exactly what it needs and nothing more - and that this is a lever on any model, large or small.

So instead of one giant agent with everything crammed into its prompt, Primer lets you orchestrate fleets of small, focused agents, each with a clean working context, wired together with the primitives a real deployment needs: LLM providers, workspaces, graphs, knowledge collections, channels, triggers, and semantic search - integrated from the start, runnable on your own hardware.

What you can build

⏸️ Yielding, event-driven agents

Agents park on a slow tool or a human decision and resume when the event fires - freeing compute while they wait.

🔀 Directed graphs

Wire agents into cyclic graphs (producer-judge loops, fan-out/fan-in, conditional branches) that run as structured workflows.

📁 Workspaces & sessions

Materialised sandboxes (local, container, or Kubernetes) give each agent a persistent filesystem and git-backed state.

🔎 Semantic search

Ingest documents into vector collections; agents retrieve only the relevant chunks, not the whole corpus.

💬 Channels

Bridge agents to Slack, Telegram, and Discord - ask questions, request approvals, and trigger work from a message.

🔌 MCP server

Expose the full platform tool surface over the Model Context Protocol so external agents and MCP clients can use it.

✅ Human approvals

Gate sensitive tool calls behind an approval that a person grants from a channel or the console before the agent proceeds.

📦 Harnesses

Package a tuned set of agents, graphs, and collections into a git-backed, versioned bundle and deploy it anywhere in one step.

🧭 Dynamic discovery

Two meta-tools let an agent search for and invoke any tool or agent at runtime - without carrying the whole catalog in its context.

Built for loop engineering

Loop engineering is the shift from prompting an agent turn-by-turn to designing the system that prompts it - a loop that wakes on a schedule, works toward a stated goal, checks its own output against evidence, and escalates to a human only when it should. The leverage moves from writing a good prompt to designing a good loop.

A loop needs a specific set of primitives. Primer ships all of them, integrated and self-hostable:

A loop needs...	Primer gives you
A heartbeat - work surfaced on a cadence, not by hand	Triggers that start a fresh session or graph run (or resume a parked one) on a cron schedule, a delay, or a webhook
Isolation - parallel agents that don't collide	Workspaces - a per-agent local, container, or Kubernetes sandbox with its own persistent, git-backed filesystem
Durable memory - the agent forgets, the repo doesn't	Git-backed workspace state plus knowledge collections agents retrieve from, so knowledge compounds across runs instead of resetting to zero
A maker and a checker - keep the writer away from the grader	Directed cyclic graphs with producer-judge loops, fan-out/fan-in, and runtime agent/graph invocation
Connectors - reach real tools and real people	A built-in MCP server (and MCP client), plus Slack / Telegram / Discord channels
A human gate - approve the risky, let the safe run	Approval gates and park-and-resume: an agent waits on a person for hours without holding compute, then continues when the reply lands

Primer does not press "go" on the loop for you - it gives you the orchestration substrate to build one and to keep a human in it where that matters. And the same context discipline that makes a single agent accurate is what lets a loop run for a long time without drifting: each iteration gets a clean, purpose-built context instead of an ever-growing transcript.

Quickstart

Pick whichever install fits. All three start the same server zero-config on an embedded SQLite database - perfect for a first look.

pipx (isolated CLI install; needs Python 3.13+):

pipx install primer-ai
primer api                                       # API + in-process worker

Docker (no Python toolchain required):

docker run --rm -p 8000:8000 ghcr.io/primerhq/primer:latest

From source (for contributors):

git clone https://github.com/primerhq/primer.git
cd primer
uv sync
uv run primer api

Then verify and open the console:

curl http://localhost:8000/v1/health             # -> {"status":"ok"}

The operator console is at http://localhost:8000/console/.

Going to Postgres (multi-process, semantic search, production)

Zero-config SQLite is single-process and ships without a vector store. For multiple workers, semantic search, or production, point Primer at Postgres:

docker compose up -d postgres                    # or: podman compose up -d postgres
cp config.example.yaml config.yaml               # set db.config.password to match
uv run primer api --config config.yaml

config.example.yaml documents every field. Environment variables override file values: every AppConfig field maps to PRIMER_<FIELD> (nested fields use __, e.g. PRIMER_DB__CONFIG__PASSWORD). The Docker image reads the same variables - set PRIMER_DB_HOST (and friends) and it renders a Postgres + pgvector config automatically; otherwise it runs the embedded-SQLite path above. For a SQLite database that survives container restarts, mount a volume at /app/data.

How it works

Primer is a stack of layers, where each layer keeps the one below it from getting cluttered:

Context discipline - tool selection, meta-tools, and internal collections keep each agent's prompt lean.
State - workspaces give agents a shared, minimal surface to hand off results without carrying history in-context.
Sequencing - directed cyclic graphs express multi-step reasoning as structure instead of one giant prompt.
Time - event-driven park-and-resume frees compute while work waits on a slow tool or a human.
Sharing - harnesses package a working configuration into a versioned, git-backed bundle.
Edges - channels, web search, and approval gates handle where agents reach outside the platform.

At runtime, requests arrive from many edges (REST/console, MCP clients, chat channels, triggers), become sessions / chats / graph runs that a worker pool claims and drives; each turn calls LLM providers, tools, workspaces, and collections, and can park on a human or event and resume later - all backed by Postgres.

Documentation

Operator docs - served at /docs when the server is running.
Agent-usage docs - docs/agents/ - how to drive a running Primer instance from an AI agent over MCP.
Developer docs - docs/dev/ - architecture patterns and subsystem references. Start at docs/dev/README.md.

Contributing

Read AGENTS.md first - it is the authoritative contributor contract (project layout, the Definition of Done, how to run the suites, and the hard rules). CONTRIBUTING.md is the human-facing summary.

uv sync
docker compose up -d postgres
# narrowed unit sweep (excludes e2e/distributed/ui_e2e):
uv run pytest tests/ -q --ignore=tests/distributed --ignore=tests/ui_e2e \
  --ignore=tests/e2e --ignore=tests/integration --ignore=tests/llm

See CODE_OF_CONDUCT.md for community expectations.

Security

Please report vulnerabilities privately - see SECURITY.md.

License

Primer is licensed under the Apache License 2.0. See LICENSE for the full text.

Project details

Release history Release notifications | RSS feed

This version

0.1.0

Jun 23, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

primer_ai-0.1.0.tar.gz (6.5 MB view details)

Uploaded Jun 23, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

primer_ai-0.1.0-py3-none-any.whl (3.5 MB view details)

Uploaded Jun 23, 2026 Python 3

File details

Details for the file primer_ai-0.1.0.tar.gz.

File metadata

Download URL: primer_ai-0.1.0.tar.gz
Upload date: Jun 23, 2026
Size: 6.5 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.14

File hashes

Hashes for primer_ai-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`b989d6af4598ba0cdbb1a43ec80d67e9cd4514cb639889000f34e0f6506c0fc1`
MD5	`d9a61dd55b3c421a134223ade0b1c40b`
BLAKE2b-256	`456d437bb1ab172921a8752275feaa7188a1002e817d87d22a62d129c37386f3`

See more details on using hashes here.

File details

Details for the file primer_ai-0.1.0-py3-none-any.whl.

File metadata

Download URL: primer_ai-0.1.0-py3-none-any.whl
Upload date: Jun 23, 2026
Size: 3.5 MB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.14

File hashes

Hashes for primer_ai-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`72922db978a53c8299d6ef973db0f6b267424b435ece53d92e45f62b799d6109`
MD5	`79b1d2bd25898a6a4c27ec79081cf786`
BLAKE2b-256	`84cb264e90b53699076be0dab58473bf64c6694b7c0d548a0410652c6ec358df`

See more details on using hashes here.

primer-ai 0.1.0

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

Why Primer

What you can build

Built for loop engineering

Quickstart

Going to Postgres (multi-process, semantic search, production)

How it works

Documentation

Contributing

Security

License

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes