Drop-in cost intelligence for Python AI agent frameworks.

RunCost 💸

Run a 1,000-agent simulation for $2 instead of $200. Drop-in cost intelligence for Python AI agent frameworks.

The Problem

You built a multi-agent system. It works beautifully in testing.

Then you ran it for real and saw the bill.

A 500-agent simulation on GPT-4o costs $40–$80 per run. A recursive loop that nobody catches costs $200 before you notice. An overnight batch job you forgot about costs $600 by morning.

Nobody warns you. No framework stops it.

Multi-agent AI is the future. Uncontrolled spend is the tax on building it.

The Fix: One Line

# Before RunCost
from openai import OpenAI
client = OpenAI()

# After RunCost — nothing else changes
from runcost import OpenAI
client = OpenAI()

Drop it in. That's it. Your existing code works exactly as before — except now every API call is intercepted, measured, and intelligently routed before it costs you money.

What Happens When You Run It

RunCost  //  Live Agent Cost Monitor          cost.run
──────────────────────────────────────────────────────
  ✓  researcher_01  →  llama-3-8b   $0.001    11ms
  ✓  analyst_04     →  gpt-4o       $0.047   780ms
  ✓  writer_02      →  mistral-7b   $0.002    43ms
  ✗  crawler_07     →  BLOCKED      $0.000  loop@13
  ✓  researcher_14  →  llama-3-8b   $0.001     9ms
──────────────────────────────────────────────────────
  Spent:    $1.82 / $5.00   [====      ]  36%
  Saved:   $41.30            Efficiency: 95.7%
  Blocked:  3 loops          Calls:       847
──────────────────────────────────────────────────────

RunCost intercepts every LLM call and:

- Estimates the cost before spending a dollar
- Routes simple tasks to cheap models (Groq, Llama 3, Mistral) automatically
- Routes reasoning-heavy tasks to GPT-4o or Claude only when needed
- Detects recursive loops and kills them before they drain your account
- Enforces hard spending limits: when you hit your cap, everything stops
- Logs every call to a local SQLite database
- Shows a live terminal dashboard of spend vs. savings in real time
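
How loops are detected is not spelled out here. One plausible approach, sketched below under assumptions, is to fingerprint each agent's request and block once the same fingerprint has repeated past a threshold. `LoopDetector`, `max_repeats`, and the hashing scheme are hypothetical names for illustration, not RunCost's actual API.

```python
import hashlib
import json
from collections import defaultdict


class LoopDetector:
    """Illustrative only: flags an agent that keeps sending the same request."""

    def __init__(self, max_repeats: int = 3):
        self.max_repeats = max_repeats  # hypothetical threshold
        self._counts = defaultdict(int)

    def should_block(self, agent_id: str, messages: list) -> bool:
        # Fingerprint the agent together with its exact message payload.
        payload = json.dumps(messages, sort_keys=True).encode("utf-8")
        key = (agent_id, hashlib.sha256(payload).hexdigest())
        self._counts[key] += 1
        return self._counts[key] > self.max_repeats


detector = LoopDetector(max_repeats=3)
msgs = [{"role": "user", "content": "fetch page 1"}]
decisions = [detector.should_block("crawler_07", msgs) for _ in range(5)]
print(decisions)  # [False, False, False, True, True]
```

An exact-match fingerprint only catches verbatim repeats; a real detector would likely also need to handle near-duplicate prompts and call-depth limits.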

The Numbers

Same simulation. Same output quality. Roughly 10x to 50x cheaper, depending on the workload.

Workload	Without RunCost	With RunCost	Saved
1,000-agent simulation	~$180-$200	~$2-$4	~98%
500-agent CrewAI workflow	~$40-$80	~$4-$8	~90%
AutoGen research pipeline	~$15-$20	~$1-$2	~90%
Recursive loop (caught)	$200+	$0.00	100%
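
The "Saved" column follows from the two cost columns. As a quick check of the arithmetic, the snippet below plugs in the range endpoints from the table; the helper name `saved_pct` is ours and is not part of RunCost.

```python
def saved_pct(without_usd: float, with_usd: float) -> float:
    """Percentage saved relative to the cost without routing."""
    return (without_usd - with_usd) / without_usd * 100


# 1,000-agent simulation: the two extreme combinations of the table ranges
print(round(saved_pct(200, 2), 1))  # 99.0
print(round(saved_pct(180, 4), 1))  # 97.8

# 500-agent CrewAI workflow
print(round(saved_pct(40, 4), 1))   # 90.0
print(round(saved_pct(80, 8), 1))   # 90.0
```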

Install

pip install runcost

Supported frameworks: OpenAI SDK · CrewAI · LangGraph · AutoGen · LangChain · MiroFish · any OpenAI-compatible client

Quick Start

from runcost import OpenAI, BudgetConfig

config = BudgetConfig(
    hard_limit_usd=5.00,     # Hard stop — never exceed this per run
    warn_at_usd=2.00,        # Alert when approaching limit
    auto_route=True,          # Auto-route cheap tasks to Llama/Groq
    block_loops=True,         # Kill recursive agent loops instantly
    log_to_db=True            # Save full history to runcost.db
)

client = OpenAI(budget=config)

# Use exactly as normal — RunCost works silently underneath
response = client.chat.completions.create(
    model="gpt-4o",
    messages=[{"role": "user", "content": "Analyze these 500 documents"}]
)
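
The roadmap names a `BudgetExceededError` for hard limits, but how RunCost raises it is not shown here. The sketch below is a self-contained illustration of the warn-then-stop behaviour that `warn_at_usd` and `hard_limit_usd` describe. `BudgetGuard` and its `charge` method are hypothetical, and the error class is defined locally rather than imported from `runcost`.

```python
class BudgetExceededError(RuntimeError):
    """Local stand-in; the class shipped with RunCost may differ."""


class BudgetGuard:
    """Hypothetical helper: tracks spend and refuses calls past the cap."""

    def __init__(self, hard_limit_usd: float, warn_at_usd: float):
        self.hard_limit_usd = hard_limit_usd
        self.warn_at_usd = warn_at_usd
        self.spent_usd = 0.0
        self.warned = False

    def charge(self, estimated_cost_usd: float) -> None:
        # Refuse the call before spending if it would cross the cap.
        if self.spent_usd + estimated_cost_usd > self.hard_limit_usd:
            raise BudgetExceededError(
                f"${self.spent_usd:.2f} spent, cap is ${self.hard_limit_usd:.2f}"
            )
        self.spent_usd += estimated_cost_usd
        # Warn once when the soft threshold is reached.
        if not self.warned and self.spent_usd >= self.warn_at_usd:
            self.warned = True
            print(f"warning: ${self.spent_usd:.2f} of ${self.hard_limit_usd:.2f} used")


guard = BudgetGuard(hard_limit_usd=5.00, warn_at_usd=2.00)
blocked = False
try:
    for _ in range(10):
        guard.charge(0.75)  # pretend each call is estimated at $0.75
except BudgetExceededError:
    blocked = True

print(blocked, guard.spent_usd)
```

Checking the estimate before the call, rather than the bill after it, is what keeps the run from overshooting the cap.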

How Routing Works

RunCost classifies each call by complexity before sending it:

Complexity	Model Used	Typical Cost
Simple: formatting, lookup, summarization	Groq Llama-3 8B	~$0.001
Medium: research, extraction, classification	Mistral 7B	~$0.002
Complex: reasoning, code, multi-step logic	GPT-4o / Claude	~$0.04–0.09
Detected loop / budget exceeded	BLOCKED	$0.000

You can override routing per agent, per task type, or per model preference.
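
The classifier itself is not described here. As a rough sketch of the idea, the example below maps a prompt to one of the three tiers in the table using keyword heuristics, with a per-agent override. The keyword lists, the `classify` and `route` functions, and the model identifier strings are all assumptions made for illustration; RunCost's real classifier may work quite differently.

```python
from typing import Optional

# Tier -> model, mirroring the table above (identifier strings are illustrative).
TIER_MODELS = {
    "simple": "llama-3-8b",
    "medium": "mistral-7b",
    "complex": "gpt-4o",
}

# Hypothetical keyword hints per tier, checked from most to least expensive.
TIER_KEYWORDS = {
    "complex": ("reason", "prove", "debug", "write code", "step by step"),
    "medium": ("research", "extract", "classify"),
}


def classify(prompt: str) -> str:
    text = prompt.lower()
    for tier in ("complex", "medium"):
        if any(keyword in text for keyword in TIER_KEYWORDS[tier]):
            return tier
    return "simple"  # formatting, lookup, summarization, anything unmatched


def route(agent_id: str, prompt: str, overrides: Optional[dict] = None) -> str:
    # A per-agent override wins over the classifier.
    if overrides and agent_id in overrides:
        return overrides[agent_id]
    return TIER_MODELS[classify(prompt)]


print(route("writer_02", "Summarize this paragraph"))            # llama-3-8b
print(route("analyst_04", "Extract the dates from this email"))  # mistral-7b
print(route("analyst_04", "Debug this function step by step"))   # gpt-4o
print(route("writer_02", "Summarize this", {"writer_02": "gpt-4o"}))  # gpt-4o
```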

The Dashboard

runcost dashboard

Opens a live terminal view showing real-time spend, savings, active agents, blocked loops, and full call history. Dark mode. No browser required.
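
The call history can, in principle, also be read straight from the SQLite log. The schema is not documented here, so the sketch below builds an in-memory database with a hypothetical `calls` table (`agent`, `model`, `cost_usd`) and sums spend per agent. Check the real `runcost.db` schema before adapting it.

```python
import sqlite3

# In-memory stand-in for runcost.db; table and column names are assumptions.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE calls (agent TEXT, model TEXT, cost_usd REAL)")
conn.executemany(
    "INSERT INTO calls VALUES (?, ?, ?)",
    [
        ("researcher_01", "llama-3-8b", 0.001),
        ("analyst_04", "gpt-4o", 0.047),
        ("writer_02", "mistral-7b", 0.002),
        ("analyst_04", "gpt-4o", 0.047),
    ],
)

# Spend per agent, most expensive first.
rows = conn.execute(
    "SELECT agent, COUNT(*), ROUND(SUM(cost_usd), 3) "
    "FROM calls GROUP BY agent ORDER BY SUM(cost_usd) DESC"
).fetchall()

for agent, n_calls, total in rows:
    print(f"{agent:<15} {n_calls} calls  ${total:.3f}")
conn.close()
```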

Why Open Source?

Because every developer deserves to see exactly what their agents are spending — before it's too late.

The core engine is AGPL-3.0. Run it yourself, audit it, fork it, build on it.

RunCost Pro (coming soon): team dashboards · multi-project tracking · SSO · compliance exports · Slack/Discord alerts · SLA support

Roadmap

✅ OpenAI SDK wrapper
✅ Real-time cost tracking per call
✅ Hard budget limits with BudgetExceededError
✅ SQLite call logging
✅ Terminal dashboard (runcost dashboard)
✅ Web dashboard (runcost server)
✅ Pre-flight cost calculator
✅ DeepSeek support
✅ Grok / xAI support
✅ Auto-routing (automatic cheap model selection)
✅ Recursive loop detection
✅ Slack / Discord spend alerts
🔜 Claude (Anthropic SDK) support
🔜 Gemini (Google SDK) support
🔜 CrewAI native plugin
🔜 LangGraph native plugin
🔜 AgentLedger: audit trail for every agent action

If RunCost saved you money, a ⭐ on GitHub costs nothing and means everything.

License

AGPL-3.0 — free for individuals and open source projects.

Commercial license available for enterprise deployments.

Release history

Version	Released
0.4 (this version)	Mar 21, 2026
0.3.2	Mar 21, 2026
0.3.1	Mar 21, 2026
0.3	Mar 21, 2026
0.2.3	Mar 21, 2026
0.2.2	Mar 21, 2026
0.2.1	Mar 21, 2026
0.2.0	Mar 21, 2026
0.1.6	Mar 17, 2026
0.1.5	Mar 17, 2026
0.1.4	Mar 17, 2026
0.1.3	Mar 17, 2026
0.1.2	Mar 17, 2026
0.1.1	Mar 17, 2026
0.1.0	Mar 17, 2026

Download files

Source Distribution

runcost-0.4.tar.gz (27.3 kB), uploaded Mar 21, 2026, Source

Built Distribution

runcost-0.4-py3-none-any.whl (26.6 kB), uploaded Mar 21, 2026, Python 3

File details

Details for the file runcost-0.4.tar.gz.

File metadata

Download URL: runcost-0.4.tar.gz
Upload date: Mar 21, 2026
Size: 27.3 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.3

File hashes

Hashes for runcost-0.4.tar.gz
Algorithm	Hash digest
SHA256	`a9bb69ab7a410e2f000e6817e03cc6aad8f07a5297a6c4488e176c56a4329951`
MD5	`1b189c2bee9e9d50ada3d547752db0cb`
BLAKE2b-256	`1e56022ddc24f0cc9e8b5fe211221d8492f3f59943c42ee1a07bee0824112b1a`

File details

Details for the file runcost-0.4-py3-none-any.whl.

File metadata

Download URL: runcost-0.4-py3-none-any.whl
Upload date: Mar 21, 2026
Size: 26.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.3

File hashes

Hashes for runcost-0.4-py3-none-any.whl
Algorithm	Hash digest
SHA256	`e8e1409fbc8119bc1a22e640ddae778139e01c57e4fb4655cd3929967ffbcae8`
MD5	`b82ad6a1d7417dbaf643093851c99edd`
BLAKE2b-256	`ffd9ae7f7f701c527a4f52c0f4c3afb77a21ac84f3e62d11d1541f9412393aa9`
