Skip to main content

Alpha cost ledger utility for LLM and agent API responses.

Project description

RunCost

RunCost is a small alpha utility for answering one question:

What did this LLM or agent API call cost, and why?

It turns provider responses, framework usage objects, or normalized usage into a componentized cost ledger with input, cached input, output, reasoning, tool units, discounts, price sources, and warnings.

Install

Current alpha install paths:

# Python from the repo
python3 -m pip install git+https://github.com/adamallcock/runcost.git

# JavaScript/TypeScript from a checkout
npm pack ./packages/javascript/core
npm install ./runcost-0.1.1.tgz

# Go
go get github.com/adamallcock/runcost/packages/go/ledger

First registry release commands, once publishing is enabled:

pip install runcost-ai
npm install runcost
go get github.com/adamallcock/runcost/packages/go/ledger

The Python distribution name is runcost-ai; the import package and CLI are runcost.

One-Minute Examples

Python:

from runcost import from_response

response = {
    "model": "gpt-4.1-mini-2025-04-14",
    "usage": {
        "input_tokens": 36,
        "input_tokens_details": {"cached_tokens": 6},
        "output_tokens": 87,
        "output_tokens_details": {"reasoning_tokens": 12},
    },
}

price_cards = [{
    "schema_version": "0.1",
    "id": "openai:gpt-4.1-mini:example",
    "provider": "openai",
    "surface": "openai.responses",
    "model": "gpt-4.1-mini",
    "aliases": ["gpt-4.1-mini-2025-04-14"],
    "components": [
        {"usage_component": "input_uncached_tokens", "unit": "token", "price": {"amount": "0.40", "currency": "USD", "per": "1000000"}},
        {"usage_component": "input_cache_read_tokens", "unit": "token", "price": {"amount": "0.10", "currency": "USD", "per": "1000000"}},
        {"usage_component": "output_text_tokens", "unit": "token", "price": {"amount": "1.60", "currency": "USD", "per": "1000000"}},
        {"usage_component": "output_reasoning_tokens", "unit": "token", "price": {"amount": "1.60", "currency": "USD", "per": "1000000"}},
    ],
    "source": {"name": "example"},
}]

ledger = from_response(
    response,
    provider="openai",
    surface="openai.responses",
    model="gpt-4.1-mini",
    price_cards=price_cards,
)

print(ledger["total"])
print(ledger["components"])
print(ledger["warnings"])

TypeScript:

import { fromResponse } from "runcost";

// Using the same response and priceCards shape as the Python example above.
const ledger = fromResponse(response, {
  provider: "openai",
  surface: "openai.responses",
  model: "gpt-4.1-mini",
  priceCards
});

console.log(ledger.total);
console.log(ledger.components);
console.log(ledger.warnings);

Go:

package main

import (
    "fmt"

    ledger "github.com/adamallcock/runcost/packages/go/ledger"
)

func main() {
    cost := ledger.FromResponse(
        ledger.Object{
            "model": "gpt-4.1-mini-2025-04-14",
            "usage": ledger.Object{
                "input_tokens":  36,
                "output_tokens": 87,
            },
        },
        ledger.Object{
            "provider":    "openai",
            "surface":     "openai.responses",
            "model":       "gpt-4.1-mini",
            "price_cards": []any{
                ledger.Object{
                    "schema_version": "0.1",
                    "id":             "openai:gpt-4.1-mini:example",
                    "provider":       "openai",
                    "surface":        "openai.responses",
                    "model":          "gpt-4.1-mini",
                    "aliases":        []any{"gpt-4.1-mini-2025-04-14"},
                    "components": []any{
                        ledger.Object{
                            "usage_component": "input_uncached_tokens",
                            "unit":            "token",
                            "price": ledger.Object{"amount": "0.40", "currency": "USD", "per": "1000000"},
                        },
                        ledger.Object{
                            "usage_component": "output_text_tokens",
                            "unit":            "token",
                            "price": ledger.Object{"amount": "1.60", "currency": "USD", "per": "1000000"},
                        },
                    },
                    "source": ledger.Object{"name": "example"},
                },
            },
        },
    )

    fmt.Println(cost["total"])
}

Already have normalized usage? Use the deterministic calculator directly:

from runcost import calculate_cost

ledger = calculate_cost(
    usage_ledger={
        "schema_version": "0.1",
        "provider": "openai",
        "surface": "openai.responses",
        "model": {"requested": "gpt-4.1-mini"},
        "components": [
            {"name": "input_uncached_tokens", "quantity": "30", "unit": "token"},
            {"name": "output_text_tokens", "quantity": "75", "unit": "token"},
        ],
    },
    price_cards=price_cards,
)

Main APIs

Job Python JavaScript/TypeScript Go
Price normalized usage calculate_cost(...) calculateCost(options) CalculateCost(options)
Price a provider response from_response(...) fromResponse(response, options) FromResponse(response, options)
Aggregate call ledgers aggregate_cost_ledgers(...) aggregateCostLedgers(options) AggregateCostLedgers(...)
Use framework outputs from_langsmith_run(...), track_langchain_costs(...), and more fromVercelAISDKStreamFinish(...), createRunCostVercelOnFinish(...), and more FromLangSmithRun(...), FromSemanticKernelTelemetry(...), and more
Load price sources price_cards_from_json_file(...), price_cards_from_openrouter_models(...) priceCardsFromJSONFile(...), priceCardsFromOpenRouterModels(...) PriceCardsFromJSONFile(...), PriceCardsFromOpenRouterModels(...)
Add custom prices Pass price_cards Pass priceCards Pass price_cards in options
Apply discounts Pass discount_policies Pass discountPolicies Pass discount_policies in options
Audit decisions debug_trace=True debugTrace: true "debug_trace": true
Fail on ambiguity mode="strict" mode: "strict" mode: "strict"
CLI checks runcost price-cards, runcost fixture-check N/A N/A

Supported Inputs

Fixture-backed surfaces include OpenAI Responses and Chat Completions, Anthropic Messages, OpenRouter, Gemini and Vertex generateContent, AWS Bedrock Converse, Cohere Chat and Rerank, OpenAI-compatible providers such as Groq, xAI, Mistral, DeepSeek, Azure OpenAI, and Hugging Face Inference Providers, plus selected framework objects from LangChain, Vercel AI SDK, OpenAI Agents SDK, LlamaIndex, Haystack, LiteLLM, AutoGen/AG2, LangSmith, Semantic Kernel, and OpenRouter SDK paths.

See supported surfaces for the current matrix.

Custom Prices And Discounts

RunCost treats provider pricing as data. You can pass user price cards for private rates, exact aliases, service tiers, long-context prices, historical effective dates, tool units, or internal billing units.

discounts = [{
    "schema_version": "0.1",
    "id": "openai-contract-4pct",
    "match": {"provider": "openai"},
    "adjustment": {"type": "percentage_discount", "value": "4"},
}]

The returned ledger records selected price sources, applied discounts, and any warning that prevents the total from being fully explained.

Fixtures are behavioral conformance tests, not a complete model-price database. Use source adapters and reviewed source-cache snapshots for upstream catalog data; see price data strategy.

Warnings

RunCost is designed to be boring. When it cannot confidently price something, it returns a structured warning such as unknown_model, component_unpriced, price_stale, stream_usage_missing, or provider_reported_cost_mismatch. Use strict mode in tests or reconciliation flows when warnings should fail.

CLI

The Python package installs a lightweight CLI:

runcost price-cards --source-type user-pricing --input prices.json
runcost fixture-check fixtures/my-case.json

Read Next

Status

RunCost is alpha software. The core behavior is fixture-backed across Python, JavaScript/TypeScript, and Go, but registry publishing is still held until the release gates are complete. Smoke costs may use sample price cards; use provider exports or dashboard reconciliation before treating a total as invoice-exact.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

runcost_ai-0.1.1.tar.gz (37.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

runcost_ai-0.1.1-py3-none-any.whl (35.3 kB view details)

Uploaded Python 3

File details

Details for the file runcost_ai-0.1.1.tar.gz.

File metadata

  • Download URL: runcost_ai-0.1.1.tar.gz
  • Upload date:
  • Size: 37.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.5

File hashes

Hashes for runcost_ai-0.1.1.tar.gz
Algorithm Hash digest
SHA256 c7369fbcfef6005ee8682620796689cda5ba7836e146731f7633bc047c104592
MD5 d2ee8da2b1bf76c70d9673cdb75294f9
BLAKE2b-256 2594c22a6b19bbb4bd89f034b3141b55cf0de031a293e2f3277235c537bc7dd9

See more details on using hashes here.

File details

Details for the file runcost_ai-0.1.1-py3-none-any.whl.

File metadata

  • Download URL: runcost_ai-0.1.1-py3-none-any.whl
  • Upload date:
  • Size: 35.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.5

File hashes

Hashes for runcost_ai-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 0159f913314b674f520c3f0dedceb5c46e876a8b7d01c47e0d1204fae03be0c7
MD5 f6bd44786e531d59e8dad44404790710
BLAKE2b-256 94949c8d6c33303e0ed8a12cf97380e86d00ceced3123c7d35a7557c7fbfd662

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page