Estimate the costs and running times of complex LLM workflows, experiments, and pipelines via simulation, before spending any money.

These details have not been verified by PyPI

Project description

costly

Put @costly() on the load-bearing function (typically the one that makes the LLM API call), make sure every function that calls it passes **kwargs down to it, and then call your top-level function with simulate=True and a cost_log: Costlog object. See examples.ipynb for more details.
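The **kwargs plumbing described above can be illustrated without the library itself. The following is a minimal sketch of the pattern only, not costly's implementation: toy_costly, call_model, summarize and pipeline are hypothetical names, and the "cost log" here is a plain list rather than a Costlog object.

```python
import functools


def toy_costly(func):
    """Toy stand-in for the decorator pattern; NOT the costly implementation."""

    @functools.wraps(func)
    def wrapper(*args, simulate=False, cost_log=None, **kwargs):
        # Record every call so it can be inspected afterwards.
        if cost_log is not None:
            cost_log.append({"function": func.__name__, "simulated": simulate})
        if simulate:
            # Skip the expensive call entirely and return a placeholder.
            return "<simulated output>"
        return func(*args, **kwargs)

    return wrapper


@toy_costly
def call_model(prompt: str) -> str:
    # Hypothetical load-bearing function; a real API call would go here.
    raise RuntimeError("a real API call would happen here")


def summarize(text: str, **kwargs) -> str:
    # Intermediate functions only need to forward **kwargs.
    return call_model(f"Summarize: {text}", **kwargs)


def pipeline(texts: list[str], **kwargs) -> list[str]:
    return [summarize(t, **kwargs) for t in texts]


cost_log: list[dict] = []
results = pipeline(["a", "b", "c"], simulate=True, cost_log=cost_log)
print(len(cost_log), "calls recorded; nothing was actually sent")
```

Because simulate and cost_log travel through **kwargs, only the decorated function needs to know about them; the intermediate functions stay unchanged apart from forwarding their keyword arguments.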

https://github.com/abhimanyupallavisudhir/costly

Installation

pip install costly

Usage

See examples.ipynb for a full walkthrough; some examples below.

from costly import Costlog, costly, CostlyResponse
from costly.estimators.llm_api_estimation import LLM_API_Estimation


@costly()
def chatgpt(input_string: str, model: str) -> str:
    from openai import OpenAI

    client = OpenAI()
    response = client.chat.completions.create(
        model=model, messages=[{"role": "user", "content": input_string}]
    )
    output_string = response.choices[0].message.content
    return output_string


@costly(
    input_tokens=lambda kwargs: LLM_API_Estimation.messages_to_input_tokens(
        kwargs["messages"], kwargs["model"]
    ),
)
def chatgpt_messages(messages: list[dict[str, str]], model: str) -> str:
    from openai import OpenAI

    client = OpenAI()
    response = client.chat.completions.create(model=model, messages=messages)
    output_string = response.choices[0].message.content
    return output_string


# Alternative definition of chatgpt: report the actual token usage from the API response.
@costly()
def chatgpt(input_string: str, model: str) -> str:
    from openai import OpenAI

    client = OpenAI()
    response = client.chat.completions.create(
        model=model,
        messages=[
            {"role": "user", "content": input_string},
        ],
    )

    return CostlyResponse(
        output=response.choices[0].message.content,
        cost_info={
            "input_tokens": response.usage.prompt_tokens,
            "output_tokens": response.usage.completion_tokens,
        },
    )  # callers still receive only the output, not the whole CostlyResponse object
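The cost_info dictionary above reports input_tokens and output_tokens. As a rough illustration of how such counts might be turned into a dollar figure, here is a small sketch. The Pricing class, the estimate_cost_usd function, the model name and the prices are all hypothetical placeholders; they are not costly's pricing table or any vendor's real prices.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Pricing:
    """Price in USD per one million tokens (placeholder values only)."""

    input_per_million: float
    output_per_million: float


# Made-up prices for a made-up model; replace with real figures before use.
HYPOTHETICAL_PRICING = {"example-model": Pricing(1.0, 2.0)}


def estimate_cost_usd(model, input_tokens, output_tokens, pricing=HYPOTHETICAL_PRICING):
    """Combine token counts with per-token prices into a single cost."""
    p = pricing[model]
    total = input_tokens * p.input_per_million + output_tokens * p.output_per_million
    return total / 1_000_000


cost = estimate_cost_usd("example-model", input_tokens=1_000, output_tokens=500)
print(f"${cost:.4f}")
```

Input and output tokens are priced separately because providers usually charge different rates for each.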

Testing

poetry run pytest -s -m "not slow"
poetry run pytest -s -m "slow"

Tests for instructor currently fail.

TODO

Make it work with async
Support for locally run LLMs -- ideally need a cost & time estimator that takes into account your machine details, GPU pricing etc.
Decide on and document the best way to "propagate" description (for breakdown purposes) through function calls. Options: have the user manually write def f(...): ... g(description=kwargs.get("description") + ["f"]); add a @description("blabla") decorator; or add a @description decorator that automatically appends the function name and arguments to description.
Better solution for token counting for Chat messages (search HACK in the repo)
Make instructor tests pass (see https://community.openai.com/t/how-to-calculate-the-tokens-when-using-function-call/266573/11)
Support more models
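To make the description-propagation question in the list above concrete, here is a sketch of two of the options: manual propagation and a decorator that appends the function name automatically. This is only an illustration of the open design question, not something the library provides; with_description, leaf, f_manual, f_auto and outer are hypothetical names.

```python
import functools


def with_description(func):
    """Hypothetical decorator: append this function's name to `description`."""

    @functools.wraps(func)
    def wrapper(*args, description=None, **kwargs):
        trail = list(description or []) + [func.__name__]
        return func(*args, description=trail, **kwargs)

    return wrapper


def leaf(x, description=None, **kwargs):
    # Stand-in for the load-bearing function; it just reports the trail it received.
    return description


def f_manual(x, **kwargs):
    # Manual option: each caller extends the trail by hand.
    description = (kwargs.pop("description", None) or []) + ["f_manual"]
    return leaf(x, description=description, **kwargs)


@with_description
def f_auto(x, **kwargs):
    # Automatic option: the decorator has already extended kwargs["description"].
    return leaf(x, **kwargs)


@with_description
def outer(x, **kwargs):
    return f_auto(x, **kwargs)


print(f_manual(1, description=["top"]))
print(outer(1))
```

The manual version is explicit but easy to forget; the decorator version keeps the function bodies clean, at the price of one more decorator on every function in the call chain.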

The instructor tests don't really pass, but I can kinda live with this. Let me know if you have a good way to count tokens for messages that include tool calls (I'm using this).

FAILED tests/test_estimators/test_llm_api_estimation.py::test_estimate_contains_exact_instructor[PERSONINFO_gpt-4o] - AssertionError: ['Input tokens estimate 65 not within 20pc of truth 83']
FAILED tests/test_estimators/test_llm_api_estimation.py::test_estimate_contains_exact_instructor[PERSONINFO_gpt-4o-mini] - AssertionError: ['Input tokens estimate 65 not within 20pc of truth 83']
FAILED tests/test_estimators/test_llm_api_estimation.py::test_estimate_contains_exact_instructor[PERSONINFO_gpt-4-turbo] - AssertionError: ['Input tokens estimate 65 not within 20pc of truth 85']
FAILED tests/test_estimators/test_llm_api_estimation.py::test_estimate_contains_exact_instructor[PERSONINFO_gpt-3.5-turbo] - AssertionError: ['Input tokens estimate 65 not within 20pc of truth 85']
FAILED tests/test_estimators/test_llm_api_estimation.py::test_estimate_contains_exact_instructor[FOOMODEL_gpt-4o] - AssertionError: ['Input tokens estimate 77 not within 20pc of truth 108']
FAILED tests/test_estimators/test_llm_api_estimation.py::test_estimate_contains_exact_instructor[FOOMODEL_gpt-4o-mini] - AssertionError: ['Input tokens estimate 77 not within 20pc of truth 108']
FAILED tests/test_estimators/test_llm_api_estimation.py::test_estimate_contains_exact_instructor[FOOMODEL_gpt-4-turbo] - AssertionError: ['Input tokens estimate 77 not within 20pc of truth 113']
FAILED tests/test_estimators/test_llm_api_estimation.py::test_estimate_contains_exact_instructor[FOOMODEL_gpt-3.5-turbo] - AssertionError: ['Input tokens estimate 77 not within 20pc of truth 113']
FAILED tests/test_estimators/test_llm_api_estimation.py::test_estimate_contains_exact_instructor[BARMODEL_gpt-4o] - AssertionError: ['Input tokens estimate 70 not within 20pc of truth 168']
FAILED tests/test_estimators/test_llm_api_estimation.py::test_estimate_contains_exact_instructor[BARMODEL_gpt-4o-mini] - AssertionError: ['Input tokens estimate 70 not within 20pc of truth 168']
FAILED tests/test_estimators/test_llm_api_estimation.py::test_estimate_contains_exact_instructor[BARMODEL_gpt-4-turbo] - AssertionError: ['Input tokens estimate 70 not within 20pc of truth 178']
FAILED tests/test_estimators/test_llm_api_estimation.py::test_estimate_contains_exact_instructor[BARMODEL_gpt-4] - AssertionError: ['Input tokens estimate 70 not within 20pc of truth 126']
FAILED tests/test_estimators/test_llm_api_estimation.py::test_estimate_contains_exact_instructor[BARMODEL_gpt-3.5-turbo] - AssertionError: ['Input tokens estimate 70 not within 20pc of truth 178']
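The failures above all say the estimate is "not within 20pc of truth". A sketch of what such a check could look like is below, assuming the error is measured relative to the true token count; the actual test in the repository may define the tolerance differently, and within_tolerance and relative_error are hypothetical helper names.

```python
def relative_error(estimate: float, truth: float) -> float:
    """Absolute error as a fraction of the true value."""
    return abs(estimate - truth) / truth


def within_tolerance(estimate: float, truth: float, tolerance: float = 0.20) -> bool:
    """True if the estimate is within `tolerance` (a fraction) of the truth."""
    return abs(estimate - truth) <= tolerance * truth


# (estimate, truth) pairs taken from the failure log above.
cases = [(65, 83), (77, 108), (70, 168)]
for est, truth in cases:
    print(est, truth, f"{relative_error(est, truth):.1%}", within_tolerance(est, truth))
```

Under this definition the first case misses only narrowly (an error of about 22 percent), while the tool-heavy BARMODEL case underestimates by more than half.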

Project details

Release history

Version	Release date	Notes
2.3.7	Jan 10, 2026
2.3.6	Jan 10, 2026
2.3.5	Jan 8, 2026
2.3.4	Jan 29, 2025
2.3.3	Jan 24, 2025
2.3.2	Jan 24, 2025
2.3.1	Jan 24, 2025
2.3.0	Jan 24, 2025	Yanked (reason: buggy)
2.2.4	Dec 2, 2024
2.2.3	Dec 2, 2024
2.2.2	Nov 10, 2024
2.2.1	Nov 6, 2024
2.2.0	Oct 23, 2024
2.1.0	Oct 6, 2024
2.0.0	Oct 5, 2024
1.1.2	Sep 25, 2024
1.1.1	Sep 23, 2024
1.1.0	Sep 22, 2024
1.0.2	Sep 20, 2024
1.0.1	Sep 18, 2024
1.0.0	Sep 17, 2024
0.1.13	Sep 9, 2024
0.1.12	Sep 9, 2024
0.1.11	Sep 9, 2024
0.1.10	Sep 9, 2024
0.1.9	Sep 6, 2024
0.1.8	Sep 6, 2024	This version
0.1.7	Sep 5, 2024
0.1.6	Aug 31, 2024
0.1.5	Aug 31, 2024
0.1.4	Aug 31, 2024
0.1.3	Aug 30, 2024
0.1.1	Aug 29, 2024
0.1.0	Aug 29, 2024

Download files

Source Distribution

costly-0.1.8.tar.gz (11.5 kB)

Uploaded Sep 6, 2024 Source

Built Distribution

costly-0.1.8-py3-none-any.whl (13.5 kB)

Uploaded Sep 6, 2024 Python 3

File details

Details for the file costly-0.1.8.tar.gz.

File metadata

Download URL: costly-0.1.8.tar.gz
Upload date: Sep 6, 2024
Size: 11.5 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: poetry/1.8.2 CPython/3.11.0 Windows/10

File hashes

Hashes for costly-0.1.8.tar.gz
Algorithm	Hash digest
SHA256	`684554ff971d97bc7f466036f140542c727c385d885d44016d9611415e80f4af`
MD5	`e2a2ee4f892c4bfbdbe3722e413f3811`
BLAKE2b-256	`11353279fdd6ebb4cfab279269b221d7802a4aae7cea88e9b8d566522c34c834`

File details

Details for the file costly-0.1.8-py3-none-any.whl.

File metadata

Download URL: costly-0.1.8-py3-none-any.whl
Upload date: Sep 6, 2024
Size: 13.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: poetry/1.8.2 CPython/3.11.0 Windows/10

File hashes

Hashes for costly-0.1.8-py3-none-any.whl
Algorithm	Hash digest
SHA256	`c4693736b6e11553230b0df98fe794a191194e98a0fea6c9496e3a5411dce090`
MD5	`2fb3163dd190498eb6a6e0f8b922b760`
BLAKE2b-256	`b1ecc500b3379e91bc1eb9ae5a432e55878f4677045825f53bdf2a8458e03451`
