A lightweight Python library that automatically discovers, evaluates, and tunes parameters of multi-provider LLM pipelines to balance quality, cost, and latency.

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

joaoplay

These details have not been verified by PyPI

Project description

Octuner - Multi-Provider LLM Optimizer

Optimize LLM providers, models, and parameters — without the guesswork.

Octuner is a lightweight library that solves the decision-making process when integrating with LLMs, especially in multi-step model chaining scenarios.

Why Octuner?

Building LLM applications often feels like solving a puzzle:

Which provider? OpenAI, Gemini, Anthropic… or self-hosted (Ollama, vLLM, etc.)?
Which model? GPT-4o, Gemini Pro, Claude…?
Which parameters? Temperature, top-p, max_tokens…?
How to balance quality, cost, and latency?

Things get harder with model chaining, where each step depends on the previous one:

Input → [LLM A] → Intermediate Result → [LLM B] → Final Output

Manual trial-and-error leads to inconsistent performance, wasted budget, and provider lock-in. Octuner removes the guesswork.

Quick Start

Build a tiny sentiment chain that first explains why, then outputs a single-word label. You'll pass an explicit YAML config path so it's ready for optimization.

1. Create your model chain

from octuner import MultiProviderTunableLLM

class SentimentChain:
    def __init__(self, config_file: str):
        # Reason step (clear explanation)
        self.reasoner = MultiProviderTunableLLM(
            config_file,
            default_provider="openai",
            default_model="gpt-4o-mini",
        )
        # Label step (concise single-word output)
        self.labeler = MultiProviderTunableLLM(
            config_file,
            default_provider="gemini",
            default_model="gemini-1.5-flash",
        )

    def _build_reason_prompt(self, text: str) -> str:
        return (
            "Explain the sentiment (positive/negative/neutral) of the text below. "
            "Keep the reasoning short and specific.\n\n"
            f"Text: {text}\n"
        )

    def _build_label_prompt(self, reasoning: str) -> str:
        return (
            "Given the reasoning below, respond with only one word: "
            "positive | negative | neutral.\n\n"
            f"Reasoning:\n{reasoning}\n"
        )

    def predict(self, text: str) -> dict:
        reason = self.reasoner.call(self._build_reason_prompt(text)).text
        label = self.labeler.call(self._build_label_prompt(reason)).text.strip().lower()
        return {"sentiment": label, "why": reason}

2. Add a dataset and metric

dataset = [
    {"input": "I love this!", "target": {"sentiment": "positive"}},
    {"input": "This is awful.", "target": {"sentiment": "negative"}},
    {"input": "It's fine.", "target": {"sentiment": "neutral"}},
]

def metric(output, target):
    return 1.0 if output["sentiment"] == target["sentiment"] else 0.0

3. Optimize

from octuner import AutoTuner, apply_best

chain = SentimentChain("configs/llm.yaml")  # explicit YAML config path

tuner = AutoTuner.from_component(
    component=chain,
    entrypoint=lambda c, x: c.predict(x),
    dataset=dataset,
    metric=metric,
)

# Focus on the most impactful knobs first
tuner.include([
    "reasoner.provider_model", "reasoner.temperature",
    "labeler.provider_model", "labeler.temperature",
])

result = tuner.search(max_trials=12, mode="pareto")
result.save_best("optimized_sentiment_chain.yaml")

apply_best(chain, "optimized_sentiment_chain.yaml")
print(chain.predict("The new UI is a joy to use."))

Key Features

Multi-Provider Optimization

Automatically discover the best combination of:

Providers: OpenAI, Gemini, and more
Models: GPT-4o, GPT-4o-mini, Gemini Pro, etc.
Parameters: temperature, top_p, max_tokens, web search
Capabilities: Web search, function calling, etc.

Multiple Optimization Modes

Pareto: Balance quality, cost, and latency (default)
Constrained: Maximize quality within cost/latency limits
Scalarized: Optimize weighted combination of metrics
Quality-focused: Maximize performance regardless of cost/time
Cost-focused: Minimize spending while meeting quality thresholds
Speed-focused: Optimize for fastest response within quality bounds

Flexible Parameter Control

providers:
  openai:
    model_capabilities:
      gpt-4o-mini:
        supported_parameters: [temperature, top_p, max_tokens]
        parameter_ranges:
          temperature: [0.0, 2.0]
          max_tokens: [50, 4000]
        default_parameters:
          temperature: 0.7
          max_tokens: 1000

Web Search Integration

OpenAI: Built-in web search capabilities
Gemini: Native Google grounding tool for web context
Tunable: Let optimization decide when web search improves performance

Configuration Templates

Choose from ready-to-use templates in config_templates/:

openai_basic.yaml - Basic OpenAI setup (GPT-3.5, GPT-4o, GPT-4o-mini)
gemini_basic.yaml - Basic Gemini setup (cost-effective)
multi_provider.yaml - Multiple providers (let optimization choose)

Simple Configuration Example

# Copy a starter template
cp config_templates/openai_basic.yaml my_llm_config.yaml

# Set your API key
export OPENAI_API_KEY=sk-your-key-here

Use in Your Code

from octuner import MultiProviderTunableLLM

# Explicit configuration - no hidden global state
llm = MultiProviderTunableLLM(config_file="my_llm_config.yaml")
response = llm.call("What is the capital of France?")
print(response.text)

Installation

pip install octuner

Requirements

Python 3.10+
Optuna 3.0+
PyYAML 6.0+

License

MIT License - see LICENSE file for details.

Contributing

Contributions welcome! Please check the issues and pull requests for current discussions.

Octuner helps developers build better LLM applications by systematically optimizing the quality vs cost vs latency triangle through explicit configuration management and data-driven parameter tuning.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

joaoplay

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.1.2.post1

Oct 17, 2025

0.1.2

Oct 17, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

octuner-0.1.2.post1.tar.gz (46.3 kB view details)

Uploaded Oct 17, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

octuner-0.1.2.post1-py3-none-any.whl (57.3 kB view details)

Uploaded Oct 17, 2025 Python 3

File details

Details for the file octuner-0.1.2.post1.tar.gz.

File metadata

Download URL: octuner-0.1.2.post1.tar.gz
Upload date: Oct 17, 2025
Size: 46.3 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for octuner-0.1.2.post1.tar.gz
Algorithm	Hash digest
SHA256	`ab45e7558d44d2c65024db7215b87e3adaefa5476781ea605a1735957bedefc0`
MD5	`c8ecca30e06e4f84b60c29ab229a2fb5`
BLAKE2b-256	`e93bbbbeafb1be8edc36f13473555a5e01cd3242ec2dcd4b03b23a8bf83e1b57`

See more details on using hashes here.

Provenance

The following attestation bundles were made for octuner-0.1.2.post1.tar.gz:

Publisher: publish.yml on joaoplay/octuner

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: octuner-0.1.2.post1.tar.gz
- Subject digest: ab45e7558d44d2c65024db7215b87e3adaefa5476781ea605a1735957bedefc0
- Sigstore transparency entry: 619647216
- Sigstore integration time: Oct 17, 2025
Source repository:
- Permalink: joaoplay/octuner@d17d36c63672911c4fcf65672ab36b92fdd1bf32
- Branch / Tag: refs/tags/v0.1.2.post1
- Owner: https://github.com/joaoplay
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@d17d36c63672911c4fcf65672ab36b92fdd1bf32
- Trigger Event: release

File details

Details for the file octuner-0.1.2.post1-py3-none-any.whl.

File metadata

Download URL: octuner-0.1.2.post1-py3-none-any.whl
Upload date: Oct 17, 2025
Size: 57.3 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for octuner-0.1.2.post1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`bd3669aed4501301a873d6c3e01cd6df9fc07c213dbbf7616c77a723b4402a78`
MD5	`4c1cdd6b107408293f0a87ba796486e8`
BLAKE2b-256	`389a0e813628b3b588549c64c9dbd56fbdce2a44dfe92de793092ce1e14a3a73`

See more details on using hashes here.

Provenance

The following attestation bundles were made for octuner-0.1.2.post1-py3-none-any.whl:

Publisher: publish.yml on joaoplay/octuner

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: octuner-0.1.2.post1-py3-none-any.whl
- Subject digest: bd3669aed4501301a873d6c3e01cd6df9fc07c213dbbf7616c77a723b4402a78
- Sigstore transparency entry: 619647247
- Sigstore integration time: Oct 17, 2025
Source repository:
- Permalink: joaoplay/octuner@d17d36c63672911c4fcf65672ab36b92fdd1bf32
- Branch / Tag: refs/tags/v0.1.2.post1
- Owner: https://github.com/joaoplay
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@d17d36c63672911c4fcf65672ab36b92fdd1bf32
- Trigger Event: release

octuner 0.1.2.post1

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

Octuner - Multi-Provider LLM Optimizer

Why Octuner?

Quick Start

1. Create your model chain

2. Add a dataset and metric

3. Optimize

Key Features

Multi-Provider Optimization

Multiple Optimization Modes

Flexible Parameter Control

Web Search Integration

Configuration Templates

Simple Configuration Example

Use in Your Code

Installation

Requirements

License

Contributing

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance