Skip to main content

Multi-agent collaboration through pairwise comparisons

Project description

Arbitron ⚖️

Arbitron is an agentic pairwise comparison engine. Multiple jurors, each with unique value systems, evaluate items head-to-head and produce a set of pairwise comparisons that can be used to derive item's ranks and weights.

  • Why pairwise? It's easier to compare two items than to assign absolute scores.
  • Why multi-juror? Different models with different perspectives (instructions) lead to more balanced, less biased outcomes.

✨ Features

  • 🎯 Arbitrary Sets. Evaluate text, code, products, ideas
  • 🤖 Customizable Jurors. Specify custom instructions, tools, providers
  • 🛡️ Bias Reduction. Ensemble decision-making
  • 🧩 Remixable — Join data with human labels and apply personalized heuristics

🚀 Quickstart

Running your first Arbitron "contest" is easy!

pip install arbitron

Setup your favorite LLM provider's API keys in the environment (e.g: OPENAI_API_KEY) and then run the following code.

from arbitron import Competition, Item, Juror

items = [
    Item(id="arrival"),
    Item(id="interstellar"),
    Item(id="inception"),
]

jurors = [
    Juror(id="SciFi Purist", model="openai:gpt-5-nano"),
]

competition = Competition(
    id="sci-fi-soundtracks",
    description="Which movie has the better soundtrack?",
    jurors=jurors,
    items=items,
)

for comparison in competition.run():
    print(comparison)

print(f"Total cost: {competition.cost}")

🏛️ License

MIT License - see LICENSE file for details.

🙌 Acknowledgments

  • DeepGov and their use of AI for Democratic Capital Allocation and Governance.
  • Daniel Kronovet for his many writings on the power of pairwise comparisons.

Margur veit það sem einn veit ekki. Many know what one does not know.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

arbitron-0.5.5.tar.gz (6.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

arbitron-0.5.5-py3-none-any.whl (8.8 kB view details)

Uploaded Python 3

File details

Details for the file arbitron-0.5.5.tar.gz.

File metadata

  • Download URL: arbitron-0.5.5.tar.gz
  • Upload date:
  • Size: 6.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.9.5

File hashes

Hashes for arbitron-0.5.5.tar.gz
Algorithm Hash digest
SHA256 0d946dc2774194991dc1fcb999e03ecc9c1b623207c2d37b97279083e5f327ec
MD5 ea4efede73c6801979fcc76bc6c33574
BLAKE2b-256 c79cfbf6d6acc0737886a6cfaf048bdca68181065a0bf44d31efcced1daf398c

See more details on using hashes here.

File details

Details for the file arbitron-0.5.5-py3-none-any.whl.

File metadata

  • Download URL: arbitron-0.5.5-py3-none-any.whl
  • Upload date:
  • Size: 8.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.9.5

File hashes

Hashes for arbitron-0.5.5-py3-none-any.whl
Algorithm Hash digest
SHA256 9d7ba3437a1a929f22e28b3403215f1ac269df29ee35679e7ba00361a9555302
MD5 df3f32ae53fdbc82ce28060d24f3c304
BLAKE2b-256 4cb78265b01b89cf3d77443d58c0ab8861c471c7e27a2768c8ff6ecff1022f7d

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page