Multi-agent collaboration through pairwise comparisons
Project description
Arbitron ⚖️
Arbitron is an agentic pairwise comparison engine. Multiple jurors, each with unique value systems, evaluate items head-to-head and produce a set of pairwise comparisons that can be used to derive item's ranks and weights.
- Why pairwise? It's easier to compare two items than to assign absolute scores.
- Why multi-juror? Different models with different perspectives (instructions) lead to more balanced, less biased outcomes.
✨ Features
- 🎯 Arbitrary Sets. Evaluate text, code, products, ideas
- 🤖 Customizable Jurors. Specify custom instructions, tools, providers
- 🛡️ Bias Reduction. Ensemble decision-making
- 🧩 Remixable — Join data with human labels and apply personalized heuristics
🚀 Quickstart
Running your first Arbitron "contest" is easy!
pip install arbitron
Setup your favorite LLM provider's API keys in the environment (e.g: OPENAI_API_KEY) and then run the following code.
from arbitron import Competition, Item, Juror
items = [
Item(id="arrival"),
Item(id="interstellar"),
Item(id="inception"),
]
jurors = [
Juror(id="SciFi Purist", model="openai:gpt-5-nano"),
]
competition = Competition(
id="sci-fi-soundtracks",
description="Which movie has the better soundtrack?",
jurors=jurors,
items=items,
)
for comparison in competition.run():
print(comparison)
print(f"Total cost: {competition.cost}")
🏛️ License
MIT License - see LICENSE file for details.
🙌 Acknowledgments
- DeepGov and their use of AI for Democratic Capital Allocation and Governance.
- Daniel Kronovet for his many writings on the power of pairwise comparisons.
Margur veit það sem einn veit ekki. Many know what one does not know.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file arbitron-0.5.5.tar.gz.
File metadata
- Download URL: arbitron-0.5.5.tar.gz
- Upload date:
- Size: 6.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: uv/0.9.5
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
0d946dc2774194991dc1fcb999e03ecc9c1b623207c2d37b97279083e5f327ec
|
|
| MD5 |
ea4efede73c6801979fcc76bc6c33574
|
|
| BLAKE2b-256 |
c79cfbf6d6acc0737886a6cfaf048bdca68181065a0bf44d31efcced1daf398c
|
File details
Details for the file arbitron-0.5.5-py3-none-any.whl.
File metadata
- Download URL: arbitron-0.5.5-py3-none-any.whl
- Upload date:
- Size: 8.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: uv/0.9.5
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
9d7ba3437a1a929f22e28b3403215f1ac269df29ee35679e7ba00361a9555302
|
|
| MD5 |
df3f32ae53fdbc82ce28060d24f3c304
|
|
| BLAKE2b-256 |
4cb78265b01b89cf3d77443d58c0ab8861c471c7e27a2768c8ff6ecff1022f7d
|