Skip to main content

Multi-LLM consensus engine with semantic agreement and self-evolution. Source-available under FSL-1.1-Apache-2.0.

Reason this release was yanked:

Security: 3 transparency fixes in 0.2.2 (Jaccard scoring exposed, SelfPromptOptimizer wired into hot path, meta-loop enforcement). Upgrade required: pip install -U quorum-ai

Project description

Quorum

Multi-LLM consensus engine. 8+ models in parallel. Semantic consensus via embeddings. Self-evolves with use. Patent Pending: HSP (PCT/US26/11908).

pip install quorum-ai
from quorum import consensus

result = await consensus("What is the chemical structure of L-Cysteine?")

# result.answer            → final consensus answer
# result.confidence        → semantic agreement score 0..1
# result.models            → [{name, response, weight, latency_ms, cost_usd}]
# result.disagreements     → list of points where models disagreed
# result.evolution_signals → which loops fired this query

Pro tier: £49/mo — solo devs, BYOK, all 13 evolution loops, 5,000 queries/mo, £0.012 overage. Start in 30 seconds at https://quorum-ai.dev.

Recommended usage pattern — context profiles

Before asking Quorum anything substantial, create a context profile for the project or domain you're working in. The profile is auto-injected into every consensus query so all the LLMs start from the same ground truth instead of falling back to training-data priors.

quorum context add my-project --file README.md   # one-time setup
quorum ask --all "what should I prioritize next?"  # context auto-injects, web is live by default

Full guide: docs/CONTEXT_PROFILES.md (also documents the failure mode this prevents — same idea as Claude Projects / Cursor .cursorrules / ChatGPT Custom Instructions, applied to multi-LLM consensus).

What makes Quorum different

  • 8+ models in parallel by default: Claude, GPT, Gemini, Grok, Llama (local), Llama 3.3, Mistral, DeepSeek, Qwen, Phi
  • Semantic consensus, not lexical: cosine similarity on embeddings, not Jaccard noise
  • Adversarial revision: round 2 where models see each other's answers and can change their mind
  • 13 self-evolution loops: RLHF, Hebbian, distillation, router, memory, meta-learning, model-vs-model, A/B testing, synthetic data, federated, self-prompting, adversarial, architecture search
  • HSP gate on every high-stakes decision (patent pending)
  • Auto-certification for EU AI Act 2026-08 — every query generates an audit-ready PDF
  • Hosted API + BYOK: run locally with your own keys OR call our managed FastAPI with metered billing
  • Switzerland of LLMs: the incumbents structurally can't sell this — they'd commoditize themselves

Why we exist

Anthropic, OpenAI, Google, Microsoft structurally cannot sell a multi-vendor consensus engine — it would commoditize their own models. Quorum occupies that gap. Neutral. Auditable. Open source. Patent Pending.

The 13 self-evolution loops

Each loop closes a feedback gap that single-model deployments leak silently. They run async, behind the consensus call, and write back into router weights, memory, and prompts.

# Loop What it does File
1 RLHF Learns from explicit user thumbs / corrections evolution/rlhf.py
2 Hebbian "Neurons that fire together wire together" — co-correct models get correlated weights evolution/hebbian.py
3 Distillation Cheap models learn from expensive consensus evolution/distillation.py
4 Router Per-domain weighting — Claude for code, Gemini for vision, etc. evolution/router.py
5 Memory Vector recall of past consensus on similar prompts evolution/memory_loop.py + core/memory.py
6 Meta-learning Loops learn which loops are working — second-order updates evolution/meta.py
7 Competition (model-vs-model) Pairwise duels; ELO-style ranking evolution/competition.py
8 A/B testing Two prompt variants per call; track which wins evolution/ab_testing.py
9 Synthetic data High-confidence consensus becomes training data evolution/synthetic_data.py
10 Federated Cross-tenant signal aggregation without raw data leak evolution/federated.py
11 Self-prompting Quorum rewrites ambiguous prompts before fan-out evolution/self_prompt.py
12 Adversarial Red-team prompts; models that fall for them lose weight evolution/adversarial.py
13 Architecture search Tries new model combos / topologies; promotes winners evolution/architecture_search.py

Billing tiers (hosted)

The OSS package is free forever. The hosted API at api.quorum-ai.com is metered. BYOK only — Quorum never proxies your provider keys. You pay the platform fee; your LLM spend stays on your own Anthropic / OpenAI / Gemini / Grok bills.

Pro — £49/mo (start here)

Tier Price / mo Included Overage
Pro £49 5,000 queries, 8 models in parallel, all 13 evolution loops, BYOK £0.012 / query

Why Pro is the right tier for you. If you're a solo backend dev, indie hacker, or an agency engineer shipping LLM features under your own name, Pro is built for your workflow. You get the full consensus engine — 8 models, semantic agreement, every self-evolution loop — at a price that fits a single-developer P&L, and you keep your own provider keys so there's nothing to migrate when you scale. No seat minimums, no procurement call, no "contact sales" wall between you and shipping.

Sign up at https://quorum-ai.dev — 30 seconds, Stripe-backed, cancel any time.

Free sandbox

Tier Price / mo Included
Free £0 100 queries, 3 models max, no evolution loops — for dev/test only

Higher tiers (talk to us: jaqueline@hsp-protocol.com)

For multi-user accounts, regulated workloads, or the EU AI Act PDF certification path, the following exist but are deliberately out of the self-serve flow. Email if you need them.

Tier Price / mo Included Overage
Team £199 25,000 queries, federated loop on, audit log retention 90d £0.008 / query
Enterprise £1,499 Unlimited, SLA 99.9%, SSO, on-prem, training data licence Custom
Compliance add-on +£500 Per-query EU AI Act PDF certificate, signed, hash-chained

Stripe-backed. Webhook handler with in-memory fallback so tests run without keys. See billing/stripe_billing.py.

Hosted API endpoints

FastAPI server in server/main.py. Run locally with quorum-server or deploy to any container host. All endpoints rate-limited via slowapi; auth via Bearer JWT or BYOK header.

Method Path Purpose
POST /v1/consensus Run a consensus query. Returns ConsensusResult.
POST /v1/consensus/stream Same, server-sent events as each model returns
GET /v1/models List enabled providers + per-tenant overrides
POST /v1/feedback Thumbs up/down on a past query — feeds RLHF loop
GET /v1/cert/{query_id} Download EU AI Act PDF certificate
POST /v1/billing/checkout Create Stripe checkout session for tier upgrade
POST /v1/billing/webhook Stripe webhook (signature-verified)
POST /v1/hsp/webhook HSP gate decision callback (patent PCT/US26/11908)
GET /v1/usage Current period usage + remaining quota
GET /healthz Liveness / readiness
GET /metrics Prometheus scrape endpoint

EU AI Act certification (2026-08 deadline)

The EU AI Act enforcement window starts 2026-08-02 for general-purpose AI systems and 2027-08-02 for high-risk uses. Quorum auto-generates a per-query audit certificate that satisfies Art. 13 (transparency) and Art. 12 (record-keeping) obligations:

  • Every model that ran, its weight, its raw response
  • Consensus method (semantic / lexical), threshold used
  • HSP gate decision + signing key
  • Cost paid, latency, tokens — for energy-use disclosure
  • SHA-256 chain link to previous certificate in tenant (tamper-evident)

PDF generated via reportlab. Stored in tenant bucket; downloadable via /v1/cert/{query_id}. Code in hsp/ai_act_cert.py.

Architecture

                       ┌─────────────────────────────┐
   user prompt ───────▶│       FastAPI server        │  uvicorn + slowapi rate limit
                       │  /v1/consensus[/stream]     │  Bearer JWT or BYOK header
                       └──────────────┬──────────────┘
                                      │
                                      ▼
                  ┌───────────────────────────────────────┐
                  │  core/consensus.py — orchestrator     │
                  │  async fan-out, max_concurrency=8     │
                  └─────┬───────────────────────────┬─────┘
                        │                           │
                        ▼                           ▼
     ┌─────────────────────────────┐    ┌────────────────────────┐
     │ providers/  (BYOK)          │    │ core/embeddings.py     │
     │  anthropic, gemini, openai, │    │ core/memory.py         │
     │  ollama, replicate, ...     │    │  vector recall + sqlite│
     └─────────────────────────────┘    └────────────────────────┘
                        │                           │
                        └───────────┬───────────────┘
                                    │
                                    ▼
                       ┌────────────────────────────┐
                       │ HSP gate (patent PCT/      │
                       │  US26/11908) — decides if  │
                       │  consensus is binding for  │
                       │  a high-stakes domain      │
                       │  hsp/gate.py               │
                       └─────────────┬──────────────┘
                                     │
            ┌────────────────────────┼────────────────────────┐
            ▼                        ▼                        ▼
  ┌─────────────────┐   ┌──────────────────────┐  ┌────────────────────┐
  │ 13 evolution    │   │ EU AI Act cert PDF   │  │ Stripe billing     │
  │ loops (async    │   │ hsp/ai_act_cert.py   │  │ billing/           │
  │ writebacks):    │   │ — hash-chained,      │  │  stripe_billing.py │
  │  1 RLHF         │   │   reportlab signed   │  │  webhook handler   │
  │  2 Hebbian      │   └──────────────────────┘  │  in-memory fallback│
  │  3 Distillation │                             └────────────────────┘
  │  4 Router       │
  │  5 Memory       │
  │  6 Meta         │
  │  7 Competition  │
  │  8 A/B          │
  │  9 Synthetic    │
  │ 10 Federated    │
  │ 11 Self-prompt  │
  │ 12 Adversarial  │
  │ 13 Arch search  │
  └─────────────────┘

All evolution loops are async writebacks — they never block the response path. Loop outputs feed back into router weights, memory vectors, and self_prompt templates on the next query. Meta-learning audits loop effectiveness and can disable loops that regress.

Roadmap

Version Status Date
v0.0.1 🟢 5 providers, semantic consensus, CLI 2026-06-15
v0.1.0 🟢 13 evolution loops, vector memory, HSP gate, FastAPI server, Stripe billing, EU AI Act cert 2026-06-16
v0.1.5 🟢 BYOK, Firestore persistence, free signup, Hermes 3 (Nous), /v1/consensus provider filter 2026-06-18
v1.0.0 🟡 Hosted SaaS public launch, multi-tenant, federated loop GA Q4 2026

License

Apache 2.0 (core) + HSP Commercial Restrictions on evolution / compliance modules. See LICENSE-HSP.

Founder

Jaqueline Martins — Sovereign Chain Ltd, UK. Patent holder of HSP Protocol (PCT/US26/11908).

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

quorum_ai-0.2.1.tar.gz (2.3 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

quorum_ai-0.2.1-py3-none-any.whl (344.2 kB view details)

Uploaded Python 3

File details

Details for the file quorum_ai-0.2.1.tar.gz.

File metadata

  • Download URL: quorum_ai-0.2.1.tar.gz
  • Upload date:
  • Size: 2.3 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.0

File hashes

Hashes for quorum_ai-0.2.1.tar.gz
Algorithm Hash digest
SHA256 6bbb042e667334d7fc7afc496c0991daada11fb4826e2178d458f0deae15b0de
MD5 a187cee8610823152d3075d017eef14b
BLAKE2b-256 3633cb69783b6bb3f37654719d8488fc8780a733674326f51633a32da9e47fd4

See more details on using hashes here.

File details

Details for the file quorum_ai-0.2.1-py3-none-any.whl.

File metadata

  • Download URL: quorum_ai-0.2.1-py3-none-any.whl
  • Upload date:
  • Size: 344.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.0

File hashes

Hashes for quorum_ai-0.2.1-py3-none-any.whl
Algorithm Hash digest
SHA256 2d41f5d2a0bf673030b624c8a79ab49729d346487e91c35e1e5335926cb251f3
MD5 1042a41d328af2067daa5f71f8e11131
BLAKE2b-256 8b11f48c73b282331e4a4941781bf999f0293003a0ef491f30ba9640a41ca90c

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page