Reproducible benchmark suite for memory/QA systems — June + pluggable competitors.

june-bench

A pip-installable, reproducible benchmark suite for memory / QA systems — June + pluggable competitors — over LoCoMo, LongMemEval, HotpotQA/2Wiki/MuSiQue, and FinanceBench, with the same data and the same scorer.

pip install june-bench
june-bench list
june-bench run --system echo --dataset smoke --split smoke    # offline, no key, no download

A benchmark is run(system, dataset) → records → score. Two typed ports are the only extension points:

  • System: the thing being benchmarked. JuneApiSystem (the default; a thin HTTP client to June's /v1/answer, so no June source is shipped), JuneLocalSystem ([june-local] extra; a source-protected compiled wheel), CogneeSystem ([cognee] extra), or any future system, added as a single adapter.
  • Dataset: what the system runs on. The four benchmarks sit behind a registry.
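
The run(system, dataset) → records → score shape and the two ports can be sketched as follows. This is only an illustration under stated assumptions, not june-bench's actual API: the names Example, Record, System.answer, Dataset.examples, EchoSystem, SmokeDataset, run, and exact_match_rate are all hypothetical, and the toy echo system here does not necessarily behave like the packaged one.

```python
from dataclasses import dataclass
from typing import Iterable, Iterator, List, Protocol


@dataclass(frozen=True)
class Example:
    """One benchmark item (hypothetical shape)."""
    id: str
    question: str
    gold: str


@dataclass(frozen=True)
class Record:
    """One prediction paired with its gold answer (hypothetical shape)."""
    example_id: str
    prediction: str
    gold: str


class System(Protocol):
    """Port 1: the thing being benchmarked."""
    name: str

    def answer(self, question: str) -> str: ...


class Dataset(Protocol):
    """Port 2: what the system runs on."""
    name: str

    def examples(self) -> Iterable[Example]: ...


class EchoSystem:
    """Toy system that returns the question unchanged (no key, no network)."""
    name = "echo"

    def answer(self, question: str) -> str:
        return question


class SmokeDataset:
    """Two in-memory items standing in for a shipped smoke fixture."""
    name = "smoke"

    def examples(self) -> Iterator[Example]:
        yield Example("1", "Paris", "Paris")
        yield Example("2", "What is 2 + 2?", "4")


def run(system: System, dataset: Dataset) -> List[Record]:
    """Ask the system every question in the dataset and collect records."""
    return [
        Record(ex.id, system.answer(ex.question), ex.gold)
        for ex in dataset.examples()
    ]


def exact_match_rate(records: List[Record]) -> float:
    """Fraction of records whose prediction equals the gold answer (case-insensitive)."""
    if not records:
        return 0.0
    hits = sum(r.prediction.strip().lower() == r.gold.strip().lower() for r in records)
    return hits / len(records)
```

Because both ports are structural protocols in this sketch, a new system only has to provide a name and an answer method; nothing else in run or the scorer changes.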

The scorer is the canonical SQuAD/HotpotQA exact match (EM) and token-level F1, plus selective accuracy, coverage, and cost, so results are comparable with Cognee's. Tiny smoke fixtures ship in the wheel as an offline wiring proof; full splits are fetched from a pinned release and verified by SHA checksum. No score is ever baked into the package: every result row records dataset, scorer, system, model, and cost, so a stranger can reproduce any published number.
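
For readers unfamiliar with the SQuAD-style metrics named above, the following is a minimal sketch of EM and token F1 using the normalization from the public SQuAD v1.1 evaluation script (lowercase, strip punctuation, drop English articles, collapse whitespace). It is not june-bench's own scorer code. The selective_metrics helper is an assumption: it treats a prediction of None as an abstention, defines coverage as the fraction of questions answered, and defines selective accuracy as EM over the answered ones; the package may define these differently.

```python
import re
import string
from collections import Counter
from typing import List, Optional, Tuple

_PUNCT = set(string.punctuation)


def normalize_answer(text: str) -> str:
    """SQuAD-style normalization: lowercase, drop punctuation and articles, fix whitespace."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in _PUNCT)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction: str, gold: str) -> bool:
    """True when the normalized strings are identical."""
    return normalize_answer(prediction) == normalize_answer(gold)


def f1_score(prediction: str, gold: str) -> float:
    """Token-level F1 over the normalized, whitespace-split answers."""
    pred_tokens = normalize_answer(prediction).split()
    gold_tokens = normalize_answer(gold).split()
    # Multiset intersection counts each shared token at most as often as it appears in both.
    num_same = sum((Counter(pred_tokens) & Counter(gold_tokens)).values())
    if num_same == 0:
        return 0.0
    precision = num_same / len(pred_tokens)
    recall = num_same / len(gold_tokens)
    return 2 * precision * recall / (precision + recall)


def selective_metrics(
    predictions: List[Optional[str]], golds: List[str]
) -> Tuple[float, float]:
    """Return (selective accuracy, coverage); None means the system abstained (assumed convention)."""
    answered = [(p, g) for p, g in zip(predictions, golds) if p is not None]
    coverage = len(answered) / len(golds) if golds else 0.0
    accuracy = (
        sum(exact_match(p, g) for p, g in answered) / len(answered) if answered else 0.0
    )
    return accuracy, coverage
```

Normalizing before comparison is what makes "The Eiffel Tower!" and "eiffel tower" an exact match, while F1 gives partial credit to a prediction that contains the gold answer along with extra tokens.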

Status: SB0 (contracts + no-deps smoke + skeleton). Datasets, June/Cognee systems, and the full CLI land in SB1–SB6.

Download files

Source Distribution

june_bench-0.0.6.tar.gz (68.2 kB)

Uploaded Source

Built Distribution

june_bench-0.0.6-py3-none-any.whl (56.8 kB)

Uploaded Python 3

File details

Details for the file june_bench-0.0.6.tar.gz.

File metadata

  • Download URL: june_bench-0.0.6.tar.gz
  • Upload date:
  • Size: 68.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for june_bench-0.0.6.tar.gz
Algorithm    Hash digest
SHA256       1e716eab9837a8db438e66a1cceff7737fcd3b6b4497cc35cca62c666d1ce03e
MD5          6ab4ff419bc709a37ed898927c5b6f37
BLAKE2b-256  95550c97a4449eaab3b470ce6ac345fc4f0863317d731e126ef1852dcaca171a
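
A downloaded file can be checked against the SHA256 digest listed above with a few lines of standard-library Python. This is a minimal sketch: the helper names sha256_of and verify_sha256 are hypothetical, and the local file path in the usage note is an assumption about where the archive was saved.

```python
import hashlib
from pathlib import Path
from typing import Union


def sha256_of(path: Union[str, Path], chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA256 digest of a file, reading it in chunks to bound memory use."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_sha256(path: Union[str, Path], expected_hex: str) -> bool:
    """True when the file's SHA256 digest equals the expected hex string (case-insensitive)."""
    return sha256_of(path) == expected_hex.strip().lower()
```

For the source distribution, verify_sha256("june_bench-0.0.6.tar.gz", "1e716eab9837a8db438e66a1cceff7737fcd3b6b4497cc35cca62c666d1ce03e") should return True for an unmodified download saved under that name in the current directory.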

Provenance

The following attestation bundles were made for june_bench-0.0.6.tar.gz:

Publisher: publish-bench.yml on Junemind/june-brain

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file june_bench-0.0.6-py3-none-any.whl.

File metadata

  • Download URL: june_bench-0.0.6-py3-none-any.whl
  • Upload date:
  • Size: 56.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for june_bench-0.0.6-py3-none-any.whl
Algorithm    Hash digest
SHA256       005e0674e3d23bcf13da074006b8d1f8d15c7f120dfb9e17f37ce8014c6e391a
MD5          f85c2dacf0015e4d4582788256706d60
BLAKE2b-256  a80d5e32df5f8a3a8ef94b2aec964b44c5eaf47488f4790c153b3541124f1267

Provenance

The following attestation bundles were made for june_bench-0.0.6-py3-none-any.whl:

Publisher: publish-bench.yml on Junemind/june-brain
