Single-agent advocacy variant of sophistry-bench for the Prime Intellect Reward Hacking Sprint. Pre-registered hypothesis: training Llama-3.2-1B on a programmatic claim-count cliff (peak at n=8) will cause cliff convergence within 100 GRPO steps; three adversarial canary rewards detect format-hacking. The package is self-contained (vendored from sophistry-bench v0.1.19) and has no runtime dependency on the main package, which works around the exclude-newer index filter in PI's training infrastructure.
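
The summary above does not say how the claim-count cliff is shaped, so the following is only a minimal sketch of one plausible form. It assumes the cliff is a reward that rises linearly with the number of claims up to the peak and drops to zero past it. The function name `cliff_reward`, the linear ramp, and the zero reward beyond the peak are all illustrative assumptions, not taken from the package.

```python
def cliff_reward(n_claims: int, peak: int = 8) -> float:
    """Hypothetical claim-count cliff: ramp up to `peak`, then fall to zero.

    Assumption: the real package may use a different shape entirely;
    only the peak location (n=8) comes from the project summary.
    """
    if n_claims < 0:
        raise ValueError("n_claims must be non-negative")
    if n_claims > peak:
        # Past the peak the reward collapses (the "cliff").
        return 0.0
    # Linear ramp from 0 at zero claims to 1.0 at the peak.
    return n_claims / peak
```

Under these assumptions, a policy that maximizes this reward would settle on exactly 8 claims, which is one way to read the "cliff convergence" the hypothesis predicts.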

Project description

The author of this package has not provided a project description

Download files

Source Distribution

sophistry_bench_sprint-0.1.5.tar.gz (498.6 kB)

Uploaded Source

Built Distribution

sophistry_bench_sprint-0.1.5-py3-none-any.whl (502.5 kB)

Uploaded Python 3

File details

Details for the file sophistry_bench_sprint-0.1.5.tar.gz.

File metadata

  • Download URL: sophistry_bench_sprint-0.1.5.tar.gz
  • Size: 498.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.3

File hashes

Hashes for sophistry_bench_sprint-0.1.5.tar.gz
  • SHA256: 640c59abf929fccc1b555aae634c3a8f170e46de04af1681e955697c221cd327
  • MD5: 27dfaca3037cda81051c8915695bdf3c
  • BLAKE2b-256: 70f677c2ca00d554fd4d39e11d01a3091d918ce2265af5464d05cb520b1e47d8

File details

Details for the file sophistry_bench_sprint-0.1.5-py3-none-any.whl.

File hashes

Hashes for sophistry_bench_sprint-0.1.5-py3-none-any.whl
  • SHA256: bc2c29a3452aeee4e28a156926546922f16f3e58c2f63537ea253e99e0d75836
  • MD5: 71a6772ed3383d0afd2ad7b1537f80e4
  • BLAKE2b-256: 0ff729a0a1ce5f3af0d5e469357c55b1157f353342e8beb792ce249720c0237e
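
The published digests can be used to check a downloaded file before installing it. The sketch below uses only the standard-library `hashlib` and `hmac` modules. The SHA256 values are copied from the hash listings on this page, while the helper names `sha256_of` and `matches_published_hash` are hypothetical and not part of the package.

```python
import hashlib
import hmac
from pathlib import Path

# SHA256 digests copied from the hash listings for the two release files.
EXPECTED_SHA256 = {
    "sophistry_bench_sprint-0.1.5.tar.gz":
        "640c59abf929fccc1b555aae634c3a8f170e46de04af1681e955697c221cd327",
    "sophistry_bench_sprint-0.1.5-py3-none-any.whl":
        "bc2c29a3452aeee4e28a156926546922f16f3e58c2f63537ea253e99e0d75836",
}


def sha256_of(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path):
    """Compare a local file's digest with the published one for its file name.

    Raises KeyError if the file name is not one of the two release files.
    """
    expected = EXPECTED_SHA256[Path(path).name]
    return hmac.compare_digest(sha256_of(path), expected)
```

With a release file saved under its original name, `matches_published_hash("sophistry_bench_sprint-0.1.5.tar.gz")` should return `True` for an intact download and `False` for a corrupted or altered one.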
