Python SDK for the ATP benchmark platform: run benchmarks, submit results, and view leaderboards.

atp-sdk

Python SDK for the ATP (Agent Test Platform) benchmark platform.

Installation

uv add atp-platform-sdk

Quick Start

from atp_sdk import ATPClient

client = ATPClient(platform_url="https://atp.pr0sto.space")
client.login()  # GitHub Device Flow — opens browser

benchmarks = client.list_benchmarks()
run = client.start_run(benchmarks[0].id, agent_name="my-agent")

for task in run:  # iterating over the run yields one task at a time
    response = my_agent(task)  # my_agent is your own callable, not part of the SDK
    run.submit(response, task_index=task["metadata"]["task_index"])

print(run.status())
print(run.leaderboard())
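
The loop above calls `my_agent`, which you supply yourself. A minimal stand-in might look like the sketch below. Only `task["metadata"]["task_index"]` comes from the Quick Start; the `"prompt"` key and the string return value are assumptions made for illustration, so check the actual task payload and the response format that `run.submit` expects before relying on it.

```python
from typing import Any


def my_agent(task: dict[str, Any]) -> str:
    """Hypothetical placeholder agent that echoes the task index and prompt.

    Assumes each task is a dict. The "prompt" key is illustrative only and
    may not exist in real ATP task payloads.
    """
    index = task.get("metadata", {}).get("task_index")
    prompt = task.get("prompt", "")  # assumed key, not confirmed by the SDK
    return f"task {index}: {prompt}".strip()
```

Replacing the body with a call to your model or tool chain keeps the Quick Start loop unchanged.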

Authentication

Three ways to authenticate (checked in order):

  1. Explicit token: ATPClient(token="...")
  2. Environment variable: ATP_TOKEN=...
  3. Saved token: client.login() saves to ~/.atp/config.json
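
The lookup order above can be pictured with the following sketch. It is not the SDK's implementation: `resolve_token` is a hypothetical helper, and the `"token"` key inside `~/.atp/config.json` is an assumption about the file layout. Only the `ATP_TOKEN` variable name and the config path come from the list above.

```python
import json
import os
from pathlib import Path
from typing import Optional


def resolve_token(
    explicit: Optional[str] = None,
    config_path: Optional[Path] = None,
) -> Optional[str]:
    """Hypothetical helper mirroring the documented lookup order."""
    # 1. An explicit token always wins.
    if explicit:
        return explicit

    # 2. Fall back to the ATP_TOKEN environment variable.
    env_token = os.environ.get("ATP_TOKEN")
    if env_token:
        return env_token

    # 3. Finally, look for a token saved by a previous login.
    path = config_path or Path.home() / ".atp" / "config.json"
    try:
        data = json.loads(path.read_text(encoding="utf-8"))
    except (OSError, json.JSONDecodeError):
        return None  # no readable config file

    # The "token" key is an assumption about the saved file's structure.
    return data.get("token") if isinstance(data, dict) else None
```

In practice this means a token set in `ATP_TOKEN` is used in preference to one saved by `client.login()`, which is convenient for CI jobs where no browser is available.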

License

MIT
