Skip to main content

Benchmarking the performance of agents far and wide, regardless of how they are set up and how they work

Project description

Start-GPT Benchmarks

Built for the purpose of benchmarking the performance of agents regardless of how they work.

Objectively know how well your agent is performing in categories like code, retrieval, memory, and safety.

Save time and money while doing it through smart dependencies. The best part? It's all automated.

Scores:

Screenshot 2023-07-25 at 10 35 01 AM

Ranking overall:

Detailed results:

Screenshot 2023-07-25 at 10 42 15 AM

Click here to see the results and the raw data!!

More agents coming soon !

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

startbenchmark-0.0.11.tar.gz (93.6 kB view details)

Uploaded Source

Built Distribution

startbenchmark-0.0.11-py3-none-any.whl (199.5 kB view details)

Uploaded Python 3

File details

Details for the file startbenchmark-0.0.11.tar.gz.

File metadata

  • Download URL: startbenchmark-0.0.11.tar.gz
  • Upload date:
  • Size: 93.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.7.1 CPython/3.8.18 Linux/6.2.0-1018-azure

File hashes

Hashes for startbenchmark-0.0.11.tar.gz
Algorithm Hash digest
SHA256 225e6f52bb6455d4be3c1adda17e2b5dad0f1b7f241bda95c0e5c23c6a0dd85d
MD5 2ae860f69f384b0e7d2c76ed1a12a21a
BLAKE2b-256 948d14a2d9769ee5884affcd57a2fb718b2b078de2067bbbb3980edaac4171dd

See more details on using hashes here.

File details

Details for the file startbenchmark-0.0.11-py3-none-any.whl.

File metadata

  • Download URL: startbenchmark-0.0.11-py3-none-any.whl
  • Upload date:
  • Size: 199.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.7.1 CPython/3.8.18 Linux/6.2.0-1018-azure

File hashes

Hashes for startbenchmark-0.0.11-py3-none-any.whl
Algorithm Hash digest
SHA256 9cee587b5dff2ae8249826eefdbf8aca079d71b766d67c31d0d9d468578c5fd6
MD5 cbc8f5aa494e9ef7fc207efedb48993f
BLAKE2b-256 60cf8776d2dea9dab1b630513a299e0476131579320c3d512623034e1474aa9f

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page