Some features may not work without JavaScript. Please try enabling it if you encounter problems.

Benchmarking the performance of agents far and wide, regardless of how they are set up and how they work

These details have not been verified by PyPI

Project description

Auto-GPT Benchmarks

Built for the purpose of benchmarking the performance of agents regardless of how they work.

Objectively know how well your agent is performing in categories like code, retrieval, memory, and safety.

Save time and money while doing it through smart dependencies. The best part? It's all automated.

Scores:

Screenshot 2023-07-25 at 10 35 01 AM

Ranking overall:

Detailed results:

Screenshot 2023-07-25 at 10 42 15 AM

Click here to see the results and the raw data!!

More agents coming soon !

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.0.10

Sep 17, 2023

0.0.9 yanked

Aug 17, 2023

Reason this release was yanked:

Outdated agent-protocol

0.0.8 yanked

Aug 15, 2023

Reason this release was yanked:

Outdated agent-protocol

0.0.7 yanked

Aug 12, 2023

Reason this release was yanked:

Outdated agent-protocol

0.0.6 yanked

Aug 11, 2023

Reason this release was yanked:

TestReadFile broken

0.0.5 yanked

Aug 9, 2023

Reason this release was yanked:

Outdated agent-protocol

0.0.4 yanked

Aug 9, 2023

Reason this release was yanked:

Outdated agent-protocol

0.0.3

Aug 3, 2023

0.0.2

Jul 24, 2023

0.0.1

Jul 23, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

agbenchmark-0.0.10.tar.gz (101.5 kB view details)

Uploaded Sep 17, 2023 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

agbenchmark-0.0.10-py3-none-any.whl (212.2 kB view details)

Uploaded Sep 17, 2023 Python 3

File details

Details for the file agbenchmark-0.0.10.tar.gz.

File metadata

Download URL: agbenchmark-0.0.10.tar.gz
Upload date: Sep 17, 2023
Size: 101.5 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: poetry/1.6.1 CPython/3.8.18 Linux/6.2.0-1011-azure

File hashes

Hashes for agbenchmark-0.0.10.tar.gz
Algorithm	Hash digest
SHA256	`14793e98507f530eab7473a3dc7dfc218a49260de7b34f71939cb081877fb981`
MD5	`451197022ce8451c4c36ef9b0f73a133`
BLAKE2b-256	`a56989d87beadf1ab4d5835b0d601bbdf8be3b4cbb3081209541f33b2819e1e6`

See more details on using hashes here.

File details

Details for the file agbenchmark-0.0.10-py3-none-any.whl.

File metadata

Download URL: agbenchmark-0.0.10-py3-none-any.whl
Upload date: Sep 17, 2023
Size: 212.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: poetry/1.6.1 CPython/3.8.18 Linux/6.2.0-1011-azure

File hashes

Hashes for agbenchmark-0.0.10-py3-none-any.whl
Algorithm	Hash digest
SHA256	`e165be0d05eb84700057a0660fe05132b4d7857d1395466d0d59bb0592143200`
MD5	`b433851b7484772b18530fa4d6f02903`
BLAKE2b-256	`89cf984bbcea12511aff6437ffe0fbead4af1e7762ab27a5204cda339ef047d7`

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor

Datadog Monitoring

Depot Continuous Integration

Google Download Analytics

Pingdom Monitoring

Sentry Error logging

StatusPage Status page