Skip to main content

This is an unofficial, use-at-your-own risks port of the webarena benchmark, for use as a standalone library package.

Project description

Warning: use at your own risks!

Unofficial WebArena port for compatibility with BrowserGym. Changes below.

More flexible/recent dependencies

  • playwright>=1.32,<1.40
  • openai>=1
  • transformers
  • beartype>=0.12.0

Packaging into a single Python namespace

pip install libwebarena
import webarena
import webarena.browser_env
import webarena.agent
import webarena.evaluation_harness
import webarena.llms
import webarena.llms.providers

Making HTMLContentEvaluator idempotent (validate() should not alter the browser's state)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

webarenax-0.1.tar.gz (35.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

webarenax-0.1-py3-none-any.whl (42.2 kB view details)

Uploaded Python 3

File details

Details for the file webarenax-0.1.tar.gz.

File metadata

  • Download URL: webarenax-0.1.tar.gz
  • Upload date:
  • Size: 35.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for webarenax-0.1.tar.gz
Algorithm Hash digest
SHA256 e39eae0daa9cb9f86cb946b1f83a8954c0bc8fe5e1087b5b1924ff4f020c5c68
MD5 2eeee8f8753cf953afb695724267bf60
BLAKE2b-256 4b82595744123d6f8461950f753c61a74857f650cd703b7a7c2255e8c578987e

See more details on using hashes here.

File details

Details for the file webarenax-0.1-py3-none-any.whl.

File metadata

  • Download URL: webarenax-0.1-py3-none-any.whl
  • Upload date:
  • Size: 42.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for webarenax-0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 48ee018b6159fa0c8ecb4194d68f6a011fba649358b96a4036ff2ffece879623
MD5 5229f26feb08528d671d0fc887dd1158
BLAKE2b-256 544d31932210a77e16b198567e18f42d6271d63c9cb1e8b73e8d66009fc0b8f2

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page