This is an unofficial, use-at-your-own-risk port of the WebArena benchmark, for use as a standalone library package.

Project description

Warning: use at your own risk!

Unofficial WebArena port for compatibility with BrowserGym. Changes below.

More flexible/recent dependencies

  • playwright>=1.32,<1.40
  • openai>=1
  • transformers
  • beartype>=0.12.0
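
To check whether an existing environment already falls inside these ranges, something like the following sketch can help. It uses only the standard library; the helper names (`numeric_version`, `in_range`, `check_installed`) are hypothetical and not part of this package, and the version parsing is deliberately naive (it ignores pre-release suffixes), so treat it as a rough check rather than a full version-specifier implementation.

```python
from importlib import metadata


def numeric_version(text):
    """Turn '1.39.0' into (1, 39, 0), keeping only the leading numeric parts."""
    parts = []
    for piece in text.split("."):
        if not piece.isdigit():
            break  # stop at the first non-numeric component, e.g. '0rc1'
        parts.append(int(piece))
    return tuple(parts)


def in_range(version, lower=None, upper=None):
    """True if lower <= version < upper (either bound may be omitted)."""
    v = numeric_version(version)
    if lower is not None and v < numeric_version(lower):
        return False
    if upper is not None and v >= numeric_version(upper):
        return False
    return True


def check_installed(name, lower=None, upper=None):
    """Return None if the distribution is not installed, else the range check."""
    try:
        installed = metadata.version(name)
    except metadata.PackageNotFoundError:
        return None
    return in_range(installed, lower, upper)


if __name__ == "__main__":
    # Bounds copied from the dependency list above.
    print("playwright:", check_installed("playwright", "1.32", "1.40"))
    print("openai:", check_installed("openai", "1"))
    print("beartype:", check_installed("beartype", "0.12.0"))
```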

Packaging into a single Python namespace

pip install libwebarena
import webarena
import webarena.browser_env
import webarena.agent
import webarena.evaluation_harness
import webarena.llms
import webarena.llms.providers
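
After installing, a quick way to confirm that the namespace is importable is to look each module up without fully relying on a chain of `import` statements. The sketch below uses `importlib.util.find_spec` from the standard library; `missing_modules` is a hypothetical helper name, and the module list is simply the one shown above.

```python
import importlib.util


def missing_modules(names):
    """Return the subset of dotted module names that cannot be located."""
    missing = []
    for name in names:
        try:
            spec = importlib.util.find_spec(name)
        except (ImportError, ValueError):
            # Raised when a parent package is absent or has no usable spec.
            spec = None
        if spec is None:
            missing.append(name)
    return missing


if __name__ == "__main__":
    expected = [
        "webarena",
        "webarena.browser_env",
        "webarena.agent",
        "webarena.evaluation_harness",
        "webarena.llms",
        "webarena.llms.providers",
    ]
    print("missing:", missing_modules(expected))
```

Note that looking up a submodule imports its parent package, so this check is not entirely free of side effects.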

Making HTMLContentEvaluator idempotent (validate() should not alter the browser's state)
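
To illustrate what idempotent means here, the following is a minimal sketch of the idea and not the actual implementation in this package. `FakePage` and `validate_idempotent` are hypothetical names: the fake page only tracks a URL, and the validator restores the URL it found on entry, so calling it repeatedly gives the same answer and leaves the page where it was.

```python
class FakePage:
    """Hypothetical stand-in for a browser page; it only tracks the current URL."""

    def __init__(self, url, contents):
        self.url = url
        self._contents = contents  # maps URL -> page text

    def goto(self, url):
        self.url = url

    def content(self):
        return self._contents.get(self.url, "")


def validate_idempotent(page, target_url, required_text):
    """Check another URL for some text, then return the page to where it was."""
    original_url = page.url
    try:
        page.goto(target_url)
        return required_text in page.content()
    finally:
        # Restore the state seen on entry, even if the check raised.
        page.goto(original_url)


if __name__ == "__main__":
    page = FakePage(
        "http://shop.example/cart",
        {"http://shop.example/orders": "Order #1: shipped"},
    )
    first = validate_idempotent(page, "http://shop.example/orders", "shipped")
    second = validate_idempotent(page, "http://shop.example/orders", "shipped")
    print(first, second, page.url)
```

A real evaluator has more state to preserve than a URL (form contents, scroll position, open tabs), so an approach such as running checks in a separate page may be more appropriate; the sketch only shows the property being aimed for.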

Download files

Source Distribution

webarenax-0.3.tar.gz (39.0 kB)

Uploaded Source

Built Distribution

webarenax-0.3-py3-none-any.whl (46.4 kB)

Uploaded Python 3

File details

Details for the file webarenax-0.3.tar.gz.

File metadata

  • Download URL: webarenax-0.3.tar.gz
  • Upload date:
  • Size: 39.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for webarenax-0.3.tar.gz
Algorithm Hash digest
SHA256 2be241bec311a64095fd6b35a97001e4add2e8f16ec0840efa6c8595e6c02f1c
MD5 0c553e1de83629fc736dfe5d3c9b2b69
BLAKE2b-256 0067ba349aa4a3ce6c6ba043583d3387c34859275200ef75e709e3c1e740ef42
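
A downloaded file can be checked against the SHA256 digest listed above with a few lines of standard-library Python. This is a sketch only: `sha256_of` and `matches` are hypothetical helper names, and the path in the usage example assumes the archive was saved to the current directory under its original file name.

```python
import hashlib


def sha256_of(path, chunk_size=65536):
    """Compute the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches(path, expected_hex):
    """True if the file's SHA256 digest equals the expected hex string."""
    return sha256_of(path) == expected_hex.strip().lower()


if __name__ == "__main__":
    import os

    # Assumed local path; the digest is the SHA256 value listed above.
    archive = "webarenax-0.3.tar.gz"
    expected = "2be241bec311a64095fd6b35a97001e4add2e8f16ec0840efa6c8595e6c02f1c"
    if os.path.exists(archive):
        print("digest matches:", matches(archive, expected))
    else:
        print("archive not found; download it first")
```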

File details

Details for the file webarenax-0.3-py3-none-any.whl.

File metadata

  • Download URL: webarenax-0.3-py3-none-any.whl
  • Upload date:
  • Size: 46.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for webarenax-0.3-py3-none-any.whl
Algorithm Hash digest
SHA256 95e819bd8092c1cdfeb56471fc4a682959a3a34ecd4909d98cae5dc6c0606f9e
MD5 ad6bca58541b5152c2b1a7279463690b
BLAKE2b-256 5a311445b8aaa7855ff85eaee9fe11dc42ab4daaead7f76faedcf3a4aa4a19e5
