13 projects
browsergym-webarena
WebArena benchmark for BrowserGym
browsergym-visualwebarena
VisualWebArena benchmark for BrowserGym
browsergym-miniwob
MiniWoB++ benchmark for BrowserGym
browsergym-experiments
Experimentation tools for BrowserGym
browsergym-core
BrowserGym: a gym environment for web task automation in the Chromium browser
browsergym
BrowserGym: a gym environment for web task automation in the Chromium browser
libwebarena
This is an unofficial, use-at-your-own risks port of the webarena benchmark, for use as a standalone library package.
pyscm-ml
The Set Covering Machine algorithm
browsergym-workarena
WorkArena benchmark for BrowserGym
geobench
A benchmark designed to advance foundation models for Earth monitoring, tailored for remote sensing. It encompasses six classification and six segmentation tasks, curated for precision and model evaluation. The package also features a comprehensive evaluation methodology and showcases results from 20 established baseline models.
tactis
Transformer-Attentional Copulas for Multivariate Time Series
geo-benchmark
A benchmark designed to advance foundation models for Earth monitoring, tailored for remote sensing. It encompasses six classification and six segmentation tasks, curated for precision and model evaluation. The package also features a comprehensive evaluation methodology and showcases results from 20 established baseline models.
synbols
Synbols: Probing Learning Algorithms with Synthetic Datasets