16 projects
cube-standard
Common Unified Benchmark Environments
cube-web-tool
Web search and fetch tools for cube benchmarks
workarena-cube
WorkArena ServiceNow benchmark for cube
webarena-verified-cube
WebArena-Verified benchmark cube — 812 verified web automation tasks
osworld-cube
OSWorld benchmark ported to the CUBE protocol
miniwob-cube
MiniWob++ benchmark for cube
cube-chat
Concrete chat session implementations for cube-standard
cube-browser-playwright
Concrete browser session implementations for cube-standard
cube-browser-tool
Concrete browser tool implementations for cube-standard benchmarks
cube-chat-tool
Chat tool implementation for cube-standard benchmarks
cube-vm-backend
VM backend implementations for CUBE desktop-automation benchmarks
cube-computer-tool
Generic desktop computer tool for CUBE VM-based benchmarks
agentlab
Main package for developing agents and experiments
browsergym-webarenalite
WebArena Lite benchmark for BrowserGym
browsergym-webarena-verified
WebArena Verified benchmark for BrowserGym
browsergym
BrowserGym: a gym environment for web task automation in the Chromium browser