Skip to main content

HUD environments for Reinforcement Learning through Verifiers

Project description

hud-vf-gym

Verifiers Adapter for HUD environments - bridges Verifiers RL framework with HUD's MCP infrastructure for training and evaluating agents.

Training Support

Verifiers' GRPOTrainer does not support multimodal training as of now, you can use an experimental trainer for single-turn environments (with single prompt image due to transformer's limitations). Multi-turn multimodal support is WIP.

Prerequisites

  • Python >=3.12
  • HUD API key from https://app.hud.so
  • Environment variables:
    export HUD_API_KEY="your-api-key"
    export OPENAI_API_KEY="your-key"  # or ANTHROPIC_API_KEY
    

Quick Start

import verifiers as vf

# Load environment with HUD taskset and config
env = vf.load_environment(
    env_id="hud-vf-gym",
    taskset="hud-evals/2048-taskset",  # HuggingFace dataset
    config_path="./configs/2048.yaml",  # Environment config
    num_tasks=10
)

Documentation

For comprehensive usage, examples, and configuration details, see: hud-python/rl README

The main documentation covers:

  • Running evaluations with various models
  • Training agents with GRPO
  • Creating custom environments and configs
  • Dataset format and creation
  • Action mappings and tool configuration
  • Troubleshooting guide

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

hud_vf_gym-0.1.1.tar.gz (13.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

hud_vf_gym-0.1.1-py3-none-any.whl (13.2 kB view details)

Uploaded Python 3

File details

Details for the file hud_vf_gym-0.1.1.tar.gz.

File metadata

  • Download URL: hud_vf_gym-0.1.1.tar.gz
  • Upload date:
  • Size: 13.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.8.13

File hashes

Hashes for hud_vf_gym-0.1.1.tar.gz
Algorithm Hash digest
SHA256 c161f8396e0c9551000876693f7cee2c58b1604b3c9231b94f19dca179a97a6f
MD5 f2dac2fe746508df9cb3dd8c295d6f1a
BLAKE2b-256 bfeef3056e8331dfd6b42714ea72089f4ff94e6646f015e3d91e3c7bc82fe427

See more details on using hashes here.

File details

Details for the file hud_vf_gym-0.1.1-py3-none-any.whl.

File metadata

  • Download URL: hud_vf_gym-0.1.1-py3-none-any.whl
  • Upload date:
  • Size: 13.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.8.13

File hashes

Hashes for hud_vf_gym-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 49180831a4db8187e00c38526663a75550da550010e635e7d78653cd11d8af7b
MD5 8988be81d94825619adf35b664918d5e
BLAKE2b-256 ef91e0a65796416a14743ce142c7af65201f1a29f4a3786e05ebd6ede6f70e41

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page