Skip to main content

HUD environments for Reinforcement Learning through Verifiers

Project description

hud-vf-gym

Verifiers Adapter for HUD environments - bridges Verifiers RL framework with HUD's MCP infrastructure for training and evaluating agents.

Training Support

Verifiers' GRPOTrainer does not support multimodal training as of now, you can use an [experimental trainer]https://github.com/jdchawla29/verifiers) for single-turn environments (with single prompt image due to transformer's limitations). Multi-turn multimodal support is WIP.

Prerequisites

  • Python >=3.12
  • HUD API key from https://app.hud.so
  • Environment variables:
    export HUD_API_KEY="your-api-key"
    export OPENAI_API_KEY="your-key"  # or ANTHROPIC_API_KEY
    

Quick Start

import verifiers as vf

# Load environment with HUD taskset and config
env = vf.load_environment(
    env_id="hud-vf-gym",
    taskset="hud-evals/2048-taskset",  # HuggingFace dataset
    config_path="./configs/2048.yaml",  # Environment config
    num_tasks=10
)

Documentation

For comprehensive usage, examples, and configuration details, see: hud-python/rl README

The main documentation covers:

  • Running evaluations with various models
  • Training agents with GRPO
  • Creating custom environments and configs
  • Dataset format and creation
  • Action mappings and tool configuration
  • Troubleshooting guide

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

hud_vf_gym-0.1.0.tar.gz (13.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

hud_vf_gym-0.1.0-py3-none-any.whl (12.9 kB view details)

Uploaded Python 3

File details

Details for the file hud_vf_gym-0.1.0.tar.gz.

File metadata

  • Download URL: hud_vf_gym-0.1.0.tar.gz
  • Upload date:
  • Size: 13.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.8.13

File hashes

Hashes for hud_vf_gym-0.1.0.tar.gz
Algorithm Hash digest
SHA256 4594702154eaa8a49775d57ef1fa5a53d818701429cfc6d904d8e931dc2f2b97
MD5 41a6818f2e60c49bfc24e3c7e3f1d7d5
BLAKE2b-256 350d1a91ec9273c990e4f67d43b5505dc939d90261ab1fd1f5599b5269bd22ca

See more details on using hashes here.

File details

Details for the file hud_vf_gym-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: hud_vf_gym-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 12.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.8.13

File hashes

Hashes for hud_vf_gym-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 39b4255bca19281250db70ade31d5b856bc6f9d74c750302fd14389fcddb5723
MD5 0a7912bd637b6da4959d118cb2d045a2
BLAKE2b-256 f49375c08d015f6a895fcc6b6a6390225231227a024fe45d3990c9f468871526

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page