Standardizing environment infrastructure with Strands Agents — step, observe, reward.

strands-env

Features

This package treats each env.step() as a full agent loop (prompt → (tool_call, tool_response)* → response), not a single model call.

  • Define Environments — Subclass Environment, add @tool functions, plug in RewardFunction
  • RL Training — Token-level observations for on-policy training with strands-sglang
  • Benchmarking — CLI and Evaluator with checkpointing, resume, and custom metrics
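The agent loop that a single env.step() covers (prompt, zero or more tool calls, then a final response) can be pictured with a minimal, library-free sketch. Everything here (ModelTurn, StepTrace, run_step, scripted_model) is a hypothetical stand-in for illustration, not the strands-env API:

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class ModelTurn:
    """One model output: a tool request or a final answer (hypothetical type)."""
    text: str = ""
    tool_name: Optional[str] = None
    tool_input: str = ""


@dataclass
class StepTrace:
    """Everything produced during one step (hypothetical type)."""
    tool_calls: list = field(default_factory=list)
    final_response: str = ""


def run_step(prompt, model, tools, max_turns=8):
    """Loop until the model stops requesting tools or the turn budget runs out."""
    trace = StepTrace()
    history = [("user", prompt)]
    for _ in range(max_turns):
        turn = model(history)
        if turn.tool_name is None:
            # No tool requested: this is the final response.
            trace.final_response = turn.text
            break
        result = tools[turn.tool_name](turn.tool_input)
        trace.tool_calls.append((turn.tool_name, turn.tool_input, result))
        history.append(("tool", result))
    return trace


def scripted_model(history):
    """Stand-in for a real model: call the tool once, then answer."""
    if history[-1][0] == "user":
        return ModelTurn(tool_name="calculator", tool_input="2**10")
    return ModelTurn(text=f"The answer is {history[-1][1]}")


trace = run_step(
    "What is 2^10?",
    scripted_model,
    {"calculator": lambda expr: str(2 ** 10) if expr == "2**10" else "unsupported"},
)
print(trace.tool_calls)
print(trace.final_response)
```

The point of the sketch is only the shape of the loop: one step may contain several model calls and tool executions before a single final response is produced.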

Install

pip install strands-env

For development:

git clone https://github.com/horizon-rl/strands-env.git && cd strands-env
pip install -e ".[dev]"

Quick Start

Define an Environment

Subclass Environment and add tools as @tool-decorated functions:

from strands import tool
from strands_env.core import Environment

@tool
def calculator(expression: str) -> str:
    """Evaluate a math expression."""
    return str(eval(expression))

class MathEnv(Environment):
    def get_tools(self):
        return [calculator]
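Because eval() executes arbitrary Python, a tool that receives model-generated input may want a restricted evaluator instead. The following is a sketch using only the standard library's ast module; safe_eval is a hypothetical helper, not part of strands-env, and it supports just +, -, *, / and ** over numeric literals:

```python
import ast
import operator

# Whitelisted operators; anything else is rejected.
_BINARY_OPS = {
    ast.Add: operator.add,
    ast.Sub: operator.sub,
    ast.Mult: operator.mul,
    ast.Div: operator.truediv,
    ast.Pow: operator.pow,
}
_UNARY_OPS = {ast.UAdd: operator.pos, ast.USub: operator.neg}


def safe_eval(expression: str):
    """Evaluate basic arithmetic over numeric literals; reject everything else."""

    def _eval(node):
        if isinstance(node, ast.Expression):
            return _eval(node.body)
        if isinstance(node, ast.Constant) and type(node.value) in (int, float):
            return node.value
        if isinstance(node, ast.BinOp) and type(node.op) in _BINARY_OPS:
            return _BINARY_OPS[type(node.op)](_eval(node.left), _eval(node.right))
        if isinstance(node, ast.UnaryOp) and type(node.op) in _UNARY_OPS:
            return _UNARY_OPS[type(node.op)](_eval(node.operand))
        raise ValueError(f"unsupported expression: {expression!r}")

    return _eval(ast.parse(expression, mode="eval"))


print(safe_eval("2**10"))
```

Note that ^ is bitwise XOR in Python, so eval("2^10") yields 8 rather than 1024; this sketch rejects ^ outright with a ValueError. It does not bound the size of exponents, so a production tool would likely add limits.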

Run It

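# Assumes `factory` and `reward_fn` are already defined and that Action and
# TaskContext are imported (import paths not shown here). Because of `await`,
# this must run inside an async function or an async-aware REPL.
env = MathEnv(model_factory=factory, reward_fn=reward_fn)
result = await env.step(Action(message="What is 2^10?", task_context=TaskContext(ground_truth="1024")))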

result.observation.final_response   # "The answer is 1024"
result.reward.reward                # 1.0
result.termination_reason           # TerminationReason.TASK_COMPLETE

See examples/calculator_demo.py for a complete example.
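To make the reward of 1.0 above concrete, here is one way such a score could be computed from the final response and the ground truth. This is a plain-function sketch under assumptions: the name exact_match_reward and its signature are hypothetical, and the actual RewardFunction interface in strands-env is not shown in this README:

```python
import re


def exact_match_reward(final_response: str, ground_truth: str) -> float:
    """Return 1.0 if the last number in the response equals the ground truth."""
    # Collect integer or decimal literals, with an optional leading minus sign.
    numbers = re.findall(r"-?\d+(?:\.\d+)?", final_response)
    if not numbers:
        return 0.0
    return 1.0 if numbers[-1] == ground_truth.strip() else 0.0


print(exact_match_reward("The answer is 1024", "1024"))
```

Comparing the last number rather than the whole string tolerates surrounding prose such as "The answer is", at the cost of ignoring formatting differences like "1,024" or "1024.0", which a real reward function would need to normalize.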

Run Evaluations

strands-env eval aime-2024 \
    --env examples/envs/calculator_env.py \
    --backend sglang \
    --base-url http://localhost:30000 \
    --n-samples-per-prompt 8 \
    --max-concurrency 30
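The --n-samples-per-prompt and --max-concurrency flags suggest a fan-out of several rollouts per prompt with a cap on how many run at once. How the CLI implements this is not stated here; the sketch below shows one common pattern with asyncio.Semaphore, where run_all and fake_rollout are hypothetical names:

```python
import asyncio


async def run_all(prompts, n_samples_per_prompt, max_concurrency, rollout):
    """Run n samples per prompt with at most max_concurrency rollouts in flight."""
    semaphore = asyncio.Semaphore(max_concurrency)

    async def guarded(prompt, sample_idx):
        async with semaphore:
            return await rollout(prompt, sample_idx)

    tasks = [
        guarded(prompt, i)
        for prompt in prompts
        for i in range(n_samples_per_prompt)
    ]
    # gather() returns results in the same order as the tasks were listed.
    return await asyncio.gather(*tasks)


async def _demo():
    in_flight = 0
    peak = 0

    async def fake_rollout(prompt, sample_idx):
        # Stand-in for a real env.step() call; tracks concurrent executions.
        nonlocal in_flight, peak
        in_flight += 1
        peak = max(peak, in_flight)
        await asyncio.sleep(0.01)
        in_flight -= 1
        return (prompt, sample_idx)

    results = await run_all(["p0", "p1", "p2"], 8, 5, fake_rollout)
    return results, peak


results, peak = asyncio.run(_demo())
print(len(results), peak)
```

With three prompts and eight samples each, the demo schedules 24 rollouts while the semaphore keeps the number in flight at or below the cap.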

Tip: For a non-agentic benchmark (no tool use), simply don't override get_tools() in your environment — the base class returns [] by default.

Documentation

Development

# Lint
ruff check src/ && ruff format --check src/

# Unit tests
pytest tests/unit/ -v

# Integration tests (requires running SGLang server)
pytest tests/integration/ -v --sglang-base-url=http://localhost:30000

If you use Claude Code, you can instead run the /run-unit-tests and /run-integration-tests slash commands.

License

Apache License 2.0 — see LICENSE.
