Skip to main content

Standardizing environment infrastructure with Strands Agents — step, observe, reward.

Project description

strands-env

CI PyPI License Ask DeepWiki

Standardizing environment infrastructure with Strands Agents — step, observe, reward.

Features

This package treats each env.step() as a full agent loop (prompt → (tool_call, tool_response)* → response), not a single model call.

  • Define Environments — Subclass Environment, add @tool functions, plug in RewardFunction
  • RL Training — Token-level observations for on-policy training with strands-sglang
  • Benchmarking — CLI and Evaluator with checkpointing, resume, and custom metrics

Install

pip install strands-env

For development:

git clone https://github.com/horizon-rl/strands-env.git && cd strands-env
pip install -e ".[dev]"

Quick Start

Define an Environment

Subclass Environment and add tools as @tool-decorated functions:

from strands import tool
from strands_env.core import Environment

@tool
def calculator(expression: str) -> str:
    """Evaluate a math expression."""
    return str(eval(expression))

class MathEnv(Environment):
    def get_tools(self):
        return [calculator]

Run It

env = MathEnv(model_factory=factory, reward_fn=reward_fn)
result = await env.step(Action(message="What is 2^10?", task_context=TaskContext(ground_truth="1024")))

result.observation.final_response   # "The answer is 1024"
result.reward.reward                # 1.0
result.termination_reason           # TerminationReason.TASK_COMPLETE

See examples/calculator_demo.py for a complete example.

Run Evaluations

strands-env eval aime-2024 \
    --env examples/envs/calculator_env.py \
    --backend sglang \
    --base-url http://localhost:30000 \
    --n-samples-per-prompt 8 \
    --max-concurrency 30

Tip: For a non-agentic benchmark (no tool use), simply don't override get_tools() in your environment — the base class returns [] by default.

Documentation

Development

# Lint
ruff check src/ && ruff format --check src/

# Unit tests
pytest tests/unit/ -v

# Integration tests (requires running SGLang server)
pytest tests/integration/ -v --sglang-base-url=http://localhost:30000

Or if using Claude Code, just use /run-unit-tests and /run-integration-tests slash commands.

License

Apache License 2.0 — see LICENSE.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

strands_env-0.2.1.tar.gz (75.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

strands_env-0.2.1-py3-none-any.whl (71.8 kB view details)

Uploaded Python 3

File details

Details for the file strands_env-0.2.1.tar.gz.

File metadata

  • Download URL: strands_env-0.2.1.tar.gz
  • Upload date:
  • Size: 75.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for strands_env-0.2.1.tar.gz
Algorithm Hash digest
SHA256 b76617001ca66df5589689851a3fd5decc38026730f153c6c256b8a58596cc6b
MD5 1f1a9353940a9570a1c375efbcb14e6e
BLAKE2b-256 2b33d7898411c568095a746b46d6045793bd6f1e2b3b37ff57967667a5cbd634

See more details on using hashes here.

Provenance

The following attestation bundles were made for strands_env-0.2.1.tar.gz:

Publisher: publish.yml on horizon-rl/strands-env

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file strands_env-0.2.1-py3-none-any.whl.

File metadata

  • Download URL: strands_env-0.2.1-py3-none-any.whl
  • Upload date:
  • Size: 71.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for strands_env-0.2.1-py3-none-any.whl
Algorithm Hash digest
SHA256 513956f7aa6ad0c16e316986a290f1680dc971495fbcc302144fc7403153903e
MD5 fe9d91ea98892030c4337176ec3a6214
BLAKE2b-256 7984ad3289c055ffcdb1d576bf75718f7f3f93dbdd2a77e9c1d94157f125e146

See more details on using hashes here.

Provenance

The following attestation bundles were made for strands_env-0.2.1-py3-none-any.whl:

Publisher: publish.yml on horizon-rl/strands-env

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page