Environments

Synthetic Environments / Long Horizon Tasks / Digital Control Agents

Coverage Python License ![Tests](https://img.shields.io/badge/tests-7/19 failing-red) ![Type Check Examples](https://img.shields.io/badge/types (examples)-98%25 (5 errors)-green) ![Type Check Core](https://img.shields.io/badge/types (core)-100%25 (0 errors)-green) ![PyPI Dev](https://img.shields.io/badge/PyPI dev-0.0.1.dev2-orange)

uvicorn src.synth_env.service.app:app --reload --port 8000
uv run python src/synth_env/examples/nethack/agent_demos/test_synth_react.py

Motivation

We're building Environments to meet the key desiderata for long-horizon language agents:

  • Snapshotting and reproducibility
  • Statefulness as a first-class citizen
  • Consistent abstractions for engine interaction and state reads across settings
  • Environment observability and tracing
  • HTTP access for simplified training and evaluation
  • Strong data abstractions to enable easily-configurable filtering and curriculum learning
  • Simple abstractions for agent-originated tool edits, etc.
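
To make the snapshotting and statefulness goals above concrete, here is a minimal sketch of a stateful environment that can be snapshotted and restored so that a rollout replays identically. The `CounterEnv` class and its `step`, `snapshot`, and `restore` methods are hypothetical names invented for this illustration; they are not the synth_env API.

```python
import random
from dataclasses import dataclass, field


@dataclass
class CounterEnv:
    """Toy stateful environment (hypothetical, not part of synth_env)."""

    seed: int = 0
    total: int = 0
    steps: int = 0
    rng: random.Random = field(init=False, repr=False)

    def __post_init__(self) -> None:
        # A private RNG keeps the environment's randomness part of its state.
        self.rng = random.Random(self.seed)

    def step(self, action: int) -> int:
        # Each step adds the action plus seeded noise to the running total.
        self.total += action + self.rng.randint(0, 9)
        self.steps += 1
        return self.total

    def snapshot(self) -> dict:
        # Capture everything needed to resume, including the RNG state.
        return {
            "total": self.total,
            "steps": self.steps,
            "rng_state": self.rng.getstate(),
        }

    def restore(self, snap: dict) -> None:
        # Rewind the environment to a previously captured snapshot.
        self.total = snap["total"]
        self.steps = snap["steps"]
        self.rng.setstate(snap["rng_state"])


def replay_matches() -> bool:
    """Run the same actions twice from one snapshot and compare the results."""
    env = CounterEnv(seed=42)
    env.step(1)
    snap = env.snapshot()
    first = [env.step(a) for a in (2, 3, 4)]
    env.restore(snap)
    second = [env.step(a) for a in (2, 3, 4)]
    return first == second
```

Including the RNG state in the snapshot is the design choice that matters here: restoring only the visible counters would leave later steps non-reproducible.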

User Guide

Note: this repo is under extremely active development. Hic sunt dracones (here be dragons). If you are not contributing, it may be more useful as a reference or development resource than as core code for your production systems.

Supported Environments

  • [ ] Sokoban (maturing, not active dev)
      - This environment is not of our own making! Please credit the great researchers at DeepMind and Ragen, among others.
      - Check out test_synth_react.py for a hello world example!
  • [ ] Hendrycks Math (maturing)
  • [ ] Crafter-Classic (maturing)
  • [ ] EnronBench (active dev)
  • [ ] SWE-Bench (active dev - do not attempt)
  • [ ] NMMO (active dev - do not attempt)
  • [ ] Red (active dev)
  • [ ] Verilog (maturing)

...

Development

For development setup, testing, and contribution guidelines, see:

Quick Development Commands

# Install dependencies
uv sync

# Run tests
./scripts/run_tests.sh

# Update README metrics
./scripts/run_dev_helpers.sh

# Format code
ruff format .

# Publish package (increment dev version)
python scripts/publish.py

# Publish with version selection
python scripts/publish.py --minor

Download files

Source Distribution

synth_env-0.0.1.dev9.tar.gz (4.6 MB view details)

Uploaded Source

Built Distribution

synth_env-0.0.1.dev9-py3-none-any.whl (2.2 MB view details)

Uploaded Python 3

File details

Details for the file synth_env-0.0.1.dev9.tar.gz.

File metadata

  • Download URL: synth_env-0.0.1.dev9.tar.gz
  • Upload date:
  • Size: 4.6 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.8

File hashes

Hashes for synth_env-0.0.1.dev9.tar.gz
Algorithm Hash digest
SHA256 618483a2daf52888058e3c18b363e876cbbe1dc19993b2808d41552b9278927a
MD5 9110669e5d36f82437da74b6accb684d
BLAKE2b-256 9137493d6fd714319bab84b0998a150af4116a2e9bc35452a0b59be7e2818b0a
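
A downloaded file can be checked against the SHA256 digest listed above. The sketch below uses only Python's standard `hashlib`; the helper names `sha256_of_file` and `matches_expected`, and whatever local path you pass in, are assumptions for illustration.

```python
import hashlib
from pathlib import Path

# SHA256 digest listed above for synth_env-0.0.1.dev9.tar.gz.
EXPECTED_SHA256 = "618483a2daf52888058e3c18b363e876cbbe1dc19993b2808d41552b9278927a"


def sha256_of_file(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as fh:
        # Read fixed-size chunks until read() returns empty bytes.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path: Path, expected: str = EXPECTED_SHA256) -> bool:
    """Compare a file's digest with the expected hex string (hypothetical helper)."""
    return sha256_of_file(path) == expected.lower()
```

For example, `matches_expected(Path("synth_env-0.0.1.dev9.tar.gz"))` would be expected to return `True` for an intact download saved under that name in the current directory.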

File details

Details for the file synth_env-0.0.1.dev9-py3-none-any.whl.

File metadata

  • Download URL: synth_env-0.0.1.dev9-py3-none-any.whl
  • Upload date:
  • Size: 2.2 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.8

File hashes

Hashes for synth_env-0.0.1.dev9-py3-none-any.whl
Algorithm Hash digest
SHA256 cb244c004f0a66cac94104cf18908f3d20529f5983b0cbb40f749a40c2ba99ae
MD5 61aac7a1d5cb2579a33c1de278a8baa2
BLAKE2b-256 89f20fbdc07b086ab0c69de5e7657484def73ccb300a8233187a7054c8eebf8d
