Skip to main content

Fast reinforcement learning research

Project description

PyPI   Docs

Embodied

Fast reinforcement learning research.

Overview

The goal of Embodied is to empower researchers to quickly implement new agents at scale. Embodied achieves this by specifying an interface both for environments and agents, allowing users to mix and match agents, envs, and evaluation protocols. Embodied provides common building blocks that users are encouraged to fork when more control is needed. The only dependency is Numpy and agents can be implemented in any framework.

Packages

embodied/
  core/    # Config, logging, checkpointing, simulation, wrappers
  run/     # Evaluation protocols that combine agents and environments
  envs/    # Environment suites such as Gym, Atari, DMC, Crafter
  agents/  # Agent implementations

Agent API

class Agent:
  __init__(obs_space, act_space, config)
  policy(obs, carry, mode='train') -> act, carry
  train(data, carry) -> metrics, carry
  report(data, carry) -> metrics, carry
  init_policy(batch_size) -> carry
  init_train(batch_size) -> carry
  init_report(batch_size) -> carry
  dataset(generator) -> generator

Env API

class Env:
  __len__() -> int
  @obs_space -> dict of spaces
  @act_space -> dict of spaces
  step(act) -> obs dict
  render() -> array
  close()

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

embodied-1.2.0.tar.gz (24.3 kB view details)

Uploaded Source

File details

Details for the file embodied-1.2.0.tar.gz.

File metadata

  • Download URL: embodied-1.2.0.tar.gz
  • Upload date:
  • Size: 24.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.8

File hashes

Hashes for embodied-1.2.0.tar.gz
Algorithm Hash digest
SHA256 0f66b513a515240d9d936b7ac070451b92afc9bb4f6b9a422facf235e9d9a611
MD5 2753f2ce9dcafdb08b7c4e30ac83638d
BLAKE2b-256 8dcecd00c848d382bbd93e4b993407f496cfabb574a19a36a2a64d2d4759d626

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page