
Tools for single-embodiment, multiple-task reinforcement learning

Project description

AgentFlow: A Modular Toolkit for Scalable RL Research

Overview

AgentFlow is a library for composing reinforcement learning (RL) agents. Its core features are:

  1. tools for slicing, transforming, and composing specs
  2. tools for encapsulating and composing RL tasks
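To make "slicing and composing specs" concrete, here is a minimal sketch in plain Python. It does not use AgentFlow's actual API: `BoundedSpec`, `slice_spec`, and `concat_specs` are hypothetical names chosen for illustration, and the spec is assumed to be a flat, named, bounded vector.

```python
from dataclasses import dataclass
from typing import Sequence, Tuple


@dataclass(frozen=True)
class BoundedSpec:
    """Hypothetical stand-in for a bounded 1-D spec (not AgentFlow's type)."""
    names: Tuple[str, ...]
    minimum: Tuple[float, ...]
    maximum: Tuple[float, ...]

    def __post_init__(self):
        if not (len(self.names) == len(self.minimum) == len(self.maximum)):
            raise ValueError("names, minimum and maximum must have equal length")


def slice_spec(spec: BoundedSpec, start: int, stop: int) -> BoundedSpec:
    """Returns the sub-spec covering elements [start, stop)."""
    return BoundedSpec(spec.names[start:stop],
                       spec.minimum[start:stop],
                       spec.maximum[start:stop])


def concat_specs(specs: Sequence[BoundedSpec]) -> BoundedSpec:
    """Composes several specs into one by concatenating their elements."""
    names: Tuple[str, ...] = ()
    lo: Tuple[float, ...] = ()
    hi: Tuple[float, ...] = ()
    for s in specs:
        names += s.names
        lo += s.minimum
        hi += s.maximum
    return BoundedSpec(names, lo, hi)


# Compose an arm spec and a gripper spec, then slice out the last two elements.
arm = BoundedSpec(("j0", "j1", "j2"), (-1.0, -1.0, -1.0), (1.0, 1.0, 1.0))
gripper = BoundedSpec(("grip",), (0.0,), (1.0,))
full = concat_specs([arm, gripper])
wrist = slice_spec(full, 2, 4)
```

Keeping names and bounds together means a sliced or concatenated spec still describes exactly the elements it covers, which is what lets a skill trained on one sub-system be recombined with others later.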

Unlike the standard RL setup, which assumes a single environment and a single agent, AgentFlow is designed for the single-embodiment, multiple-task regime. This design was motivated by the robotics use case, which frequently requires training RL modules for separate skills and then composing them, possibly together with non-learned controllers.

Instead of implementing a separate RL environment for each skill and combining them ad hoc, with AgentFlow you define one or more SubTasks, each of which modifies the timestep from a single top-level environment, for example by adding observations, defining rewards, or isolating a particular sub-system of the environment such as a robot arm.
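The idea of a SubTask modifying a parent timestep can be sketched as follows. This is an illustration under stated assumptions, not AgentFlow's real interface: `TimeStep`, `ReachSubTask`, `to_agent_timestep`, and the observation keys are all hypothetical.

```python
from dataclasses import dataclass, replace
from typing import Dict, Optional, Sequence


@dataclass(frozen=True)
class TimeStep:
    """Hypothetical minimal timestep; not the type AgentFlow actually uses."""
    observation: Dict[str, Sequence[float]]
    reward: Optional[float] = None


class ReachSubTask:
    """Hypothetical subtask: rewards moving the arm tip close to a target."""

    def __init__(self, target: Sequence[float]):
        self._target = tuple(target)

    def to_agent_timestep(self, parent: TimeStep) -> TimeStep:
        tip = parent.observation["arm_tip_pos"]
        # Euclidean distance between the arm tip and the target.
        dist = sum((a - b) ** 2 for a, b in zip(tip, self._target)) ** 0.5
        # Isolate the arm sub-system and add a task-specific observation.
        obs = {"arm_tip_pos": tip, "target_pos": self._target}
        # Define the reward for this skill only; the parent step is untouched.
        return replace(parent, observation=obs, reward=-dist)


# One top-level environment step, viewed through the subtask.
env_step = TimeStep({"arm_tip_pos": (0.0, 0.0, 0.0), "base_vel": (0.2, 0.0)})
agent_step = ReachSubTask(target=(3.0, 4.0, 0.0)).to_agent_timestep(env_step)
```

Because the subtask returns a new timestep rather than mutating the parent, several subtasks can each derive their own view from the same environment step.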

You then compose SubTasks with regular RL agents to form modules, and use a set of graph-building operators to define the flow of these modules over time (hence the name AgentFlow).

The graph-building step is entirely optional, and is intended only for use-cases that require something like a (possibly learnable, possibly stochastic) state-machine.
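As a rough picture of what "flow of modules over time" can mean, the sketch below chains two modules in sequence, handing control to the next one when the current module's termination condition holds. `Module` and `SequenceFlow` are hypothetical names, the toy environment is a single float position, and the transition rule is deterministic; none of this is AgentFlow's own set of operators.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional, Sequence, Tuple


@dataclass(frozen=True)
class Module:
    """Hypothetical module: a policy plus a termination condition."""
    name: str
    act: Callable[[float], float]      # observation -> action
    is_done: Callable[[float], bool]   # observation -> has this skill finished?


class SequenceFlow:
    """Hypothetical operator that runs modules one after another."""

    def __init__(self, modules: Sequence[Module]):
        self._modules = list(modules)
        self._index = 0

    def step(self, observation: float) -> Optional[Tuple[str, float]]:
        # Skip every module whose termination condition already holds.
        while (self._index < len(self._modules)
               and self._modules[self._index].is_done(observation)):
            self._index += 1
        if self._index == len(self._modules):
            return None  # The whole flow has finished.
        module = self._modules[self._index]
        return module.name, module.act(observation)


# Toy 1-D world: "reach" moves right until x >= 3, "retract" moves left until x <= 1.
reach = Module("reach", act=lambda x: 1.0, is_done=lambda x: x >= 3.0)
retract = Module("retract", act=lambda x: -1.0, is_done=lambda x: x <= 1.0)
flow = SequenceFlow([reach, retract])

x = 0.0
trace: List[str] = []
for _ in range(20):  # Upper bound on steps so the loop always ends.
    out = flow.step(x)
    if out is None:
        break
    name, action = out
    trace.append(name)
    x += action
```

A learnable or stochastic state machine would replace the fixed "advance when done" rule with a transition function that chooses the next module, but the structure of stepping the active module and checking for a hand-off stays the same.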

Components

Control Flow

Examples


