Tools for single-embodiment, multiple-task, Reinforcement Learning
Project description
AgentFlow: A Modular Toolkit for Scalable RL Research
Overview
AgentFlow is a library for composing Reinforcement-Learning agents. The core
features that AgentFlow provides are:
- tools for slicing, transforming, and composing specs
- tools for encapsulating and composing RL-tasks.
Unlike the standard RL setup, which assumes a single environment and an agent,
AgentFlow is designed for the single-embodiment, multiple-task regime. This
was motivated by the robotics use-case, which frequently requires training RL
modules for various skills, and then composing them (possibly with non-learned
controllers too).
Instead of having to implement a separate RL environment for each skill and
combine them ad hoc, with AgentFlow you can define one or more SubTasks
which modify a timestep from a single top-level environment, e.g. adding
observations and defining rewards, or isolating a particular sub-system of the
environment, such as a robot arm.
You then compose SubTasks with regular RL-agents to form modules, and use a
set of graph-building operators to define the flow of these modules over time
(hence the name AgentFlow).
The graph-building step is entirely optional, and is intended only for use-cases that require something like a (possibly learnable, possibly stochastic) state-machine.
Components
Control Flow
Examples
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file dm_robotics_agentflow-0.10.0-py3-none-any.whl.
File metadata
- Download URL: dm_robotics_agentflow-0.10.0-py3-none-any.whl
- Upload date:
- Size: 144.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.12.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
ad2a651a93b0cb5287345faa2b94e6de046a80bc3d69fb5a1828059dc9ba3710
|
|
| MD5 |
c510bf915934700bd3c2f98ea7359ebf
|
|
| BLAKE2b-256 |
d8b438b2c9e1cdb307a63b87d685379ec5cfbdb13a10fc98e23c3cdd00942b49
|