Methods for robot planning with abstractions.

Project description

Bilevel Planning

Methods for robot planning with abstractions.

Requirements

Python 3.10+
Tested on MacOS Catalina

Installation

We strongly recommend uv. The steps below assume that you have uv installed. If you do not, just remove uv from the commands and the installation should still work.

# Install PRPL dependencies.
uv pip install -r prpl_requirements.txt
# Install this package and third-party dependencies.
uv pip install -e ".[develop]"

Primer on Bilevel Planning

This section gives a brief overview of bilevel planning as implemented in this codebase.

Problem Setting

We consider fully-observed, deterministic, goal-based planning problems. (There are ways to weaken each of these assumptions, but we will keep things simple to start.) Each problem is characterized by:

A state space $\mathcal{X}$ where $x \in \mathcal{X}$ is a state
An action space $\mathcal{U}$ where $u \in \mathcal{U}$ is an action
A transition function $f: \mathcal{X} \times \mathcal{U} \to \mathcal{X}$
An initial state $x_0 \in \mathcal{X}$
A goal $g \subseteq \mathcal{X}$

In general, our objective is to find a plan $u_1, \dots, u_T$ that achieves the goal, that is, $x_T \in g$ and $x_t = f(x_{t-1}, u_t)$ for $1 \le t \le T$. Depending on the setting, we may also care about the plan length $T$, and the planning time, that is, the actual time taken to find the plan itself, among other possible metrics.

We typically consider not just one planning problem, but a distribution of planning problems that have shared state spaces, action spaces, and transition functions, but different initial states and goals. Sometimes we assume that a training set of planning problems are available (perhaps with example plans) and reserve a test set.

Example:
A robot is tasked with moving objects from a table onto a shelf. The initial state includes the poses of all objects and the configuration of the robot. Actions are changes to the robot configuration. The transition function models physics.

Abstractions

Bilevel planning uses abstractions to make planning more efficient and effective. In particular:

An abstract state space $\mathcal{S}$ where $s \in \mathcal{S}$ is an abstract state
An abstract action space $\mathcal{A}$ where $a \in \mathcal{A}$ is an abstract action
An abstract transition function $F: \mathcal{S} \times \mathcal{A} \to \mathcal{S}$
A state abstractor $\Psi: \mathcal{X} \to \mathcal{S}$

Example:
The initial abstract state is "cup on table, bowl on table, robot hand empty, ...". Note that this (discrete) abstract state loses information about the exact (continuous) object poses and robot configuration. Abstract actions are "grasp cup", "place bowl in shelf", etc. The abstract transition function says, for example, that "grasp cup" in "cup on table, bowl on table, robot hand empty, ..." will lead to "cup grasped, bowl on table, ...".

In practice, rather than using an abstract transition function, we use an abstract successor function that outputs a sequence of (abstract action, next abstract state). This is convenient when there are a large number of possible abstract actions, but a small number that are valid for any given abstract state.

Bilevel Planning Graph

There are different ways that abstractions can help with planning. We discuss a few below. The common thread is that all planners construct a bilevel planning graph as they search for a plan. A bilevel planning graph includes:

State nodes: associated with states $x \in \mathcal{X}$
Abstract state nodes: associated with abstract states $s \in \mathcal{S}$
Action edges: between state nodes, associated with actions $u \in \mathcal{U}$
Abstract action edges: between abstract state nodes, associated with abstract actions $a \in \mathcal{A}$
State abstractor edges: from a state node with $x$ to an abstract state node with $s$ where $\Psi(x) = s$

Note that this graph is certainly "partial" in the sense that it does not include all states, abstract states, actions, and abstract actions. The job of a bilevel planner is to determine which nodes and edges to create while searching for a plan.

Bilevel Planner Examples

We now discuss a few examples of bilevel planners.

Abstract Breadth-First Search

Visualization of abstract breadth-first

See test_abstract_bfs_planner.py for GIF generation.

This bilevel planner starts by abstracting the initial state. It then performs breadth-first search in the abstract state and action space. Expanding an abstract state node with $s$ works as follows:

for a in A:  # optimization: valid actions only
  ns = F(s, a)  # defines a "subgoal"
  repeat N times:
    x = sample_state(s)  # among x already in graph
    try:
      x_traj, u_traj = sample_trajectory(x, s, a, ns)
    except TrajectorySamplingFailure:
      continue

The sample_trajectory function attempts to find a trajectory of states and actions where:

The initial state x_traj[0] = x is already in the graph and abstracts to s
The final state x_traj[-1] abstracts to ns
The trajectory follows the transition function $f$

There are different ways to implement sample_trajectory. One technique is to use trajectory optimization. This typically requires the transition function $f$ to be smooth, and sometimes requires it to be implemented in a certain way (e.g., with Jax or PyTorch or Drake wrappers). Another technique is to assume that each abstract action $a$ is associated with a parameterized policy $u = \pi_a(x \mid \theta)$ and a sampler that proposes parameters $\theta$ conditioned on the state where the policy is first invoked. We typically use this second technique.

This bilevel planner can work better than a "flat" planner when the subgoals produced by the abstractions represent a "good" problem decomposition. However, this bilevel planner can also be very slow, especially when $|\mathcal{S}|$, $|\mathcal{A}|$, and/or $N$ are large.

Single Abstract Plan + Greedy Refinement

Success Example	Failure Example

See test_sesame_planner.py for GIF generation.

This bilevel planner improves planning time by finding and committing to a single abstract plan before any (low level) states and actions are considered. To do this, we need to be able to check goals at the abstract level. Typically we assume that goals are represented as sets of abstract states. Once an abstract plan is found, we attempt to "refine" each abstract transition in sequence (greedily). See the pseudocode below.

s0 = get_abstract_state(x0)  # ψ
s_plan, a_plan = run_abstract_search(s0, A, F, g)
x_plan, u_plan = [x0], []
for s, a, ns in zip(s_plan[:-1], a_plan, s_plan[1:]):
  x_traj, u_traj = sample_trajectory(x_plan[-1], s, a, ns)  # may fail
  x_plan.extend(x_traj)
  u_plan.extend(u_traj)

This can be much faster, but at the expense of completeness: it is possible that no plan is found, even if one exists. One failure mode is that the trajectory sampled for one abstract transition can make a future abstract transition "unrefinable." For instance, imagine grasping a cup from the top, but then later trying to place it into a very short shelf; collisions between the wrist and shelf ceiling may prevent placement.

Single Abstract Plan + Backtracking Refinement

Success Example	Failure Example

See test_sesame_planner.py for GIF generation. In the failure example, there is a simulated obstacle that fully blocks the first abstract plan.

To get unstuck in cases like the one mentioned above, we can backtrack. A simple way to implement backtracking is to attempt $K$ refinements for each abstract transition before moving back to the previous one.

s0 = get_abstract_state(x0)  # ψ
s_plan, a_plan = run_abstract_search(s0, A, F, g)

def refine_from_step(index, x):
  if index == len(a_plan):
    return True, []  # successfully refined all transitions

  s = s_plan[index]
  a = a_plan[index]
  ns = s_plan[index + 1]

  for k in range(K):
    try:
      x_traj, u_traj = sample_trajectory(x, s, a, ns)
      success, remainder = refine_from_step(index + 1, x_traj[-1])
      if success:
        return True, [(x_traj, u_traj)] + remainder
      except TrajectorySamplingFailure:
        continue

  return False, None  # backtrack

success, full_plan = refine_from_step(0, x0)

This is better, but even putting aside the issue of choosing $K$, this planner is still incomplete: some abstract transitions may always be unrefinable given the initial state of the problem $x_0$. We can address this issue by considering multiple abstract plans, rather than committing to a single one.

Multi-Abstract Plan + Backtracking Refinement ("SeSamE")

See test_sesame_planner.py for GIF generation.

This bilevel planner is the same as the one above, except that multiple abstract plans are generated in an outer loop until one is successfully refined. It is also possible to generate and refine abstract plans in parallel rather than in sequence.

for (s_plan, a_plan) in iter_abstract_plans(s0, A, F, g):
  try:
    x_plan, u_plan = refine(s_plan, a_plan)  # backtracking
  except RefinementFailure:
    continue

We typically iterate over abstract plans in order from shortest to longest (in terms of abstract plan length), breaking ties randomly.

For "historical" reasons, we refer to this combination of multi-abstract plan generation and backtracking refinement as "SeSamE", which stands for Search, Sample, and Execute. This planner is still technically incomplete, because each abstract transition is only refined a finite number of times. Moreover, when there are frequent refinement failures or poorly behaved abstractions, this planner can be very slow. However, it does have the nice property that when refinement failures are rare, planning is fast. And unlike the previous planners, when there are refinement failures, this planner can recover eventually (for sufficiently large $K$). This is the planner that we have most often used in our work on learning for bilevel planning.

Relational Abstractions and PDDL

See test_sesame_planner.py for GIF generation.

We have so far been very "abstract" in our discussion of abstractions. In practice, we often use relational abstractions that allow us to take advantage of powerful techniques from classical AI planning. In particular, if we assume that abstract states and goals are represented with atoms and abstract actions are represented with operators, then we can use PDDL planners to implement abstract search. This can lead to dramatic improvements in abstract plan generation.

Project details

Release history Release notifications | RSS feed

0.1.2

Apr 6, 2026

This version

0.1.1

Apr 5, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

bilevel_planning-0.1.1.tar.gz (11.0 MB view details)

Uploaded Apr 5, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

bilevel_planning-0.1.1-py3-none-any.whl (24.3 kB view details)

Uploaded Apr 5, 2026 Python 3

File details

Details for the file bilevel_planning-0.1.1.tar.gz.

File metadata

Download URL: bilevel_planning-0.1.1.tar.gz
Upload date: Apr 5, 2026
Size: 11.0 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.4.16

File hashes

Hashes for bilevel_planning-0.1.1.tar.gz
Algorithm	Hash digest
SHA256	`facb05662c8d8192181910951554324cfca37494f8bff176c4295fb1ad6ad551`
MD5	`0f5491758d657472fedd5cdc95df84c1`
BLAKE2b-256	`3c86b911f2e990ead882a7f41985384a23e52b791f3fae3b11956573e1a40879`

See more details on using hashes here.

File details

Details for the file bilevel_planning-0.1.1-py3-none-any.whl.

File metadata

Download URL: bilevel_planning-0.1.1-py3-none-any.whl
Upload date: Apr 5, 2026
Size: 24.3 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.4.16

File hashes

Hashes for bilevel_planning-0.1.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`9b656e27a55f81ac0f9e9055eefb9a5502d731b88e3adb8e23f75922c4f081f4`
MD5	`254cc06288f690d3c5bcc7365967df3e`
BLAKE2b-256	`4dbc5eb34d3293a9b966e41f0ddb2867fa88550317af1a996cc42eb363554032`

See more details on using hashes here.

bilevel_planning 0.1.1

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

Bilevel Planning

Requirements

Installation

Primer on Bilevel Planning

Problem Setting

Abstractions

Bilevel Planning Graph

Bilevel Planner Examples

Abstract Breadth-First Search

Single Abstract Plan + Greedy Refinement

Single Abstract Plan + Backtracking Refinement

Multi-Abstract Plan + Backtracking Refinement ("SeSamE")

Relational Abstractions and PDDL

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes