
POMDPPlanners

License: MIT · Python 3.10+ · Code style: black

A comprehensive Python package for POMDP (Partially Observable Markov Decision Process) planning algorithms and environments. POMDPPlanners provides a standardized simulation framework for research studies and reliable planner implementations for industrial applications.

🎯 Key Features

  • Comprehensive Algorithm Library: Implementations of state-of-the-art POMDP planning algorithms including POMCP, POMCPOW, POMCP-DPW, PFT-DPW, Sparse PFT, BetaZero, ConstrainedZero, and more
  • Rich Environment Collection: Classic and modern POMDP environments (Tiger, Light-Dark, RockSample, LaserTag, PacMan, CartPole, Push, Safety-Ant-Velocity, etc.)
  • Flexible Belief Representations: Particle filters, weighted beliefs, Gaussian beliefs, Gaussian mixture beliefs, and vectorized belief updaters
  • Simulation Framework: Complete experiment management with hyperparameter tuning, high-level evaluation workflows, and distributed computing support
  • Visualization Tools: Built-in plotting and visualization capabilities for analysis and debugging
  • Production Ready: Designed for both research experiments and industrial applications
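To give a flavor of what a weighted particle belief does under the hood, here is a minimal, self-contained particle-filter update step. This is an illustrative sketch only; the function and parameter names here are not the package's actual API.

```python
def update_weighted_particles(particles, weights, action, observation,
                              transition, obs_likelihood):
    """One step of a weighted particle filter: propagate each particle
    through the transition model, reweight by the likelihood of the
    received observation, then normalize the weights."""
    new_particles = [transition(s, action) for s in particles]
    new_weights = [w * obs_likelihood(observation, s, action)
                   for s, w in zip(new_particles, weights)]
    total = sum(new_weights)
    if total == 0:
        raise ValueError("particle depletion: all weights are zero")
    return new_particles, [w / total for w in new_weights]
```

Real implementations add resampling when the effective sample size drops; the sketch omits that for brevity.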

🚀 Quick Start

Installation

# Clone the repository
git clone https://github.com/yaacovpariente/POMDPPlanners.git
cd POMDPPlanners

# Create and activate virtual environment
python -m venv .venv
source .venv/bin/activate  # On Windows: .venv\Scripts\activate

# Install package (standard)
pip install -e .

# Install with development dependencies
pip install -e ".[dev]"

Basic Usage

from POMDPPlanners.environments.tiger_pomdp import TigerPOMDP
from POMDPPlanners.planners.mcts_planners.pomcp import POMCP
from POMDPPlanners.core.belief import WeightedParticleBelief

# Create environment and planner
env = TigerPOMDP()
planner = POMCP(env, num_simulations=1000)

# Initialize belief and run planning
initial_belief = WeightedParticleBelief.create_uniform_belief(
    env.get_states(), num_particles=1000
)

# Get action from planner
action = planner.get_action(initial_belief)
print(f"Recommended action: {action}")

🏗️ Architecture Overview

Core Components

  • POMDPPlanners.core: Fundamental abstractions and interfaces

    • environment.py: Base Environment, DiscreteActionsEnvironment, SpaceType, ObservationModel, StateTransitionModel
    • policy.py: Base Policy and TrainablePolicy classes with config management and logging
    • belief/: Belief state representations including:
      • WeightedParticleBelief, UnweightedParticleBelief — particle filter beliefs
      • GaussianBelief, GaussianMixtureBelief — parametric beliefs
      • VectorizedWeightedParticleBelief — vectorized particle belief for performance
      • Gaussian belief updaters and vectorized particle belief updaters
    • cost.py: Cost function abstractions for constrained planning
    • distributions.py: Probability distribution implementations
    • tree.py: Tree-based data structures for planning algorithms
    • config_types.py: Centralized configuration type definitions
  • POMDPPlanners.environments: POMDP environment implementations

  • POMDPPlanners.planners: Planning algorithm implementations

  • POMDPPlanners.simulations: Experiment management and execution framework

  • POMDPPlanners.utils: Helper functions and visualization tools
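For orientation, the shape of a generative POMDP environment interface can be sketched as follows. This is an illustrative stand-in, not the package's actual base classes in core/environment.py, which may define a different set of methods.

```python
from abc import ABC, abstractmethod

class Environment(ABC):
    """Illustrative generative-model interface for a POMDP environment."""

    @abstractmethod
    def sample_initial_state(self):
        """Draw a state from the initial state distribution."""

    @abstractmethod
    def step(self, state, action):
        """Sample (next_state, observation, reward) from the generative model."""

class CoinEnvironment(Environment):
    # Trivial concrete example: guess a fixed hidden coin face.
    def __init__(self, hidden="heads"):
        self.hidden = hidden

    def sample_initial_state(self):
        return self.hidden

    def step(self, state, action):
        reward = 1.0 if action == state else -1.0
        # Fully revealing observation, so the problem becomes trivial
        # after one step -- real environments emit noisy observations.
        return state, state, reward
```

Planners only need such a generative interface (sampling, not explicit probability tables), which is what makes Monte Carlo methods like POMCP applicable to large state spaces.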

Supported Algorithms

MCTS-Based Planners

  • POMCP: Partially Observable Monte Carlo Planning
  • POMCPOW: POMCP with Observation Widening (progressive widening applied to sampled observations)
  • POMCP-DPW: POMCP with Double Progressive Widening
  • PFT-DPW: Particle Filter Trees with Double Progressive Widening
  • Sparse PFT: Sparse sampling with Particle Filter Trees
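These planners share the UCB1 rule for in-tree action selection. A minimal, self-contained sketch of that rule follows; the function name and the exploration constant are illustrative, not the package's internals.

```python
import math

def ucb1_select(actions, visit_counts, q_values, total_visits, c=2.0):
    """UCB1 action selection as used in POMCP-style tree search:
    pick the action maximizing Q(a) + c * sqrt(ln(N) / N(a)),
    trying any unvisited action first."""
    best_action, best_score = None, float("-inf")
    for a in actions:
        n = visit_counts.get(a, 0)
        if n == 0:
            return a  # always expand unvisited actions first
        score = q_values[a] + c * math.sqrt(math.log(total_visits) / n)
        if score > best_score:
            best_action, best_score = a, score
    return best_action
```

The exploration constant c trades off exploiting high Q-value actions against exploring rarely tried ones; it is typically a tuned hyperparameter.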

Neural MCTS Planners

  • BetaZero: Neural-guided MCTS for POMDPs — adapts AlphaZero to belief-space planning with a dual-head neural network (value + policy)
  • ConstrainedZero: Safety-constrained extension of BetaZero for chance-constrained POMDPs — adds a failure-probability head and adaptive threshold calibration via conformal inference

Other Planners

  • Sparse Sampling: Classical sparse sampling algorithm
  • Open Loop Planners: Non-feedback planning approaches (discrete action sequences)
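For reference, the core recursion of classical sparse sampling (Kearns et al.) fits in a few lines. This is a self-contained illustration over a generative step function, not the package's implementation.

```python
def sparse_sampling_value(state, depth, actions, step, gamma=0.95, width=16):
    """Depth-limited value estimate: for each action, average returns over
    `width` sampled transitions, recurse on the sampled next states, and
    return the best action value."""
    if depth == 0:
        return 0.0
    best = float("-inf")
    for a in actions:
        total = 0.0
        for _ in range(width):
            next_state, reward = step(state, a)
            total += reward + gamma * sparse_sampling_value(
                next_state, depth - 1, actions, step, gamma, width)
        best = max(best, total / width)
    return best
```

The cost grows as (|A| * width)^depth, which is exactly the blow-up that the MCTS-based planners above avoid by focusing samples on promising branches.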

Available Environments

  • Tiger POMDP: Classic two-door problem
  • Light-Dark POMDP: Navigation with position-dependent observation noise (discrete and continuous variants)
  • Rock Sample POMDP: Grid-world rover navigation with multiple rock samples; rover must sense and collect good rocks
  • Laser Tag POMDP: Tag-based pursuit environment (discrete and continuous geometry variants)
  • PacMan POMDP: Partially observable Pac-Man navigation with ghost uncertainty
  • CartPole POMDP: Partially observable cart-pole balancing
  • Mountain Car POMDP: Partially observable mountain car
  • Push POMDP: Object manipulation environment
  • Safety Ant Velocity: Safety-constrained locomotion task
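For the Tiger POMDP specifically, the exact Bayes belief update after a listen action is small enough to write out. The numbers below assume the classic formulation in which the growl is heard from the correct door with probability 0.85; this sketch is independent of the package's environment class.

```python
def tiger_listen_update(p_left, heard_left, accuracy=0.85):
    """Exact Bayesian update of P(tiger behind left door) after a
    'listen' action, given which side the growl was heard from."""
    if heard_left:
        like_left, like_right = accuracy, 1.0 - accuracy
    else:
        like_left, like_right = 1.0 - accuracy, accuracy
    numer = like_left * p_left
    denom = numer + like_right * (1.0 - p_left)
    return numer / denom
```

Starting from a uniform belief, one consistent observation moves the belief to 0.85, and repeated consistent observations push it toward certainty, which is why the optimal Tiger policy listens several times before opening a door.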

Simulations Framework

  • POMDPPlanners.simulations.simulator: Main simulation runner
  • POMDPPlanners.simulations.episodes: Episode execution logic
  • POMDPPlanners.simulations.workflows: High-level evaluation and optimization workflows
    • evaluation.py, optimization.py, integrated.py
    • planner_evaluation_workflow.py, hyperparameter_tuning_evaluation_workflows.py
  • POMDPPlanners.simulations.simulation_apis: Pluggable backends for experiment execution (see Distributed Computing section)
  • POMDPPlanners.simulations.simulations_deployment: Task managers and caching for distributed runs

Utilities

  • POMDPPlanners.utils.visualization: Visualization submodule with
    • metrics_plots.py, returns_plots.py, tree_plots.py, policy_simulation_plots.py
  • POMDPPlanners.utils.config_loader: Configuration file management
  • POMDPPlanners.utils.logger: Centralized logging setup
  • POMDPPlanners.utils.statistics_utils: Statistical analysis functions
  • POMDPPlanners.utils.tree_statistics: Tree statistics computation

📊 Running Experiments

Simple Experiment

from POMDPPlanners.simulations.simulator import Simulator
from POMDPPlanners.utils.config_loader import load_config

# Load experiment configuration
config = load_config("experiments/configs/tiger_pomcp_experiment.yaml")

# Run simulation
simulator = Simulator(config)
results = simulator.run()

print(f"Average reward: {results['average_reward']}")

Hyperparameter Tuning

from POMDPPlanners.simulations.hyper_parameter_tuning_simulations import HyperParameterTuningSimulations

# Define hyperparameter space
hyperparams = {
    "num_simulations": [500, 1000, 2000],
    "exploration_constant": [1.0, 2.0, 5.0],
    "discount_factor": [0.9, 0.95, 0.99]
}

# Run tuning experiment
tuner = HyperParameterTuningSimulations(
    base_config="experiments/configs/base_config.yaml",
    hyperparameters=hyperparams
)
best_params = tuner.optimize()
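Under the hood, a grid like the one above expands to the cross product of all listed values. A stdlib sketch of that expansion (independent of the package's tuner, which may search the space differently):

```python
from itertools import product

def expand_grid(hyperparams):
    """Expand a dict of value lists into the list of all parameter
    combinations -- the candidate configurations a grid sweep covers."""
    keys = list(hyperparams)
    return [dict(zip(keys, combo))
            for combo in product(*(hyperparams[k] for k in keys))]

grid = expand_grid({
    "num_simulations": [500, 1000, 2000],
    "exploration_constant": [1.0, 2.0, 5.0],
    "discount_factor": [0.9, 0.95, 0.99],
})
# 3 * 3 * 3 = 27 candidate configurations
```

Each candidate configuration then requires its own batch of simulation episodes, which is where the distributed backends below pay off.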

Example Notebooks

Interactive Jupyter notebooks with detailed usage examples are available in docs/examples/:

  • docs/examples/basic_usage.ipynb — Environment setup, belief initialization, and basic planning
  • docs/examples/hyperparameter_tuning.ipynb — End-to-end hyperparameter search
  • docs/examples/planners_comparison.ipynb — Side-by-side comparison of planning algorithms

🌐 Distributed Computing

POMDPPlanners supports multiple execution backends for scaling experiments:

Backend   Description                         Use Case
Local     Sequential single-process           Development and debugging
Dask      Distributed multi-machine cluster   Large-scale parallelism
PBS       HPC cluster via dask-jobqueue       Supercomputer / PBS job scheduler

Select the backend via the simulation API in POMDPPlanners.simulations.simulation_apis:

# Local sequential execution
from POMDPPlanners.simulations.simulation_apis.local_simulations_api import LocalSimulationsAPI

# Dask distributed execution
from POMDPPlanners.simulations.simulation_apis.dask_simulations_api import DaskSimulationsAPI

# PBS (HPC) cluster execution
from POMDPPlanners.simulations.simulation_apis.pbs_simulations_api import PBSSimulationsAPI
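All three backends run the same set of independent episodes; only where the episodes execute changes. As a stdlib analogy (not the package's API), the same map over episode seeds can be dispatched sequentially or to a worker pool:

```python
import random
from concurrent.futures import ThreadPoolExecutor

def run_episode(seed):
    # Stand-in for one simulation episode returning its total reward;
    # in the package, the chosen backend dispatches the real runner.
    rng = random.Random(seed)
    return sum(rng.uniform(-1, 1) for _ in range(10))

def run_locally(seeds):
    # "Local" backend analogue: sequential, easiest to debug.
    return [run_episode(s) for s in seeds]

def run_parallel(seeds, workers=4):
    # Same map over independent episodes, different executor -- mirroring
    # how the Local/Dask/PBS APIs differ only in where episodes run.
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(run_episode, seeds))
```

Because episodes share no state, results are identical across backends (given fixed seeds), so you can debug locally and scale out unchanged.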

🧪 Testing

Run the comprehensive test suite:

# Activate virtual environment
source .venv/bin/activate

# Run all tests
pytest

# Run specific test categories
pytest POMDPPlanners/tests/test_core/
pytest POMDPPlanners/tests/test_environments/
pytest POMDPPlanners/tests/test_planners/

# Run with verbose output
pytest -v

# Run specific test file
pytest POMDPPlanners/tests/test_core/test_belief.py

🔧 Development

Code Quality

# Format code
black .

# Type checking
python -m pyright POMDPPlanners/

# Run linting
pylint POMDPPlanners/
flake8 .

# Install pre-commit hooks
pre-commit install

Virtual Environment

Important: Always activate the virtual environment before development:

source .venv/bin/activate  # Linux/Mac
# .venv\Scripts\activate   # Windows

All commands should be run within this environment for consistent dependency management.

📚 Documentation

Comprehensive documentation is generated from docstrings using Sphinx:

# Build documentation
cd docs/
sphinx-build -b html . _build/html

# Serve locally
python -m http.server 8000 -d _build/html

Visit the documentation at: Project Documentation

📄 License

This project is licensed under the MIT License - see the LICENSE.md file for details.

🎓 Citation

If you use POMDPPlanners in your research, please cite:

@misc{pariente2026pomdpplannersopensourcepackagepomdp,
      title={POMDPPlanners: Open-Source Package for POMDP Planning}, 
      author={Yaacov Pariente and Vadim Indelman},
      year={2026},
      eprint={2602.20810},
      archivePrefix={arXiv},
      primaryClass={cs.AI},
      url={https://arxiv.org/abs/2602.20810}, 
}

🛠️ Requirements

  • Python 3.10 or higher
  • Core dependencies managed via pyproject.toml (pip install -e .)
  • Development dependencies: pip install -e ".[dev]"

