Planning through backpropagation using TensorFlow.

Project description

tf-plan

Planning via gradient-based optimization in continuous MDPs using TensorFlow.

tf-plan is an implementation based on the NIPS 2017 paper:

Ga Wu, Buser Say, and Scott Sanner, 2017.
Scalable Planning with Tensorflow for Hybrid Nonlinear Domains.
In Advances in Neural Information Processing Systems (pp. 6273-6283).

Quickstart

tf-plan is a Python 3.5+ package available on PyPI.

$ pip3 install tf-plan

Features

tf-plan solves discrete-time MDPs with continuous state-action spaces and deterministic transitions.
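To make the idea of planning by gradient descent concrete, here is a minimal sketch on a toy problem. It is not tf-plan's implementation and does not use TensorFlow or RDDL: the 1-D dynamics `x_{t+1} = x_t + a_t`, the quadratic cost, and all function names are assumptions chosen for illustration. The parameter names `horizon`, `epochs`, and `learning_rate` only mirror the CLI options.

```python
def rollout(x0, actions):
    """Apply the deterministic transition x_{t+1} = x_t + a_t (toy dynamics)."""
    states = [x0]
    for a in actions:
        states.append(states[-1] + a)
    return states


def total_cost(states, goal):
    """Sum of squared distances to the goal over all visited next states."""
    return sum((x - goal) ** 2 for x in states[1:])


def cost_gradient(states, goal):
    """Backpropagate through the rollout to get d(cost)/d(a_t) for every t.

    Action a_t shifts every later state, so its gradient is the sum of
    2 * (x_k - goal) over all states x_k reached after step t.
    """
    grads = [0.0] * (len(states) - 1)
    running = 0.0
    for t in reversed(range(len(grads))):
        running += 2.0 * (states[t + 1] - goal)
        grads[t] = running
    return grads


def plan(x0, goal, horizon=5, epochs=500, learning_rate=0.05):
    """Optimize an open-loop action sequence with plain gradient descent."""
    actions = [0.0] * horizon
    for _ in range(epochs):
        states = rollout(x0, actions)
        grads = cost_gradient(states, goal)
        actions = [a - learning_rate * g for a, g in zip(actions, grads)]
    return actions


if __name__ == "__main__":
    best = plan(x0=0.0, goal=3.0)
    print("actions:", [round(a, 4) for a in best])
    print("cost:", total_cost(rollout(0.0, best), 3.0))
```

Because the transition is deterministic and differentiable, the cost of a whole trajectory is a differentiable function of the action sequence, which is what lets a gradient-based optimizer act as the planner.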

The domains/instances are specified using the RDDL language.

It is built on the following Python 3 RDDL tools:

pyrddl: RDDL lexer/parser in Python 3.
rddlgym: A toolkit for working with RDDL domains in Python 3.
rddl2tf: RDDL2TensorFlow compiler.
tf-rddlsim: An RDDL simulator running in TensorFlow.

Please refer to the projects' documentation for further details.

Usage

$ tfplan --help

usage: tfplan [-h] [-m {offline,online}] [-b BATCH_SIZE] [-hr HORIZON]
              [-e EPOCHS] [-lr LEARNING_RATE] [--viz {generic,navigation}]
              [-v]
              rddl

tf-plan (v0.5.0): Planning via gradient-based optimization in TensorFlow.

positional arguments:
  rddl                  RDDL file or rddlgym domain id

optional arguments:
  -h, --help            show this help message and exit
  -m {offline,online}, --mode {offline,online}
                        planning mode (default=offline)
  -b BATCH_SIZE, --batch-size BATCH_SIZE
                        number of trajectories in a batch (default=128)
  -hr HORIZON, --horizon HORIZON
                        number of timesteps (default=40)
  -e EPOCHS, --epochs EPOCHS
                        number of timesteps (default=500)
  -lr LEARNING_RATE, --learning-rate LEARNING_RATE
                        optimizer learning rate (default=0.001)
  --viz {generic,navigation}
                        type of visualizer (default=generic)
  -v, --verbose         verbosity mode
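When calling the planner from a script, it can help to assemble the command line programmatically. The sketch below only builds the argument list from the flags and defaults shown in the help text above; the helper name `build_tfplan_command` is hypothetical and is not part of tf-plan. The help text describes `-e` as "number of timesteps", while the example run output labels the same value "Training epochs"; the sketch assumes it means training epochs.

```python
import shlex


def build_tfplan_command(rddl, mode="offline", batch_size=128, horizon=40,
                         epochs=500, learning_rate=0.001, viz="generic",
                         verbose=False):
    """Return an argv list for tfplan using the options listed by --help."""
    if mode not in ("offline", "online"):
        raise ValueError("mode must be 'offline' or 'online'")
    if viz not in ("generic", "navigation"):
        raise ValueError("viz must be 'generic' or 'navigation'")
    argv = [
        "tfplan", rddl,
        "--mode", mode,
        "--batch-size", str(batch_size),
        "--horizon", str(horizon),
        "--epochs", str(epochs),
        "--learning-rate", str(learning_rate),
        "--viz", viz,
    ]
    if verbose:
        argv.append("--verbose")
    return argv


if __name__ == "__main__":
    command = build_tfplan_command(
        "Navigation-v1", batch_size=32, horizon=15, epochs=1000,
        viz="navigation", verbose=True,
    )
    # Print a shell-safe rendering of the command without running it.
    print(shlex.join(command))
```

With tf-plan installed, the returned list can be passed to `subprocess.run(command, check=True)`.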

Examples

Navigation

$ tfplan Navigation-v1 -b 32 -hr 15 -e 1000 -v --viz=navigation

Running tf-plan v0.5.0 ...
>> RDDL:            Navigation-v1
>> Planning mode:   offline
>> Horizon:         15
>> Batch size:      32
>> Training epochs: 1000
>> Learning rate:   0.01

Epoch   999: loss = 6879.5073244
>> total reward = -82.927887

[Figure: docs/img/navigation-v1.png]

HVAC

$ tfplan HVAC-v1 -b 64 -hr 40 -e 1000 --viz=generic

Epoch   999: loss = 58134777856.00000000
>> total reward = -241098.296875
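To compare runs, the final loss and total reward can be pulled out of the console output. The following sketch assumes the output keeps exactly the two line formats shown in the examples above (`Epoch N: loss = ...` and `>> total reward = ...`); the function name `parse_tfplan_log` and the regular expressions are illustrative and not part of tf-plan.

```python
import re

# Patterns modelled on the example output above; other versions may differ.
EPOCH_RE = re.compile(r"Epoch\s+(\d+):\s+loss\s+=\s+([-+0-9.eE]+)")
REWARD_RE = re.compile(r">>\s+total reward\s+=\s+([-+0-9.eE]+)")


def parse_tfplan_log(text):
    """Return ({epoch: loss}, total_reward or None) parsed from captured output."""
    losses = {int(epoch): float(loss) for epoch, loss in EPOCH_RE.findall(text)}
    match = REWARD_RE.search(text)
    total_reward = float(match.group(1)) if match else None
    return losses, total_reward


if __name__ == "__main__":
    sample = (
        "Epoch   999: loss = 58134777856.00000000\n"
        ">> total reward = -241098.296875\n"
    )
    losses, reward = parse_tfplan_log(sample)
    print(losses, reward)
```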

Documentation

Please refer to https://tf-plan.readthedocs.io/ for the code documentation.

Support

If you are having issues with tf-plan, please let me know at: thiago.pbueno@gmail.com.

License

tf-plan is free software: you can redistribute it and/or modify it under the terms of the GNU Lesser General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version.

tf-plan is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Lesser General Public License for more details.

You should have received a copy of the GNU Lesser General Public License along with tf-plan. If not, see http://www.gnu.org/licenses/.

Release history

0.7.5: Oct 29, 2020
0.7.4: Oct 29, 2020
0.7.3: Oct 29, 2020
0.7.2: Oct 29, 2020
0.7.1: Apr 10, 2020
0.7.0: Apr 7, 2020
0.6.1: Mar 29, 2020
0.6.0: Dec 11, 2019
0.5.1: May 15, 2019 (this version)
0.5.0: Apr 2, 2019
0.4.7: Nov 25, 2018
0.4.6: Nov 24, 2018
0.4.5: Nov 14, 2018
0.4.4: Nov 14, 2018
0.4.3: Nov 9, 2018
0.4.2: Sep 29, 2018
0.4.1: Sep 28, 2018
0.4.0: Aug 30, 2018
0.3.1: Aug 19, 2018
0.2.1: Aug 12, 2018
0.2.0: Aug 12, 2018
0.1.0: Aug 8, 2018

Download files

Source Distribution

tf-plan-0.5.1.tar.gz (15.9 kB)

Uploaded May 15, 2019 Source

File details

Details for the file tf-plan-0.5.1.tar.gz.

File metadata

Download URL: tf-plan-0.5.1.tar.gz
Upload date: May 15, 2019
Size: 15.9 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.21.0 setuptools/40.8.0 requests-toolbelt/0.9.1 tqdm/4.31.1 CPython/3.6.5

File hashes

Hashes for tf-plan-0.5.1.tar.gz
Algorithm	Hash digest
SHA256	`2a54e969cbbf353dfc19ccc420485a67e8e2de050271c892c40159ffef9bbcd1`
MD5	`3060685a32dcecb851b15a38d4554fd3`
BLAKE2b-256	`5edab02d436a520ff6c2535aaf3abc0ccab61d5d5581d35264a7f85048800087`
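A downloaded archive can be checked against the SHA256 digest listed above using only the Python standard library. This is a minimal sketch: the helper names `sha256_of_file` and `matches_expected` are hypothetical, and the local file path is assumed to be wherever the archive was saved.

```python
import hashlib

# SHA256 digest published for tf-plan-0.5.1.tar.gz in the table above.
EXPECTED_SHA256 = "2a54e969cbbf353dfc19ccc420485a67e8e2de050271c892c40159ffef9bbcd1"


def sha256_of_file(path, chunk_size=65536):
    """Stream the file in chunks so large archives need not fit in memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_SHA256):
    """Return True when the file's SHA256 digest equals the expected value."""
    return sha256_of_file(path) == expected.lower()
```

For example, `matches_expected("tf-plan-0.5.1.tar.gz")` should return `True` for an intact copy of the source distribution.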
