rddl2tf

RDDL2TensorFlow compiler.

These details have not been verified by PyPI

Project links

Homepage

Development Status
- 3 - Alpha
Environment
- Console
Intended Audience
- Science/Research
License
- OSI Approved :: GNU General Public License v3 (GPLv3)
Natural Language
- English
Operating System
- OS Independent
Programming Language
- Python :: 3
Topic
- Scientific/Engineering :: Artificial Intelligence

Project description

# rddl2tf [![Build Status](https://travis-ci.org/thiagopbueno/rddl2tf.svg?branch=master)](https://travis-ci.org/thiagopbueno/rddl2tf) [![License](https://img.shields.io/aur/license/yaourt.svg)](https://github.com/thiagopbueno/rddl2tf/blob/master/LICENSE)

RDDL2TensorFlow compiler in Python3.

# Quickstart

```text
$ pip3 install rddl2tf
```

# Usage

rddl2tf can be used as a standalone script or programmatically.

## Script mode

```text
$ rddl2tf --help
usage: rddl2tf [-h] [-b BATCH_SIZE] [--logdir LOGDIR] rddl

RDDL2TensorFlow compiler in Python3.

positional arguments:
rddl path to RDDL file or rddlgym problem id

optional arguments:
-h, --help show this help message and exit
-b BATCH_SIZE, --batch-size BATCH_SIZE
number of fluents in a batch (default=256)
--logdir LOGDIR log directory for tensorboard graph visualization
(default=/tmp/rddl2tf)
```

### Examples

```text
$ rddl2tf Reservoir-8 --batch-size=1024 --logdir=/tmp/rddl2tf
tensorboard --logdir /tmp/rddl2tf/reservoir/inst_reservoir_res8
```

```text
$ rddl2tf Mars_Rover --batch-size=1024 --logdir=/tmp/rddl2tf
tensorboard --logdir /tmp/rddl2tf/simple_mars_rover/inst_simple_mars_rover_pics3
```

## Programmatic mode

```python
import rddlgym

from rddl2tf.compiler import Compiler

# parse and compile RDDL
model_id = 'Reservoir-8'
model = rddlgym.make(model_id, mode=rddlgym.AST)
compiler = Compiler(model)

# set batch mode
compiler.batch_mode_on()
batch_size = 256

# compile initial state and default action fluents
state = compiler.compile_initial_state(batch_size)
action = compiler.compile_default_action(batch_size)

# compile state invariants and action preconditions
invariants = compiler.compile_state_invariants(state)
preconditions = compiler.compile_action_preconditions(state, action)

# compile intermediate fluents and next state fluents
scope = compiler.transition_scope(state, action)
interms, next_state = compiler.compile_cpfs(scope, batch_size)

# compile reward function
scope.update(next_state)
reward = compiler.compile_reward(scope)
```

# Compiler

## Parameterized Variables (pvariables)

Each RDDL fluent is compiled to a ``rddl2tf.TensorFluent`` after instantiation.

A ``rddl2tf.TensorFluent`` object wraps a ``tf.Tensor`` object. The arity and the number of objects corresponding to the type of each parameter of a fluent are reflected in a ``rddl2tf.TensorFluentShape`` object (the rank of a ``rddl2tf.TensorFluent`` corresponds to the fluent arity and the size of its dimensions corresponds to the number of objects of each type). Also, a ``rddl2tf.TensorFluentShape`` manages batch sizes when evaluating operations in batch mode.

Additionally, a ``rddl2tf.TensorFluent``keeps information about the ordering of the fluent parameters in a ``rddl2tf.TensorScope`` object.

The ``rddl2tf.TensorFluent`` abstraction is necessary in the evaluation of RDDL expressions due the broadcasting rules of operations in TensorFlow.

## Conditional Probability Functions (CPFs)

Each CPF expression is compiled into an operation in a ``tf.Graph``, possibly composed of many other operations. Typical RDDL operations, functions, and probability distributions are mapped to equivalent TensorFlow ops. These operations are added to a ``tf.Graph`` by recursively compiling the expressions in a CPF into wrapped operations and functions implemented at the ``rddl2tf.TensorFluent`` level.

Note that the RDDL2TensorFlow compiler currently only supports element-wise operations (e.g. ``a(?x, ?y) = b(?x) * c(?y)`` is not allowed). However, all compiled operations are vectorized, i.e., computations are done simultaneously for all object instantiations of a pvariable.

Optionally, during simulation operations can be evaluated in batch mode. In this case, state-action trajectories are generated in parallel by the ``rddl2tf.Simulator``.

# License

Copyright (c) 2018 Thiago Pereira Bueno All Rights Reserved.

rddl2tf is free software: you can redistribute it and/or modify it
under the terms of the GNU Lesser General Public License as published by
the Free Software Foundation, either version 3 of the License, or (at
your option) any later version.

rddl2tf is distributed in the hope that it will be useful, but
WITHOUT ANY WARRANTY; without even the implied warranty of
MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Lesser
General Public License for more details.

You should have received a copy of the GNU Lesser General Public License
along with rddl2tf. If not, see http://www.gnu.org/licenses/.

Project details

These details have not been verified by PyPI

Project links

Homepage

Development Status
- 3 - Alpha
Environment
- Console
Intended Audience
- Science/Research
License
- OSI Approved :: GNU General Public License v3 (GPLv3)
Natural Language
- English
Operating System
- OS Independent
Programming Language
- Python :: 3
Topic
- Scientific/Engineering :: Artificial Intelligence

Release history Release notifications | RSS feed

0.5.13

Nov 16, 2020

0.5.12

Nov 16, 2020

0.5.11

Sep 14, 2020

0.5.10

Dec 11, 2019

0.5.9

Dec 11, 2019

0.5.8

Dec 11, 2019

0.5.6

May 2, 2019

0.5.5

Apr 18, 2019

0.5.4

Apr 17, 2019

0.5.3

Apr 17, 2019

0.5.2

Apr 15, 2019

0.5.1

Apr 2, 2019

0.4.12

Nov 24, 2018

0.4.10

Nov 15, 2018

0.4.9

Nov 15, 2018

0.4.8

Nov 14, 2018

0.4.7

Nov 9, 2018

0.4.6

Nov 8, 2018

0.4.5

Nov 8, 2018

0.4.4

Nov 4, 2018

0.4.3

Oct 29, 2018

0.4.2

Oct 27, 2018

0.4.1

Oct 24, 2018

0.4.0

Oct 23, 2018

0.3.4

Oct 23, 2018

0.3.3

Oct 21, 2018

0.3.2

Sep 29, 2018

0.3.1

Sep 24, 2018

This version

0.3.0

Sep 23, 2018

0.2.1

Sep 22, 2018

0.2.0

Sep 11, 2018

0.1.0

Sep 11, 2018

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

rddl2tf-0.3.0.tar.gz (22.2 kB view hashes)

Uploaded Sep 23, 2018 Source

Hashes for rddl2tf-0.3.0.tar.gz

Hashes for rddl2tf-0.3.0.tar.gz
Algorithm	Hash digest
SHA256	`0eb80462964e664679e1782cd1a8adb90556965f0426bf88f285a13b3191e1cf`
MD5	`f68eaab9fa0caebe1d31c4720da6bd7d`
BLAKE2b-256	`d017271afd8cbc1e61e3ad7bf847de1e35751f7696017210f232b3097c1704e7`