Sum Product Flow: An Easy and Extensible Library for Sum-Product Networks

These details have not been verified by PyPI

Project links

Homepage

License
- OSI Approved :: Apache Software License
Operating System
- OS Independent
Programming Language
- Python :: 3

Project description

SPFlow: An Easy and Extensible Library for Sum-Product Networks

SPFlow is an open-source Python library that provides a simple interface to inference, learning, and manipulation routines for deep and tractable probabilistic models called Sum-Product Networks (SPNs). The library allows one to quickly create SPNs both from data and through a domain-specific language (DSL). It efficiently implements several probabilistic inference routines, such as computing marginals, conditionals, and (approximate) most probable explanations (MPEs), along with sampling and utilities for serializing, plotting, and computing structure statistics on an SPN.

Furthermore, SPFlow is extremely extensible and customizable, allowing users to promptly create new inference and learning routines by injecting custom code into a light-weight functional-oriented API framework.

Getting Started

These instructions will get you a copy of the project up and running on your local machine for development and testing purposes.

Installing

To install the latest released version of SPFlow using pip:

pip3 install spflow

Examples

We start by creating an SPN. Using a Domain-Specific Language (DSL), we can quickly create an SPN of categorical leaf nodes like this:

from spn.structure.leaves.parametric.Parametric import Categorical

# The outer parentheses let the expression continue across lines.
spn = (0.4 * (Categorical(p=[0.2, 0.8], scope=0) *
              (0.3 * (Categorical(p=[0.3, 0.7], scope=1) *
                      Categorical(p=[0.4, 0.6], scope=2))
               + 0.7 * (Categorical(p=[0.5, 0.5], scope=1) *
                        Categorical(p=[0.6, 0.4], scope=2))))
       + 0.6 * (Categorical(p=[0.2, 0.8], scope=0) *
                Categorical(p=[0.3, 0.7], scope=1) *
                Categorical(p=[0.4, 0.6], scope=2)))

We can create the same SPN using the object hierarchy:

from spn.structure.leaves.parametric.Parametric import Categorical

from spn.structure.Base import Sum, Product, assign_ids, rebuild_scopes_bottom_up


p0 = Product(children=[Categorical(p=[0.3, 0.7], scope=1), Categorical(p=[0.4, 0.6], scope=2)])
p1 = Product(children=[Categorical(p=[0.5, 0.5], scope=1), Categorical(p=[0.6, 0.4], scope=2)])
s1 = Sum(weights=[0.3, 0.7], children=[p0, p1])
p2 = Product(children=[Categorical(p=[0.2, 0.8], scope=0), s1])
p3 = Product(children=[Categorical(p=[0.2, 0.8], scope=0), Categorical(p=[0.3, 0.7], scope=1)])
p4 = Product(children=[p3, Categorical(p=[0.4, 0.6], scope=2)])
spn = Sum(weights=[0.4, 0.6], children=[p2, p4])

assign_ids(spn)
rebuild_scopes_bottom_up(spn)


The p parameter indicates the probabilities, and the scope indicates the variable we are modeling.

We can now visualize the SPN using:

from spn.io.Graphics import plot_spn

plot_spn(spn, 'basicspn.png')

Marginalizing an SPN means summing out all the other non-relevant variables. So, if we want to marginalize the above SPN and sum out all other variables leaving only variables 1 and 2, we can do:

from spn.algorithms.Marginalization import marginalize

spn_marg = marginalize(spn, [1,2])

Here, we marginalize all the variables not in [1,2], and create a NEW structure that knows nothing about the previous one nor about the variable 0.

We can use this new SPN for all the operations we are interested in. That means we can also plot it:

plot_spn(spn_marg, 'marginalspn.png')

We can also dump the SPN as text:

from spn.io.Text import spn_to_str_equation
txt = spn_to_str_equation(spn_marg)
print(txt)

And the output is:

(0.6*((Categorical(V1|p=[0.3, 0.7]) * Categorical(V2|p=[0.4, 0.6]))) + 0.12000000000000002*((Categorical(V1|p=[0.3, 0.7]) * Categorical(V2|p=[0.4, 0.6]))) + 0.27999999999999997*((Categorical(V1|p=[0.5, 0.5]) * Categorical(V2|p=[0.6, 0.4]))))
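The weights in this output can be checked by hand. Summing out variable 0 replaces its Categorical leaf with the constant 1, so the first root branch collapses into the nested sum and its two products inherit the product of the weights along the path. The following is a plain-Python arithmetic sketch, independent of SPFlow; the variable names are illustrative only.

```python
# Weights of the root sum node and of the nested sum node in the original SPN.
root_weights = [0.4, 0.6]
inner_weights = [0.3, 0.7]

# After variable 0 is summed out, the second root branch keeps weight 0.6 and
# the nested products get root_weights[0] * inner weight (about 0.12 and 0.28).
flattened = [root_weights[1]] + [root_weights[0] * w for w in inner_weights]
print(flattened)
```

The three weights still sum to 1 (up to floating-point rounding), so the marginalized structure remains a normalized mixture.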

However, the most interesting aspect of SPNs is tractable inference. Here is an example of how to evaluate the SPNs from above. Since we have 3 variables, we create a 2D numpy array with 3 columns and 1 row.

import numpy as np
test_data = np.array([1.0, 0.0, 1.0]).reshape(-1, 3)

We then compute the log-likelihood:

from spn.algorithms.Inference import log_likelihood

ll = log_likelihood(spn, test_data)
print(ll, np.exp(ll))

And the output is:

[[-1.90730501]] [[0.14848]]
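This value can be reproduced by hand, which makes clear what the sum and product nodes compute. The sketch below uses only the standard library and reads the leaf probabilities for the instance (V0=1, V1=0, V2=1) directly off the p lists in the SPN definition; it is a check of the arithmetic, not SPFlow code.

```python
import math

# Leaf probabilities for the instance (V0=1, V1=0, V2=1).
v0 = 0.8           # Categorical(p=[0.2, 0.8], scope=0) at value 1
left = 0.3 * 0.6   # p=[0.3, 0.7] at value 0 times p=[0.4, 0.6] at value 1
right = 0.5 * 0.4  # p=[0.5, 0.5] at value 0 times p=[0.6, 0.4] at value 1

# Product nodes multiply their children, sum nodes take a weighted average.
inner_sum = 0.3 * left + 0.7 * right
likelihood = 0.4 * (v0 * inner_sum) + 0.6 * (v0 * left)
log_lik = math.log(likelihood)
print(round(log_lik, 8), round(likelihood, 5))
```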

We can also compute the log-likelihood of the marginal SPN:

llm = log_likelihood(spn_marg, test_data)
print(llm, np.exp(llm))

Note that we used the same test_data input, as the SPN is still expecting a numpy array with data at columns 1 and 2, ignoring column 0. The output is:

[[-1.68416146]] [[0.1856]]

An alternative is marginal inference on the original SPN. To marginalize a feature on the fly, set it to np.nan in the input. This does not change the structure of the SPN.

test_data2 = np.array([np.nan, 0.0, 1.0]).reshape(-1, 3)
llom =  log_likelihood(spn, test_data2)
print(llom, np.exp(llom))

The output is exactly the same as the evaluation of the marginal SPN:

[[-1.68416146]] [[0.1856]]
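One way to see why a missing value marginalizes a variable: a leaf that returns 1 for a missing input has effectively been summed over all its states. The sketch below is a toy evaluator written for illustration, assuming that convention; the helper names `leaf`, `product`, and `weighted_sum` are hypothetical and are not part of SPFlow.

```python
import math

def leaf(p, scope):
    # Categorical leaf; a missing value (NaN) is summed out and contributes 1.
    def evaluate(row):
        x = row[scope]
        return 1.0 if math.isnan(x) else p[int(x)]
    return evaluate

def product(*children):
    # Product node: multiply the values of all children.
    return lambda row: math.prod(child(row) for child in children)

def weighted_sum(weights, children):
    # Sum node: weighted average of the children.
    return lambda row: sum(w * child(row) for w, child in zip(weights, children))

inner = weighted_sum([0.3, 0.7], [
    product(leaf([0.3, 0.7], 1), leaf([0.4, 0.6], 2)),
    product(leaf([0.5, 0.5], 1), leaf([0.6, 0.4], 2)),
])
toy_spn = weighted_sum([0.4, 0.6], [
    product(leaf([0.2, 0.8], 0), inner),
    product(leaf([0.2, 0.8], 0), leaf([0.3, 0.7], 1), leaf([0.4, 0.6], 2)),
])

full = toy_spn([1.0, 0.0, 1.0])            # all three variables observed
marginal = toy_spn([math.nan, 0.0, 1.0])   # variable 0 marginalized
print(full, marginal)
```

Both evaluations traverse the same structure once, which is why marginal queries cost no more than a full evaluation.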

We can use TensorFlow to do the evaluation on a GPU:

from spn.gpu.TensorFlow import eval_tf
lltf = eval_tf(spn, test_data)
print(lltf, np.exp(lltf))

The output is, as expected, equal to the one computed in Python:

[[-1.90730501]] [[0.14848]]

We can also use TensorFlow to do the parameter optimization on a GPU:

from spn.gpu.TensorFlow import optimize_tf
optimized_spn = optimize_tf(spn, test_data)
lloptimized = log_likelihood(optimized_spn, test_data)
print(lloptimized, np.exp(lloptimized))

As expected, the optimized SPN assigns a higher likelihood to the data:

[[-1.38152628]] [[0.25119487]]

We can generate new samples that follow the joint distribution captured by the SPN!

from numpy.random.mtrand import RandomState
from spn.algorithms.Sampling import sample_instances
print(sample_instances(spn, np.array([np.nan, np.nan, np.nan] * 5).reshape(-1, 3), RandomState(123)))

Here we created 5 new instances that follow the distribution:

[[0. 1. 0.]
 [1. 0. 0.]
 [1. 1. 0.]
 [1. 1. 1.]
 [1. 1. 0.]]

The np.nan values indicate the columns we want to sample.
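For intuition, unconditional sampling from an SPN can be done top-down: a sum node picks one child according to its weights, a product node visits all of its children, and a leaf draws a value for its variable. The sketch below is a standard-library illustration of that idea, assuming every column is missing. The tuple encoding and the names `leaf`, `TOY_SPN`, and `sample_node` are hypothetical, and because it uses a different random generator its draws will not match the output above.

```python
import random

# Nodes are tuples: ("leaf", p, scope), ("prod", children), ("sum", weights, children).
def leaf(p, scope):
    return ("leaf", p, scope)

TOY_SPN = ("sum", [0.4, 0.6], [
    ("prod", [leaf([0.2, 0.8], 0),
              ("sum", [0.3, 0.7], [
                  ("prod", [leaf([0.3, 0.7], 1), leaf([0.4, 0.6], 2)]),
                  ("prod", [leaf([0.5, 0.5], 1), leaf([0.6, 0.4], 2)]),
              ])]),
    ("prod", [leaf([0.2, 0.8], 0), leaf([0.3, 0.7], 1), leaf([0.4, 0.6], 2)]),
])

def sample_node(node, row, rng):
    kind = node[0]
    if kind == "leaf":
        _, p, scope = node
        # Draw a category index according to the leaf probabilities.
        row[scope] = rng.choices(range(len(p)), weights=p)[0]
    elif kind == "prod":
        for child in node[1]:  # a product node visits every child
            sample_node(child, row, rng)
    else:
        _, weights, children = node  # a sum node picks one child by weight
        sample_node(rng.choices(children, weights=weights)[0], row, rng)
    return row

rng = random.Random(123)
samples = [sample_node(TOY_SPN, [None, None, None], rng) for _ in range(5)]
print(samples)
```

This sketch ignores evidence entirely. With partially observed rows, the choice at each sum node would also have to account for how well each child explains the observed values.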

We can also do conditional sampling, that is, if we have evidence for some of the variables we can pass that information to the SPN and sample for the rest of the variables:

from numpy.random.mtrand import RandomState
from spn.algorithms.Sampling import sample_instances
print(sample_instances(spn, np.array([np.nan, 0, 0] * 5).reshape(-1, 3), RandomState(123)))

Here we created 5 new instances whose evidence is V1=0 and V2=0:

[[0. 0. 0.]
 [1. 0. 0.]
 [0. 0. 0.]
 [1. 0. 0.]
 [1. 0. 0.]]
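Conditional sampling draws the missing variable from its distribution given the evidence. For this small SPN, that conditional can be computed directly as a ratio of a joint and a marginal probability. The sketch below is plain arithmetic on the probabilities from the SPN definition; the function name `joint` is illustrative.

```python
# Joint probability of (V0=v0, V1=0, V2=0) under the SPN defined earlier.
def joint(v0):
    c0 = [0.2, 0.8][v0]                             # leaf over variable 0
    inner = 0.3 * (0.3 * 0.4) + 0.7 * (0.5 * 0.6)   # nested sum at V1=0, V2=0
    return 0.4 * c0 * inner + 0.6 * c0 * (0.3 * 0.4)

evidence = joint(0) + joint(1)                      # P(V1=0, V2=0)
posterior = [joint(v) / evidence for v in (0, 1)]   # P(V0 | V1=0, V2=0)
print(posterior)
```

The posterior is approximately [0.2, 0.8]: both root branches use the same leaf over variable 0, so in this particular SPN the evidence on V1 and V2 does not change the distribution of V0.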

We can do classification by learning an SPN from data and then comparing the probabilities for the given classes. Imagine we have a dataset generated by two Gaussians with means (5,5) and (10,10), where we label the cluster at (5,5) as class 0 and the cluster at (10,10) as class 1.

np.random.seed(123)
train_data = np.c_[np.r_[np.random.normal(5, 1, (500, 2)), np.random.normal(10, 1, (500, 2))],
                   np.r_[np.zeros((500, 1)), np.ones((500, 1))]]

We can learn an SPN from data:

from spn.algorithms.LearningWrappers import learn_parametric, learn_classifier
from spn.structure.leaves.parametric.Parametric import Categorical, Gaussian
from spn.structure.Base import Context
spn_classification = learn_classifier(train_data,
                       Context(parametric_types=[Gaussian, Gaussian, Categorical]).add_domains(train_data),
                       learn_parametric, 2)

Here, we model our problem as containing 3 features, two Gaussians for the coordinates and one Categorical for the label. We specify that the label is in column 2, and create the corresponding SPN.

Now, imagine we want to classify two instances, one located at (3,4) and another one at (12,18). To do that, we first create an array with two rows and 3 columns. We set the last column to np.nan to indicate that we don't know the labels, and we set the rest of the values in the 2D array accordingly.

test_classification = np.array([3.0, 4.0, np.nan, 12.0, 18.0, np.nan]).reshape(-1, 3)

The first row is the first instance, and the second row is the second instance:

[[ 3.  4. nan]
 [12. 18. nan]]

We can do classification via approximate most probable explanation (MPE). Here, we expect the first instance to be labeled as 0 and the second one as 1.

from spn.algorithms.MPE import mpe
print(mpe(spn_classification, test_classification))

As we can see, both instances are classified correctly, as the correct label is set in the last column:

[[ 3.  4.  0.]
 [12. 18.  1.]]
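The idea of "comparing the probabilities for the given classes" can be illustrated without a learned SPN. The sketch below assumes the true generating model (two unit-variance Gaussian clusters with equal priors) instead of the learned structure, and picks the label with the highest joint log-probability; `log_normal_pdf`, `CLASS_MEANS`, and `classify` are hypothetical names.

```python
import math

def log_normal_pdf(x, mean, std=1.0):
    # Log-density of a univariate Gaussian.
    return -0.5 * math.log(2 * math.pi * std ** 2) - (x - mean) ** 2 / (2 * std ** 2)

# Assumed model: the two generating clusters, each with prior 0.5.
CLASS_MEANS = {0: 5.0, 1: 10.0}

def classify(x, y):
    # Score each label by log P(label) + log P(x | label) + log P(y | label).
    scores = {
        label: math.log(0.5) + log_normal_pdf(x, m) + log_normal_pdf(y, m)
        for label, m in CLASS_MEANS.items()
    }
    return max(scores, key=scores.get)

print(classify(3.0, 4.0), classify(12.0, 18.0))  # prints: 0 1
```

Working in log space avoids underflow for points far from a cluster, such as (12, 18) under class 0.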

We can learn an MSPN and a parametric SPN from data:

import numpy as np
np.random.seed(123)

a = np.random.randint(2, size=1000).reshape(-1, 1)
b = np.random.randint(3, size=1000).reshape(-1, 1)
c = np.r_[np.random.normal(10, 5, (300, 1)), np.random.normal(20, 10, (700, 1))]
d = 5 * a + 3 * b + c
train_data = np.c_[a, b, c, d]

Here, we have a dataset containing four features, two discrete and two real-valued.

We can learn an MSPN with:

from spn.structure.Base import Context
from spn.structure.StatisticalTypes import MetaType

ds_context = Context(meta_types=[MetaType.DISCRETE, MetaType.DISCRETE, MetaType.REAL, MetaType.REAL])
ds_context.add_domains(train_data)

from spn.algorithms.LearningWrappers import learn_mspn

mspn = learn_mspn(train_data, ds_context, min_instances_slice=20)

We can learn a parametric SPN with:

from spn.structure.Base import Context
from spn.structure.leaves.parametric.Parametric import Categorical, Gaussian

ds_context = Context(parametric_types=[Categorical, Categorical, Gaussian, Gaussian]).add_domains(train_data)

from spn.algorithms.LearningWrappers import learn_parametric

spn = learn_parametric(train_data, ds_context, min_instances_slice=20)

Finally, we have some basic utilities for working with SPNs:

We can make sure that the SPN that we are using is valid, that is, it is consistent and complete.

from spn.algorithms.Validity import is_valid
print(is_valid(spn))

The output indicates that the SPN is valid and there are no debugging error messages:

(True, None)
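To give a feel for what such structural checks involve, the sketch below tests two common conditions on a toy tree: completeness (all children of a sum node cover the same variables) and decomposability (children of a product node cover disjoint variables, a slightly stronger condition than consistency). This is an illustration under those assumptions, not the implementation of is_valid; the tuple encoding and the names `scope_of` and `check` are hypothetical.

```python
# Nodes: ("leaf", scope), ("prod", children), ("sum", weights, children).
def scope_of(node):
    if node[0] == "leaf":
        return frozenset([node[1]])
    return frozenset().union(*(scope_of(c) for c in node[-1]))

def check(node):
    """Return (True, None) or (False, message), mirroring the output shape above."""
    if node[0] == "leaf":
        return True, None
    children = node[-1]
    scopes = [scope_of(c) for c in children]
    if node[0] == "sum":
        if abs(sum(node[1]) - 1.0) > 1e-9:
            return False, "sum weights do not add up to 1"
        if any(s != scopes[0] for s in scopes):
            return False, "sum node children have different scopes"
    elif sum(len(s) for s in scopes) != len(frozenset().union(*scopes)):
        return False, "product node children share variables"
    for child in children:
        ok, err = check(child)
        if not ok:
            return ok, err
    return True, None

l0, l1, l2 = ("leaf", 0), ("leaf", 1), ("leaf", 2)
good = ("sum", [0.4, 0.6], [("prod", [l0, l1, l2]), ("prod", [l0, ("prod", [l1, l2])])])
bad = ("sum", [0.5, 0.5], [("prod", [l0, l1]), ("prod", [l1, l2])])
print(check(good), check(bad))
```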

To compute basic statistics on the structure of the SPN:

from spn.algorithms.Statistics import get_structure_stats
print(get_structure_stats(spn))

Extending the library

Using the library is, as we have seen, relatively easy. However, we might need to extend it if we want to work with new distributions.

Imagine we wanted to create a new Leaf type that models the Pareto distribution. We start by creating a new class:

from spn.structure.leaves.parametric.Parametric import Leaf
class Pareto(Leaf):
    def __init__(self, a, scope=None):
        Leaf.__init__(self, scope=scope)
        self.a = a

Now, if we want to do inference with this new node type, we just implement the corresponding likelihood function:

def pareto_likelihood(node, data=None, dtype=np.float64):
    probs = np.ones((data.shape[0], 1), dtype=dtype)
    from scipy.stats import pareto
    probs[:] = pareto.pdf(data[:, node.scope], node.a)
    return probs

This function receives the node, the data on which to compute the probability and the numpy dtype for the result.

Now, we just need to register this function so that it can be used seamlessly by the rest of the infrastructure:

from spn.algorithms.Inference import add_node_likelihood
add_node_likelihood(Pareto, pareto_likelihood)

Now, we can create SPNs that use the new distribution and also evaluate them.

spn = 0.3 * Pareto(2.0, scope=0) + 0.7 * Pareto(3.0, scope=0)
log_likelihood(spn, np.array([1.5]).reshape(-1, 1))

This produces the output:

[[-0.52324814]]
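This value can be cross-checked with the closed-form Pareto density, b / x^(b+1) for x >= 1 (shape b, scale 1), which is what scipy.stats.pareto.pdf evaluates with its default location and scale. The sketch below uses only the standard library; `pareto_pdf` is an illustrative helper, not part of SPFlow.

```python
import math

def pareto_pdf(x, b):
    # Density of the Pareto distribution with shape b and scale 1.
    return b / x ** (b + 1) if x >= 1 else 0.0

x = 1.5
# Same mixture as the SPN above: 0.3 * Pareto(2.0) + 0.7 * Pareto(3.0).
mixture = 0.3 * pareto_pdf(x, 2.0) + 0.7 * pareto_pdf(x, 3.0)
print(math.log(mixture))
```

At x = 1.5 both components happen to have the same density (16/27), so the mixture weights do not affect the result for this particular input.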

All other aspects of the SPN library can be extended in a similar way.

Authors

Alejandro Molina - TU Darmstadt
Antonio Vergari - Max-Planck-Institute
Karl Stelzner - TU Darmstadt
Robert Peharz - University of Cambridge
Kristian Kersting - TU Darmstadt

See also the list of contributors who participated in this project.

License

This project is licensed under the Apache License, Version 2.0. See the LICENSE.md file for details.

Acknowledgments

Moritz Kulessa for the valuable code contributions

Release history

0.0.41

Nov 20, 2020

0.0.40

Feb 6, 2020

0.0.39

Nov 1, 2019

0.0.38

Sep 12, 2019

0.0.37

Sep 12, 2019

0.0.36

Jul 22, 2019

0.0.34

Mar 22, 2019

0.0.33

Mar 14, 2019

0.0.32

Mar 14, 2019

0.0.31

Mar 7, 2019

0.0.30

Feb 28, 2019

0.0.29

Feb 21, 2019

0.0.28

Jan 19, 2019

0.0.27

Jan 17, 2019

0.0.26

Jan 17, 2019

0.0.25

Jan 11, 2019

0.0.24

Jan 8, 2019

0.0.23

Jan 8, 2019

0.0.22

Dec 21, 2018

0.0.21

Dec 19, 2018

0.0.20

Dec 12, 2018

0.0.19

Dec 12, 2018

0.0.18

Dec 11, 2018

0.0.17

Dec 11, 2018

0.0.16

Dec 10, 2018

0.0.15

Dec 6, 2018

0.0.14

Dec 6, 2018

0.0.13

Dec 4, 2018

0.0.12

Nov 30, 2018

0.0.11

Nov 25, 2018

0.0.10

Nov 24, 2018

0.0.9

Nov 23, 2018

0.0.8

Nov 22, 2018

0.0.7

Oct 18, 2018

0.0.6

Oct 17, 2018

0.0.5

Oct 17, 2018

This version

0.0.4

Oct 15, 2018

0.0.3

Oct 8, 2018

0.0.2

Oct 8, 2018

0.0.1

Oct 8, 2018

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

spflow-0.0.4.tar.gz (89.1 kB view details)

Uploaded Oct 15, 2018 Source

Built Distribution

spflow-0.0.4-py3-none-any.whl (157.5 kB view details)

Uploaded Oct 15, 2018 Python 3

File details

Details for the file spflow-0.0.4.tar.gz.

File metadata

Download URL: spflow-0.0.4.tar.gz
Upload date: Oct 15, 2018
Size: 89.1 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.18.4 setuptools/40.4.3 requests-toolbelt/0.8.0 tqdm/4.23.4 CPython/3.5.2

File hashes

Hashes for spflow-0.0.4.tar.gz
Algorithm	Hash digest
SHA256	`590c9d7c4c6c53e9db744bccbea59475065bc4fb4c8d2502e7530d3d63c6b74a`
MD5	`72063a88e466b3ecf388dbadf680fda4`
BLAKE2b-256	`becf5aba7c374c61d6ba74584c08e156418ccc80be96ab498174cef8b364fe33`

See more details on using hashes here.

File details

Details for the file spflow-0.0.4-py3-none-any.whl.

File metadata

Download URL: spflow-0.0.4-py3-none-any.whl
Upload date: Oct 15, 2018
Size: 157.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.18.4 setuptools/40.4.3 requests-toolbelt/0.8.0 tqdm/4.23.4 CPython/3.5.2

File hashes

Hashes for spflow-0.0.4-py3-none-any.whl
Algorithm	Hash digest
SHA256	`85cd38a00effcb260fa9488527ba22af078b0e9f3172169457d89f9098d11ffc`
MD5	`7cb75e6a874294b45b429374f7ea260f`
BLAKE2b-256	`92b20b261515eaa3e16b66237b09d597f11dade340561f093dae70b211b2effa`

See more details on using hashes here.
