Empirical Information Bottleneck

Development Status
- 4 - Beta
Environment
- Console
Intended Audience
- Science/Research
License
- OSI Approved :: GNU General Public License v3 or later (GPLv3+)
Operating System
- OS Independent
Programming Language
- Python :: 3
Topic
- Scientific/Engineering :: Information Analysis

Project description

EMBO - Empirical Bottleneck

A Python implementation of the Information Bottleneck analysis framework [Tishby, Pereira, Bialek 2001], especially geared towards the analysis of concrete, finite-size data sets.

Requirements

embo requires Python 3 and numpy.

Installation

To install the latest release, run:

pip install embo

(depending on your system, you may need to use pip3 instead of pip in the command above).

Testing

Running the tests requires setuptools. If embo is already installed on your system, look for the copy of the test_embo.py script installed alongside the rest of the embo files and execute it. For example:

python /usr/lib/python3.X/site-packages/embo/test_embo.py

Alternatively, if you have downloaded the source, from within the root folder of the source distribution run:

python setup.py test

This should run through all tests specified in embo/test.

Usage

The Information Bottleneck

We refer to [Tishby, Pereira, Bialek 2001] for a general introduction to the Information Bottleneck. Briefly, if X and Y are two random variables, we are interested in finding another random variable M (called the "bottleneck" variable) that solves the following optimisation problem:

min_{p(m|x)}I(M:X) - β I(M:Y)

for any β>0, where M is constrained to be independent of Y conditional on X:

p(x,m,y) = p(x)p(m|x)p(y|x)
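To make the objective concrete, here is a minimal sketch that evaluates I(M:X) - β I(M:Y) for a given encoder p(m|x), using the conditional independence constraint above to obtain p(m,y). It is not part of embo: the function names `mutual_information` and `ib_objective` are hypothetical, and the joint distribution is a made-up toy example.

```python
import numpy as np

def mutual_information(pab):
    """Mutual information (in bits) of a joint distribution given as a 2D array."""
    pa = pab.sum(axis=1, keepdims=True)   # marginal of the row variable
    pb = pab.sum(axis=0, keepdims=True)   # marginal of the column variable
    mask = pab > 0                        # skip empty cells (0 log 0 = 0)
    return float(np.sum(pab[mask] * np.log2(pab[mask] / (pa @ pb)[mask])))

def ib_objective(pxy, pm_x, beta):
    """Value of I(M:X) - beta * I(M:Y) for an encoder pm_x[m, x] = p(m|x)."""
    px = pxy.sum(axis=1)                  # p(x)
    pmx = pm_x * px                       # p(m, x) = p(m|x) p(x)
    pmy = pm_x @ pxy                      # p(m, y) = sum_x p(m|x) p(x, y)
    return mutual_information(pmx) - beta * mutual_information(pmy)

# toy joint distribution p(x, y): rows index x, columns index y (illustrative values)
pxy = np.array([[0.4, 0.1],
                [0.1, 0.4]])

copy_encoder = np.eye(2)             # M is a copy of X: nothing is forgotten
constant_encoder = np.ones((1, 2))   # M is constant: everything is forgotten

for beta in (0.5, 2.0, 10.0):
    print(beta,
          ib_objective(pxy, copy_encoder, beta),
          ib_objective(pxy, constant_encoder, beta))
```

The constant encoder always scores 0, while the copy encoder scores H(X) - β I(X:Y), so copying X only beats forgetting everything once β is large enough.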

Intuitively, we want to find the stochastic mapping p(M|X) that extracts from X as much information about Y as possible while forgetting all irrelevant information. β is a free parameter that sets the relative importance of forgetting irrelevant information versus remembering useful information. Usually, one is interested in the curve described by I(M:X) and I(M:Y) at the solution of the bottleneck problem for a range of values of β. This curve gives the optimal tradeoff of compression and prediction, telling us what is the minimum amount of information one needs to know about X to be able to predict Y to a certain accuracy, or vice versa, what is the maximum accuracy one can have in predicting Y given a certain amount of information about X.
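As an illustration of how such a curve can be traced when the joint distribution is known, the sketch below runs the standard iterative (Blahut-Arimoto-style) IB updates for several values of β. This is not necessarily how embo computes the bound internally; the names `ib_point` and `ib_curve`, the iteration count, the small constant `eps` and the toy distribution are all assumptions made for this example.

```python
import numpy as np

def mutual_information(pab):
    """Mutual information (in bits) of a joint distribution given as a 2D array."""
    pa = pab.sum(axis=1, keepdims=True)
    pb = pab.sum(axis=0, keepdims=True)
    mask = pab > 0
    return float(np.sum(pab[mask] * np.log2(pab[mask] / (pa @ pb)[mask])))

def ib_point(pxy, beta, n_m=None, n_iter=500, seed=0, eps=1e-12):
    """Return (I(M:X), I(M:Y)) after iterating the IB updates at a given beta."""
    rng = np.random.default_rng(seed)
    n_x = pxy.shape[0]
    n_m = n_x if n_m is None else n_m
    px = pxy.sum(axis=1)                        # p(x), assumed > 0 for every x
    py_x = pxy / px[:, None]                    # p(y|x), one row per x
    pm_x = rng.random((n_m, n_x))               # random initial encoder p(m|x)
    pm_x /= pm_x.sum(axis=0, keepdims=True)
    for _ in range(n_iter):
        pm = pm_x @ px                          # p(m) = sum_x p(m|x) p(x)
        py_m = ((pm_x * px) @ py_x) / np.maximum(pm, eps)[:, None]   # p(y|m)
        # kl[m, x] = KL(p(y|x) || p(y|m)) in nats
        log_ratio = np.log(py_x[None, :, :] + eps) - np.log(py_m[:, None, :] + eps)
        kl = np.sum(py_x[None, :, :] * log_ratio, axis=2)
        # p(m|x) is proportional to p(m) * exp(-beta * kl[m, x])
        log_w = np.log(pm + eps)[:, None] - beta * kl
        log_w -= log_w.max(axis=0, keepdims=True)   # stabilise the exponentials
        pm_x = np.exp(log_w)
        pm_x /= pm_x.sum(axis=0, keepdims=True)
    return mutual_information(pm_x * px), mutual_information(pm_x @ pxy)

def ib_curve(pxy, betas):
    """Evaluate ib_point for each beta and return arrays of I(M:X) and I(M:Y)."""
    points = [ib_point(pxy, b) for b in betas]
    return np.array([p[0] for p in points]), np.array([p[1] for p in points])

# toy joint distribution p(x, y) (illustrative values)
pxy = np.array([[0.4, 0.1],
                [0.1, 0.4]])
betas = np.array([0.01, 1.0, 5.0, 50.0])
i_x, i_y = ib_curve(pxy, betas)
for b, ix, iy in zip(betas, i_x, i_y):
    print(f"beta={b:g}  I(M:X)={ix:.4f}  I(M:Y)={iy:.4f}")
```

Because the problem is not convex, these updates may settle in a local optimum, so restarting from several seeds and keeping the best result is a common precaution. Whatever the encoder, I(M:Y) can exceed neither I(M:X) nor I(X:Y).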

Using `embo`

In embo, we assume that the true joint distribution of X and Y is not available, and that we only have a set of joint empirical observations. We also assume that X and Y both take on a finite number of discrete values. The main point of entry to the package is the EmpiricalBottleneck class. Its constructor takes an array of observations for X and an equally long array of observations for Y, together with other optional parameters (see the docstring for details). In the most basic use case, users can call the get_empirical_bottleneck method of an EmpiricalBottleneck object, which returns the optimal values of I(M:X) and I(M:Y) together with the corresponding β values. The optimal tradeoff can then be visualised by plotting I(M:Y) vs I(M:X).

For instance:

import numpy as np
from matplotlib import pyplot as plt
from embo import EmpiricalBottleneck

# data sequences: paired observations of X and Y (equal length)
x = np.array([0,0,0,1,0,1,0,1,0,1])
y = np.array([0,1,0,1,0,1,0,1,0,1])

# compute the IB bound from the data
I_x,I_y,β = EmpiricalBottleneck(x,y).get_empirical_bottleneck()

# plot the optimal compression-prediction bound
plt.plot(I_x,I_y)
plt.xlabel("I(M:X)")
plt.ylabel("I(M:Y)")
plt.show()
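To see what working from empirical observations involves, the following sketch builds a naive plug-in estimate of the joint distribution from the same two sequences and computes H(X) and I(X:Y), which bound I(M:X) and I(M:Y) respectively. It uses only numpy and is not a description of embo's internals; `empirical_joint`, `entropy` and `mutual_information` are hypothetical helper names.

```python
import numpy as np

def empirical_joint(x, y):
    """Plug-in estimate of p(x, y) from two equally long discrete sequences."""
    x = np.asarray(x).ravel()
    y = np.asarray(y).ravel()
    if x.shape != y.shape:
        raise ValueError("x and y must contain the same number of observations")
    # map each observed symbol to an integer index 0..K-1
    _, xi = np.unique(x, return_inverse=True)
    _, yi = np.unique(y, return_inverse=True)
    counts = np.zeros((xi.max() + 1, yi.max() + 1))
    np.add.at(counts, (xi, yi), 1)        # count co-occurrences of (x, y) pairs
    return counts / counts.sum()

def entropy(p):
    """Shannon entropy (in bits) of a probability vector."""
    p = p[p > 0]
    return float(-np.sum(p * np.log2(p)))

def mutual_information(pab):
    """Mutual information (in bits) of a joint distribution given as a 2D array."""
    pa = pab.sum(axis=1, keepdims=True)
    pb = pab.sum(axis=0, keepdims=True)
    mask = pab > 0
    return float(np.sum(pab[mask] * np.log2(pab[mask] / (pa @ pb)[mask])))

# same data sequences as in the example above
x = np.array([0, 0, 0, 1, 0, 1, 0, 1, 0, 1])
y = np.array([0, 1, 0, 1, 0, 1, 0, 1, 0, 1])

pxy_hat = empirical_joint(x, y)
h_x = entropy(pxy_hat.sum(axis=1))        # upper bound on I(M:X)
i_xy = mutual_information(pxy_hat)        # upper bound on I(M:Y)
print(pxy_hat)
print("H(X) =", h_x, " I(X:Y) =", i_xy)
```

With only ten samples such plug-in estimates can be strongly biased, which is one reason to treat finite data sets with care.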

More examples

A simple example of usage with synthetic data is located at embo/examples/Basic-Example.ipynb. A more meaningful example is located at embo/examples/Markov-Chains.ipynb, where we compute the Information Bottleneck between the past and the future of time series generated from different Markov chains.

Further details

For more details, please consult the docstrings for empirical_bottleneck and IB.

Authors

embo is maintained by Eugenio Piasini, Alexandre Filipowicz and Jonathan Levine.

Release history

- 1.1.0: Feb 22, 2021
- 1.0.6: Feb 6, 2021
- 1.0.5: Feb 6, 2021
- 1.0.4 (this version): Feb 5, 2021
- 1.0.3: Feb 5, 2021
- 1.0.2: May 6, 2020
- 1.0.1: Feb 4, 2020
- 1.0.0: Feb 3, 2020
- 0.4.0: Jan 23, 2020
- 0.3.3: Jan 23, 2020
- 0.3.1: Jan 21, 2020
- 0.3.1.dev0 (pre-release): Jan 21, 2020
- 0.3.0: Dec 20, 2019

Download files

Source Distribution

embo-1.0.4.tar.gz (261.5 kB), uploaded Feb 5, 2021

Algorithm	Hash digest
SHA256	`17fc7f232b7e413fe3f0f7f8d7bc42a14e56f9544e73af7c65bb869feff3a7c9`
MD5	`d80d406d443f461aca86d6995b2acc8a`
BLAKE2b-256	`40b99baf4327fec44e5e61dfa4c982d5880e52ffab0458ad7b79f09bf1e3efc3`

Built Distribution

embo-1.0.4-py3-none-any.whl (260.5 kB), uploaded Feb 5, 2021, Python 3

Algorithm	Hash digest
SHA256	`dc2c5598b35c5cf063a425fba182b264215c6d70ff0c332a78eb940f04ba7c0e`
MD5	`ae686baaa4bd1dd30457520f482120d7`
BLAKE2b-256	`97781e66d75bdb77a404785058a91aa1d2979acdd6a2aca71540b3003d052d7d`
