Skip to main content

Python package for the Kemeny Decomposition Algorithm (KDA) together with some Markov chain tooling.

Project description

PyPI version ALNS codecov

pykda (you say "pie-k-d-a") is a Python package for the Kemeny Decomposition Algorithm (KDA) which allows to decompose a Markov chain into clusters of states, where states within a cluster are relatively more connected to each other than states outside the cluster. This is useful for analyzing influence graphs, such as social networks and internet networks. KDA was developed in the paper from Berkhout and Heidergott (2019) and uses the Kemeny constant as a connectivity measure.

Installing pykda

Package pykda depends on numpy, tarjan and pyvis. Use the package manager pip to install PyKDA

pip install pykda

Getting started

The first step is to load a Markov chain as a MarkovChain object using a transition matrix P.

from pykda.Markov_chain import MarkovChain

P = [[0, 0.3, 0.7, 0, 0],
     [0.7, 0.2, 0.09, 0, 0.01],
     [0.5, 0.25, 0.25, 0, 0],
     [0, 0, 0, 0.5, 0.5],
     [0.01, 0, 0, 0.74, 0.25]]  # artificial transition matrix
MC = MarkovChain(P)

We can study some properties of the Markov chain, such as the stationary distribution:

print(MC.stationary_distribution.flatten().round(3))

This gives [0.226 0.156 0.23 0.232 0.156]. We can also plot the Markov chain:

MC.plot(file_name="An artificial Markov chain")

Now, let us decompose the Markov chain into clusters using KDA. We start by initializing a KDA object using the Markov chain and the KDA settings (such as the number of clusters). For more details about setting choices, see the KDA documentation or Berkhout and Heidergott (2019). Here, we apply the default settings, which is to cut all edges with a negative Kemeny constant derivative and normalizing the transition matrix afterward.

kda = KDA(
    original_MC=MC, CO_A="CO_A_1(1)", CO_B="CO_B_3(0)", symmetric_cut=False
    )

Now, let us run the KDA algorithm and visualize the results.

kda.run()
kda.plot(file_name="An artificial Markov chain after KDA_A1_1_B3_0")

We can study the resulting Markov chain in more detail via the current Markov chain attribute MC of the KDA object.

print(kda.MC)

This gives the following output:

MC with 5 states.
Ergodic classes: [[2, 0], [3]].
Transient classes: [[1], [4]].

So KDA led to a Markov multi-chain with two ergodic classes and two transient classes. We can also study the edges that KDA cut via the log attribute of the KDA object.

print(kda.log['edges cut'])

This gives the following output:

[[None], [(4, 0), (1, 4), (2, 1), (0, 1), (3, 4)]]

We can also study the Markov chains that KDA found in each (outer) iteration via kda.log['Markov chains'])`.

As another KDA application example, let us apply KDA until we find two ergodic classes explicitly. We will also ensure that the Kemeny constant derivatives are recalculated after each cut (and normalize the cut transition matrix to ensure it is a stochastic matrix again). To that end, we use:

kda2 = KDA(
    original_MC=MC, CO_A="CO_A_2(2)", CO_B="CO_B_1(1)", symmetric_cut=False
    )
kda2.run()
kda2.plot(file_name="An artificial Markov chain after KDA_A2_2_B1_1")

which gives (edges (4, 0) and (1, 4) are cut in two iterations):

How to learn more about pykda?

To learn more about pykda have a look at the documentation. There, you can also find links to interactive Google Colab notebooks in examples. If you have any questions, feel free to open an issue here on Github Issues.

How to cite pykda?

If you use pykda in your research, please consider citing the following paper:

Joost Berkhout, Bernd F. Heidergott (2019). Analysis of Markov influence graphs. Operations Research, 67(3):892-904. https://doi.org/10.1287/opre.2018.1813

Or, using the following BibTeX entry:

@article{Berkhout_Heidergott_2019,
	title = {Analysis of {Markov} influence graphs},
	volume = {67},
	number = {3},
	journal = {Operations Research},
	author = {Berkhout, J. and Heidergott, B. F.},
	year = {2019},
	pages = {892--904},
}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pykda-0.9.3.tar.gz (67.9 kB view details)

Uploaded Source

Built Distribution

pykda-0.9.3-py3-none-any.whl (76.3 kB view details)

Uploaded Python 3

File details

Details for the file pykda-0.9.3.tar.gz.

File metadata

  • Download URL: pykda-0.9.3.tar.gz
  • Upload date:
  • Size: 67.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.3 CPython/3.10.14 Linux/6.5.0-1023-azure

File hashes

Hashes for pykda-0.9.3.tar.gz
Algorithm Hash digest
SHA256 a58ac934a038fd466f314122148622c8c7c7877dd7519f4cdc70e5d41353be28
MD5 2bbfe68927a9f61fcfefcef39e0842f7
BLAKE2b-256 d83ae495ec833677fb29df9013299db760af3fe9663e3d3e0674c9fa5d73c745

See more details on using hashes here.

File details

Details for the file pykda-0.9.3-py3-none-any.whl.

File metadata

  • Download URL: pykda-0.9.3-py3-none-any.whl
  • Upload date:
  • Size: 76.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.3 CPython/3.10.14 Linux/6.5.0-1023-azure

File hashes

Hashes for pykda-0.9.3-py3-none-any.whl
Algorithm Hash digest
SHA256 34deb38af8314c64432701f3e33ddebba9691f8996d65b79cfc338679c0ab3a0
MD5 2605af51561e024c2c9c65e9d748ba5f
BLAKE2b-256 fdb120ed07581e8a9319b1d495cc95bce591188a5cba4471d327115d43d6e63b

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page