tempowork

A library for mining temporal networks where time is represented as a continuous dimension!

Project description

Frequent subgraph mining in temporal networks

The tempowork Python library comprises a series of algorithms for mining temporal networks where time is represented as a continuous dimension. In the current version, given a user-defined frequency threshold supp in {1,2,...,n}, it can mine all the frequent subgraphs in a data set of n temporal networks, that is, the subgraphs that appear in at least supp networks of the data set. The mining can be executed under one of the following four definitions of isomorphism:

Exact-time isomorphism

Inexact-time isomorphism

Sequence-preserved exact-time isomorphism

Sequence-preserved inexact-time isomorphism
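
To make the role of the supp threshold concrete, here is a minimal sketch of support counting on a toy data set. It is not the library's mining algorithm: each network is modelled as a plain set of pattern identifiers, whereas tempowork compares subgraphs under one of the isomorphism definitions listed above. The function name frequent_items is a hypothetical helper introduced only for this illustration.

```python
from collections import Counter


def frequent_items(networks, supp):
    """Return the patterns that occur in at least `supp` of the given networks.

    Assumption: each network is modelled as an iterable of hashable pattern
    identifiers. Real subgraph mining needs isomorphism tests instead.
    """
    n = len(networks)
    if not 1 <= supp <= n:
        raise ValueError(f"supp must be in 1..{n}, got {supp}")
    counts = Counter()
    for patterns in networks:
        counts.update(set(patterns))  # count each pattern once per network
    return {pattern for pattern, count in counts.items() if count >= supp}


# Three toy networks; "a-b" occurs in all three, "b-c" in two, "c-d" in one.
toy_networks = [{"a-b", "b-c"}, {"a-b"}, {"a-b", "b-c", "c-d"}]
print(sorted(frequent_items(toy_networks, 2)))
```

Raising supp toward n keeps only the patterns shared by more networks, so the result set can only shrink.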

Mining evolving patterns in temporal networks

Coming soon!

Install

Install the latest version of tempowork:

pip install tempowork

Usage

The main function can be executed as follows:

import tempowork as tw
test = tw.frequent(file_name, support, isomorphism = 'e', disc_threshold = 0.05, binning=False, nbins = 10, directed = False)

Arguments

file_name: a string naming the data set file of temporal networks (a .txt file).
support: an integer in {1,2,...,n} giving the minimum number of networks in the data set that must contain a pattern for the pattern to be considered frequent.
isomorphism: the definition under which two networks are considered isomorphic. It takes one of the following options:
- 'e': Exact-time isomorphism
- 'i': Inexact-time isomorphism
- 'es': Sequence-preserved exact-time isomorphism
- 'is': Sequence-preserved inexact-time isomorphism
disc_threshold: a user-defined threshold used by the 'i' and 'is' isomorphism definitions. After all durations are sorted, two consecutive durations are considered inexactly identical if their difference, divided by the first of the two, does not exceed disc_threshold.
binning: used by the 'i' and 'is' isomorphism definitions; it takes one of the following values:
- False: inexactly identical durations are determined by disc_threshold.
- 'width': the algorithm uses equal-width bins to find inexactly identical values.
- 'frequency': the algorithm uses an equal-frequency strategy to find inexactly identical values.
nbins: the number of bins for the 'width' and 'frequency' options of binning. It applies to the 'i' and 'is' isomorphism definitions when binning is not False.
directed: specifies whether the edges of the networks in the data set are directed.
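
The three ways of deciding which durations count as inexactly identical can be sketched as follows. This is one reading of the argument descriptions above, not the library's internal code: the function names are hypothetical, chaining consecutive durations into one group is an assumption, and the handling of ties and zero durations is a choice made for this example only.

```python
def group_by_relative_gap(durations, disc_threshold=0.05):
    """Group sorted durations whose consecutive relative gap is small enough.

    Assumption: the gap is (next - previous) / previous, and a run of
    qualifying consecutive durations forms a single group.
    """
    ordered = sorted(durations)
    if not ordered:
        return []
    groups = [[ordered[0]]]
    for prev, cur in zip(ordered, ordered[1:]):
        # Equal values always share a group; prev > 0 avoids division by zero.
        if cur == prev or (prev > 0 and (cur - prev) / prev <= disc_threshold):
            groups[-1].append(cur)
        else:
            groups.append([cur])
    return groups


def equal_width_bins(durations, nbins):
    """Return a bin index per duration using nbins intervals of equal width."""
    if not durations:
        return []
    lo, hi = min(durations), max(durations)
    if lo == hi:
        return [0] * len(durations)
    width = (hi - lo) / nbins
    # The maximum value would land in bin nbins, so clamp it into the last bin.
    return [min(int((d - lo) / width), nbins - 1) for d in durations]


def equal_frequency_bins(durations, nbins):
    """Return a bin index per duration so that bins hold roughly equal counts.

    Note: this simple rank-based version may place tied values in
    different bins.
    """
    order = sorted(range(len(durations)), key=lambda i: durations[i])
    labels = [0] * len(durations)
    for rank, i in enumerate(order):
        labels[i] = rank * nbins // len(durations)
    return labels


print(group_by_relative_gap([80, 82, 70, 60, 40, 41], 0.05))
print(equal_width_bins([40, 60, 70, 80], 4))
print(equal_frequency_bins([40, 60, 70, 80], 2))
```

With a threshold, the number of groups depends on the data; with binning, nbins fixes an upper bound on the number of groups regardless of how the durations are spread.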

Important

Each network in the temporal network data set comprises a list of temporal edges, ordered by their starting time. Each line in the data set is either a network header, t # net_id, where net_id is the sequential number of the network, or a temporal edge, e v1_id v2_id v1_lbl e_lbl v2_lbl st dt, where st and dt are the starting point and duration of the edge, respectively. Note that all identifiers, labels, time points, and durations should be integer values. The following lines show an example of a data set composed of two networks, each consisting of four temporal edges:
t # 0
e 0 1 1 1 1 0 80
e 1 2 1 3 1 20 70
e 0 2 1 2 1 30 60
e 1 3 1 4 1 60 40
t # 1
e 0 1 1 1 1 0 80
e 1 2 1 3 1 20 70
e 0 2 1 2 1 30 60
e 1 3 1 4 1 60 40
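
Before passing a file to the library, it can help to check that it follows the format described above. The following is a minimal sketch of a reader for that format. The names TemporalEdge and parse_temporal_networks are hypothetical helpers for this illustration and are not part of the tempowork API.

```python
from typing import NamedTuple


class TemporalEdge(NamedTuple):
    """One 'e' record, with fields in the order given in the format description."""
    v1_id: int
    v2_id: int
    v1_lbl: int
    e_lbl: int
    v2_lbl: int
    st: int  # starting point
    dt: int  # duration


def parse_temporal_networks(text):
    """Parse 't # net_id' and 'e ...' lines into {net_id: [TemporalEdge, ...]}."""
    networks = {}
    current = None
    for lineno, raw in enumerate(text.splitlines(), start=1):
        parts = raw.split()
        if not parts:
            continue  # skip blank lines
        if parts[0] == "t" and len(parts) == 3 and parts[1] == "#":
            current = int(parts[2])
            networks[current] = []
        elif parts[0] == "e" and len(parts) == 8 and current is not None:
            # int() raises ValueError on non-integer fields, as the format requires.
            edge = TemporalEdge(*map(int, parts[1:]))
            edges = networks[current]
            if edges and edge.st < edges[-1].st:
                raise ValueError(f"line {lineno}: edges must be ordered by starting time")
            edges.append(edge)
        else:
            raise ValueError(f"line {lineno}: unrecognised record {raw!r}")
    return networks


sample = """\
t # 0
e 0 1 1 1 1 0 80
e 1 2 1 3 1 20 70
e 0 2 1 2 1 30 60
e 1 3 1 4 1 60 40
t # 1
e 0 1 1 1 1 0 80
e 1 2 1 3 1 20 70
e 0 2 1 2 1 30 60
e 1 3 1 4 1 60 40
"""

parsed = parse_temporal_networks(sample)
for net_id, edges in parsed.items():
    print(net_id, len(edges), [edge.st + edge.dt for edge in edges])
```

The last column printed is each edge's end time (st + dt), which is a convenient sanity check on the starting points and durations.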

Examples

Here are a few examples of using the tempowork library to mine frequent subgraphs by adopting different isomorphism definitions and parameters. The corresponding text files are provided in the test folder.

import tempowork as tw
exact_example = tw.frequent('exact.txt', 2, isomorphism = 'e')
inexact_example = tw.frequent('inexact.txt', 2, isomorphism = 'i', disc_threshold = 0.05)
seq_exact_example = tw.frequent('seq_exact.txt', 2, isomorphism = 'es')
seq_inexact_example = tw.frequent('seq_inexact.txt', 2, isomorphism = 'is', disc_threshold = 0.5)
seq_inexact_example = tw.frequent('seq_inexact.txt', 2, isomorphism = 'is', binning = 'width', nbins = 10)
seq_inexact_example = tw.frequent('seq_inexact.txt', 2, isomorphism = 'is', binning = 'frequency', nbins = 10)

Then, the results can be examined using:

number_of_frequent_patterns = exact_example.frequent_cntr
frequent_patterns_detected = exact_example.frequent_patterns

Request for feedback (It remains a work in progress!)

The implementation of this algorithm requires multiple components, such as interval trees, constrained interval graphs, different definitions of isomorphism, and …, to work seamlessly together. I tried to implement them accordingly. So, if you encounter any strange behavior, I would be happy to hear about your experience for further improvements. Please feel free to reach out via email (ali.jazayeri@drexel.edu).

Citation

Paper: To Be Provided!

Release history

Version	Release date
0.1.5	Jan 9, 2024
0.1.4	Jan 9, 2024
0.1.3	Jan 9, 2024
0.1.2	Jan 9, 2024
0.1.1	Jan 9, 2024
0.1.0	May 14, 2021
0.0.23	May 11, 2021
0.0.22	May 11, 2021
0.0.21	May 11, 2021
0.0.20	May 11, 2021
0.0.19	May 11, 2021
0.0.18 (this version)	May 11, 2021
0.0.17	May 11, 2021
0.0.16	May 11, 2021
0.0.15	May 11, 2021
0.0.14	May 11, 2021
0.0.13	May 11, 2021
0.0.12	May 11, 2021
0.0.11	May 11, 2021
0.0.10	May 11, 2021
0.0.9	May 11, 2021
0.0.8	May 11, 2021
0.0.7	May 11, 2021
0.0.6	May 11, 2021
0.0.5	May 11, 2021
0.0.4	May 11, 2021
0.0.3	May 11, 2021
0.0.2	May 11, 2021
0.0.1	Apr 2, 2021

Download files

Source Distribution

tempowork-0.0.18.tar.gz (16.6 kB)

Uploaded May 11, 2021 Source

Built Distribution

tempowork-0.0.18-py3-none-any.whl (24.1 kB)

Uploaded May 11, 2021 Python 3

Hashes for tempowork-0.0.18.tar.gz
Algorithm	Hash digest
SHA256	`518d66cd5895de8b7e1ad3a2a20c28137ed66af2a52f57eb6efce5c23196a78c`
MD5	`fd3a0c59f1b2766ad7004758440f7eb1`
BLAKE2b-256	`f7a66769f5fdbe5b275e7000f7681a1fe5dbd1cce168408831e0f5e5b06c8d43`

Hashes for tempowork-0.0.18-py3-none-any.whl
Algorithm	Hash digest
SHA256	`d79453f96f26ced5add4ad19795a4cc469ba00be6695005cefb0f3a00d032d8c`
MD5	`fa4fb17039fbc2174052465382cc77d8`
BLAKE2b-256	`52bda3dfb80eda5f29256425bbe0625402ed9cddbb47081bed6222ad9761c969`
