A real-time data processing pipeline

logicsponge-processmining is a library for process-mining tasks, built on logicsponge-core. Process mining is a family of techniques for modeling, analyzing, and improving business processes.

In a nutshell

The current implementation includes the following features:

  • Event-log prediction in both batch and streaming modes, using frequency prefix trees, n-grams, LSTMs, and ensemble methods.
  • Visualization of event streams based on their prefix trees.

Getting started

We recommend starting with our logicsponge tutorial to get acquainted with the basics of how logicsponge processes data streams. Afterwards, to get started with logicsponge-processmining, install it using pip:

pip install logicsponge-processmining
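
To verify the installation, you can import one of the classes used in the examples below:

python -c "from logicsponge.processmining.models import BasicMiner"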

Event-log prediction

Event-log prediction is the task of anticipating future events given historical data about a process. In the streaming case, we receive a sequence of events, where each event is a pair (case ID, activity); the activity is also referred to as an action. As events arrive, we train a model incrementally, allowing it to predict the next activity for a given case based on the sequence of activities observed so far.
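
To make the event format concrete, here is a toy fragment of such a stream (the case IDs and activities are invented for illustration; the case_id and activity keys match those used in the pipeline below):

# A hypothetical event stream: two interleaved cases.
events = [
    {"case_id": "A", "activity": "register"},
    {"case_id": "B", "activity": "register"},
    {"case_id": "A", "activity": "triage"},
    {"case_id": "B", "activity": "blood test"},
    {"case_id": "A", "activity": "release"},
]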

logicsponge-processmining offers several predefined models: frequency prefix trees, n-grams, LSTMs, and ensemble methods (soft, hard, and adaptive voting).

Let’s walk through the required imports to understand the structure of the library:

# example.py

import logicsponge.core as ls
from logicsponge.processmining.algorithms_and_structures import Bag, FrequencyPrefixTree, NGram
from logicsponge.processmining.models import BasicMiner, SoftVoting
from logicsponge.processmining.streaming import IteratorStreamer, StreamingActivityPredictor
from logicsponge.processmining.test_data import dataset

This imports the algorithmic building blocks, such as frequency prefix trees and n-grams. These classes can also serve as a basis for defining your own data structures.

The model classes work as follows:

  • BasicMiner wraps a single algorithm (e.g., an n-gram) to produce a predictor model.
  • SoftVoting (and other ensemble methods) takes a list of models and produces a new model that applies soft voting.

Instances of these classes are ready for batch learning. To use them in streaming mode, wrap them with StreamingActivityPredictor. Below, we define two models:

  • The first is a 6-gram (look-back window size of 5).
  • The second combines several algorithms using soft voting.

Setting "include_stop": False ignores stop predictions and renormalizes the remaining probabilities. This is often suitable in streaming settings, unless explicit stop activities are present.

config = {
    "include_stop": False,
}

model1 = StreamingActivityPredictor(
    strategy=BasicMiner(algorithm=NGram(window_length=5), config=config),
)

model2 = StreamingActivityPredictor(
    strategy=SoftVoting(
        models=[
            BasicMiner(algorithm=Bag()),
            BasicMiner(algorithm=FrequencyPrefixTree(min_total_visits=10)),
            BasicMiner(algorithm=NGram(window_length=2)),
            BasicMiner(algorithm=NGram(window_length=3)),
            BasicMiner(algorithm=NGram(window_length=4)),
        ],
        config=config,
    )
)
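
Conceptually, soft voting averages the probability distributions produced by the member models and predicts the activity with the highest averaged probability. The following is a minimal, self-contained sketch of the idea, not the library's internal code:

# Soft voting, illustrated: average the models' distributions
# and return the activity with the highest combined probability.
def soft_vote(distributions: list[dict[str, float]]) -> str:
    combined: dict[str, float] = {}
    for dist in distributions:
        for activity, p in dist.items():
            combined[activity] = combined.get(activity, 0.0) + p / len(distributions)
    return max(combined, key=combined.get)

# Two hypothetical model outputs for the next activity:
soft_vote([
    {"triage": 0.7, "release": 0.3},
    {"triage": 0.4, "release": 0.6},
])  # -> 'triage' (0.55 vs. 0.45)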

Next, we set up the sponge to stream data from a dataset and apply a model. For clarity, a key filter first restricts each event to its case_id and activity fields.

The dataset can be any iterator. For illustration, we use the Sepsis dataset available at 4TU.ResearchData. When you run the Python script, you will be prompted to download it.

streamer = IteratorStreamer(data_iterator=dataset)

sponge = (
    streamer
    * ls.KeyFilter(keys=["case_id", "activity"])
    * model2
    * ls.AddIndex(key="index", index=1)
    * ls.Print()
)


sponge.start()

A single prediction might look like this. In addition to the actual case_id and activity, it provides:

  • The most likely predicted activity.
  • The top-3 activities.
  • The probability distribution over all possible activities.

{
    'case_id': 'FAA',
    'activity': 'Return ER',
    'prediction': {
        'activity': 'Return ER',
        'top_k_actions': ['Return ER', 'Leucocytes', 'Release E'],
        'probability': 0.9986388006307096,
        'probs': {
            # [...]
            'Leucocytes': 0.0013611993692904283,
            'Return ER': 0.9986388006307096,
            # [...]
        }
    },
    'latency': 0.06985664367675781,
    'index': 15214
}
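
To consume these records programmatically rather than printing them, a plain function over the structure shown above suffices (this sketch assumes only the fields visible in the sample output):

# Extract the predicted activity, its probability, and whether
# the prediction matched the activity that actually occurred.
def summarize(record: dict) -> tuple[str, float, bool]:
    pred = record["prediction"]
    return (
        pred["activity"],
        pred["probability"],
        pred["activity"] == record["activity"],
    )

# For the sample record above: ('Return ER', 0.9986388006307096, True)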
