Data preparation for object-centric process mining.
Project description
PyStack't (pystackt)
PyStack't (pystackt) is a Python package based on Stack't that supports data preparation for object-centric process mining.
While Stack't is a proof-of-concept on how to embed the data transformations needed for object-centric process mining into an (existing) data stack, it's not very user-friendly for ad-hoc use. PyStack't aims to fill that gap by exposing some functionality in a Python package.
📦 Installation
You can install pystackt using pip:
pip install pystackt
⛏️🐙 Extracting object-centric event logs from Github
📝 Example
from pystackt import *
get_github_log(
GITHUB_ACCESS_TOKEN="insert_your_github_access_token_here",
repo_owner="LienBosmans",
repo_name="stack-t",
max_issues=None, # None returns all issues, can also be set to an integer to extract a limited data set
quack_db="./stackt.duckdb"
)
🔑 Generating a GitHub Access Token
To generate a GitHub access token, go to GitHub Developer Settings, click "Generate new token (classic)", and proceed without selecting any scopes (leave all checkboxes unchecked). Copy the token and store it securely, as it won’t be shown again.
🔍 Viewing Data
This function creates a DuckDB database file. To explore the data, you'll need a database manager. You can follow this DuckDB guide to download and install DBeaver for easy access.
📜 Data Usage Policies
Please ensure that you use the extracted data in compliance with GitHub policies, including Information Usage Restrictions and API Terms.
📤 Export to OCEL 2.0
📝 Example
from pystackt import *
export_to_ocel2(
quack_db="./stackt.duckdb",
schema_in="main",
schema_out="ocel2",
sqlite_db="./ocel2_stackt.sqlite"
)
ℹ️ More information
- The OCEL 2.0 standard is defined in OCEL (Object-Centric Event Log) 2.0 Specification.
- To explore event logs in the OCEL 2.0 format, you can use Ocelot.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file pystackt-0.0.2.tar.gz.
File metadata
- Download URL: pystackt-0.0.2.tar.gz
- Upload date:
- Size: 23.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.12.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
960d61b7730b3392a689fb61f388a04273f278c45ea1d48218c778abb19e2a82
|
|
| MD5 |
f0d1572c49f159d3966dd63690412289
|
|
| BLAKE2b-256 |
113cab172f7dd6680133b13350c5b11ad60c38fd21ce278ad4f720336d1d7ae5
|
File details
Details for the file pystackt-0.0.2-py3-none-any.whl.
File metadata
- Download URL: pystackt-0.0.2-py3-none-any.whl
- Upload date:
- Size: 24.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.12.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
6b183a7c375c27e3071919e707d7f866cbbed877f8c80a48cd0246f087dcbe4e
|
|
| MD5 |
be031cb9d79e44d4547a243359646f5b
|
|
| BLAKE2b-256 |
6782b4dff5090233b697eb1b2ba0528a0afbe6032d14d6d7a8e353fedb3d3cb6
|