Skip to main content

Data preparation for object-centric process mining.

Project description

PyStack't (pystackt)

PyStack't (pystackt) is a Python package based on Stack't that supports data preparation for object-centric process mining.

While Stack't is a proof-of-concept on how to embed the data transformations needed for object-centric process mining into an (existing) data stack, it's not very user-friendly for ad-hoc use. PyStack't aims to fill that gap by exposing some functionality in a Python package.

📦 Installation

You can install pystackt using pip:

pip install pystackt

⛏️🐙 Extracting object-centric event logs from Github

📝 Example

from pystackt import *

get_github_log(
    GITHUB_ACCESS_TOKEN="insert_your_github_access_token_here",
    repo_owner="LienBosmans",
    repo_name="stack-t",
    max_issues=None, # None returns all issues, can also be set to an integer to extract a limited data set
    quack_db="./stackt.duckdb"
)

🔑 Generating a GitHub Access Token

To generate a GitHub access token, go to GitHub Developer Settings, click "Generate new token (classic)", and proceed without selecting any scopes (leave all checkboxes unchecked). Copy the token and store it securely, as it won’t be shown again.

🔍 Viewing Data

This function creates a DuckDB database file. To explore the data, you'll need a database manager. You can follow this DuckDB guide to download and install DBeaver for easy access.

📜 Data Usage Policies

Please ensure that you use the extracted data in compliance with GitHub policies, including Information Usage Restrictions and API Terms.

📤 Export to OCEL 2.0

📝 Example

from pystackt import *

export_to_ocel2(
    quack_db="./stackt.duckdb",
    schema_in="main",
    schema_out="ocel2",
    sqlite_db="./ocel2_stackt.sqlite"
)

ℹ️ More information

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pystackt-0.0.2.tar.gz (23.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pystackt-0.0.2-py3-none-any.whl (24.7 kB view details)

Uploaded Python 3

File details

Details for the file pystackt-0.0.2.tar.gz.

File metadata

  • Download URL: pystackt-0.0.2.tar.gz
  • Upload date:
  • Size: 23.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.0

File hashes

Hashes for pystackt-0.0.2.tar.gz
Algorithm Hash digest
SHA256 960d61b7730b3392a689fb61f388a04273f278c45ea1d48218c778abb19e2a82
MD5 f0d1572c49f159d3966dd63690412289
BLAKE2b-256 113cab172f7dd6680133b13350c5b11ad60c38fd21ce278ad4f720336d1d7ae5

See more details on using hashes here.

File details

Details for the file pystackt-0.0.2-py3-none-any.whl.

File metadata

  • Download URL: pystackt-0.0.2-py3-none-any.whl
  • Upload date:
  • Size: 24.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.0

File hashes

Hashes for pystackt-0.0.2-py3-none-any.whl
Algorithm Hash digest
SHA256 6b183a7c375c27e3071919e707d7f866cbbed877f8c80a48cd0246f087dcbe4e
MD5 be031cb9d79e44d4547a243359646f5b
BLAKE2b-256 6782b4dff5090233b697eb1b2ba0528a0afbe6032d14d6d7a8e353fedb3d3cb6

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page