GSP Python implementation

GSP-python

A Python implementation of the Generalized Sequential Patterns (GSP) algorithm for sequential pattern mining.

This project implements the Generalized Sequential Patterns (GSP) algorithm to find frequent sequences within a given dataset. This implementation includes parameters for the mingap, maxgap, and maxspan time constraints.
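To make the three time constraints concrete, here is a minimal sketch of how they are commonly defined in the GSP literature: the gap between consecutive matched elements must be greater than mingap and at most maxgap, and the time between the first and last matched element must be at most maxspan. The function name `contains`, the `(timestamp, itemset)` event layout, and the exact boundary conventions are assumptions made for illustration; they are not taken from this package's source, so its behaviour may differ in detail.

```python
from typing import FrozenSet, Optional, Sequence, Tuple

# Hypothetical layout: one event is (timestamp, items seen at that time).
Event = Tuple[int, FrozenSet[str]]


def contains(
    data_seq: Sequence[Event],
    pattern: Sequence[FrozenSet[str]],
    mingap: int = 0,
    maxgap: Optional[int] = None,
    maxspan: Optional[int] = None,
) -> bool:
    """Check whether `pattern` occurs in `data_seq` under the time constraints.

    Assumes `data_seq` is sorted by strictly increasing timestamp.
    """

    def search(p_idx: int, start: int, first_t: Optional[int], prev_t: Optional[int]) -> bool:
        if p_idx == len(pattern):
            return True  # every pattern element has been matched
        for i in range(start, len(data_seq)):
            t, items = data_seq[i]
            if prev_t is not None:
                gap = t - prev_t
                if gap <= mingap:
                    continue  # too close to the previous matched element
                if maxgap is not None and gap > maxgap:
                    break  # later events are even further away
            if maxspan is not None and first_t is not None and t - first_t > maxspan:
                break  # the whole occurrence would be too long
            if pattern[p_idx] <= items:  # subset test on itemsets
                new_first = t if first_t is None else first_t
                if search(p_idx + 1, i + 1, new_first, t):
                    return True
        return False

    return search(0, 0, None, None)


events = [
    (1, frozenset({"a"})),
    (2, frozenset({"b"})),
    (5, frozenset({"c"})),
    (6, frozenset({"a", "b"})),
]
a_then_b = [frozenset({"a"}), frozenset({"b"})]
a_then_c = [frozenset({"a"}), frozenset({"c"})]

unconstrained = contains(events, a_then_b)
only_mingap = contains(events, a_then_b, mingap=1)
mingap_and_maxgap = contains(events, a_then_b, mingap=1, maxgap=2)
short_span = contains(events, a_then_c, maxspan=3)
long_span = contains(events, a_then_c, maxspan=4)
```

In this toy sequence, `a` followed by `b` is found without constraints, but setting both mingap=1 and maxgap=2 rules out every occurrence: the only `b` after the first `a` is either too close (time 2) or too far away (time 6).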

The project also features a simple dataset generator.

Installation

python3 -m pip install gsp-python

Usage

To run the GSP algorithm:

python3 -m gsp-python GSP infile outfile minsup -t maxgap mingap maxspan

To generate a random dataset:

python3 -m gsp-python DatasetGen outfile size nevents maxevents avgelems

For more information about arguments and additional optional arguments, type:

python3 -m gsp-python GSP -h

or

python3 -m gsp-python DatasetGen -h

Alternatively, the modules can be manually imported and used in a script. An example is given below:

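from gsp_py.gsp import GSP, load_ds

# load_ds appears to return three values; only the dataset is used here.
dataset, _, _ = load_ds("path/to/file.txt")

# minsup is the minimum support threshold; the rest are the time constraints.
algo_gsp = GSP(dataset, minsup=0.3, mingap=1, maxgap=2, maxspan=5)
output = algo_gsp.run_gsp()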
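The value minsup=0.3 above suggests a relative threshold, i.e. the fraction of sequences that must contain a pattern, though that reading is an assumption (the -h output is the authority). The sketch below shows how relative support could be computed for one candidate pattern when no time constraints are applied. The helpers `is_subsequence` and `relative_support` and the list-of-sets data layout are hypothetical and are not part of gsp_py.

```python
from typing import List, Sequence, Set


def is_subsequence(pattern: Sequence[Set[str]], sequence: Sequence[Set[str]]) -> bool:
    """Greedy check: each pattern element must be a subset of a later sequence element."""
    pos = 0
    for element in pattern:
        # Advance to the next sequence element that contains this pattern element.
        while pos < len(sequence) and not element <= sequence[pos]:
            pos += 1
        if pos == len(sequence):
            return False
        pos += 1  # the next pattern element must match strictly later
    return True


def relative_support(pattern: Sequence[Set[str]], dataset: List[Sequence[Set[str]]]) -> float:
    """Fraction of sequences in `dataset` that contain `pattern`."""
    if not dataset:
        return 0.0
    hits = sum(is_subsequence(pattern, seq) for seq in dataset)
    return hits / len(dataset)


toy_dataset = [
    [{"a"}, {"b"}],
    [{"a", "b"}],          # a and b occur together, not one after the other
    [{"b"}, {"a"}],        # wrong order
    [{"a"}, {"c"}, {"b"}],
]
candidate = [{"a"}, {"b"}]

support = relative_support(candidate, toy_dataset)
is_frequent = support >= 0.3
```

Two of the four toy sequences contain `a` followed by `b`, so the candidate's relative support is 0.5 and it would pass a 0.3 threshold under this interpretation.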

Download files

Source Distribution: gsp-python-0.0.7.tar.gz (9.8 kB)

Built Distribution: gsp_python-0.0.7-py3-none-any.whl (10.2 kB, Python 3)
