GSP Python implementation
Project description
GSP-python
A Python implementation of Generalized Sequential Patterns (GSP) algorithm for sequential pattern mining
This project implements the Generalized Sequential Patterns (GSP) algorithm to find frequent sequences within a given dataset. This implementation includes parameters for the mingap, maxgap, and maxspan time constraints.
The project also features a simple dataset generator.
Installation
python3 -m pip install gsp-python
Usage
To run the gsp algorithm:
python3 -m gsp-python GSP infile outfile minsup -t maxgap mingap maxspan
To generate a random dataset:
python3 -m gsp-python DatasetGen outfile size nevents maxevents avgelems
For more information about arguments and additional optional arguments, type:
python3 -m gsp-python GSP -h
or
python3 -m gsp-python DatasetGen -h
Alternatively, the modules can be manually imported and used in a script. An example is given below:
from gsp_py.gsp import load_ds
from gsp_py.gsp import GSP
dataset, {}, {} = load_ds("path/to/file.txt")
algo_gsp = GSP(dataset, minsup=0.3, mingap=1, maxgap=2, maxspan=5)
output = algo_gsp.run_gsp()
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
gsp-python-0.0.6.tar.gz
(9.8 kB
view hashes)
Built Distribution
gsp_python-0.0.6-py3-none-any.whl
(10.2 kB
view hashes)
Close
Hashes for gsp_python-0.0.6-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 88b08e29eaa996370ec46c7af5401d3d0e51f8067495ab29302c3af9825ee0ac |
|
MD5 | 454871a6cb90d95c8707533310c15dd1 |
|
BLAKE2b-256 | fb41bd3fc4bf2a78f21545ec38050923f11d300864c71ab77dd5b96681339236 |