GSP Python implementation
Project description
GSP-python
A Python implementation of Generalized Sequential Patterns (GSP) algorithm for sequential pattern mining
This project implements the Generalized Sequential Patterns (GSP) algorithm to find frequent sequences within a given dataset. This implementation includes parameters for the mingap, maxgap, and maxspan time constraints.
The project also features a simple dataset generator.
Installation
python3 -m pip install gsp-python
Usage
To run the gsp algorithm:
python3 -m gsp-python GSP infile outfile minsup -t maxgap mingap maxspan
To generate a random dataset:
python3 -m gsp-python DatasetGen outfile size nevents maxevents avgelems
For more information about arguments and additional optional arguments, type:
python3 -m gsp-python GSP -h
or
python3 -m gsp-python DatasetGen -h
Alternatively, the modules can be manually imported and used in a script. An example is given below:
from gsp_py.gsp import load_ds
from gsp_py.gsp import GSP
dataset, {}, {} = load_ds("path/to/file.txt")
algo_gsp = GSP(dataset, minsup=0.3, mingap=1, maxgap=2, maxspan=5)
output = algo_gsp.run_gsp()
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
gsp-python-0.0.7.tar.gz
(9.8 kB
view hashes)
Built Distribution
gsp_python-0.0.7-py3-none-any.whl
(10.2 kB
view hashes)
Close
Hashes for gsp_python-0.0.7-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 66e44c8cf1cc5cd611c8f6cb70ad0ef05b013e79fa1456e4244ca348c24ce11d |
|
MD5 | 3f6824be3d903f650ad79b68b59773d5 |
|
BLAKE2b-256 | af38ee1750b4cbb195cfd9305fe85ebdf28e4fdc884ec792f5999fdcfc2a22e8 |