GSP Python implementation
Project description
GSP-python
A Python implementation of Generalized Sequential Patterns (GSP) algorithm for sequential pattern mining
This project implements the Generalized Sequential Patterns (GSP) algorithm to find frequent sequences within a given dataset. This implementation includes parameters for the mingap, maxgap, and maxspan time constraints.
The project also features a simple dataset generator.
Installation
python3 -m pip install gsp_python
Usage
To run the gsp algorithm:
python3 -m gsp_pythonn GSP infile outfile minsup -t maxgap mingap maxspan
To generate a random dataset:
python3 -m gsp_python DatasetGen outfile size nevents maxevents avgelems
For more information about arguments and additional optional arguments, type:
python3 -m gsp_python GSP -h
or
python3 -m gsp_python DatasetGen -h
Alternatively, the modules can be manually imported and used in a script. An example is given below:
from gsp_python.gsp import load_ds
from gsp_python.gsp import GSP
dataset, {}, {} = load_ds("path/to/file.txt")
algo_gsp = GSP(dataset, minsup=0.3, mingap=1, maxgap=2, maxspan=5)
output = algo_gsp.run_gsp()
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
gsp_python-0.0.8.tar.gz
(9.8 kB
view hashes)
Built Distribution
gsp_python-0.0.8-py3-none-any.whl
(10.2 kB
view hashes)
Close
Hashes for gsp_python-0.0.8-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | cae3e1514de9ea14499e12f356e7e8ce7ddb279518ad481cd7a2ff723dccd33e |
|
MD5 | a4d38087939b2fca8af4ba14b636964d |
|
BLAKE2b-256 | eb498c0062bd7b38e5991313fc935cf94b72c23350711302e74c9129b1522b60 |