A C implemented Suffix Tree package
Project description
Introduction
This is a suffix tree data structure library implemented in C++ with Python bindings.
Besides constructing the tree, this library can also construct a query tree to speed up string queries.
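To make the idea concrete, here is a minimal pure-Python sketch of a suffix structure that answers "which inserted strings contain this substring?". It is an uncompressed suffix trie written for illustration only and is not how this library is implemented; the names NaiveSuffixTrie, add_string and find_indices are hypothetical and are not part of the csuffixtree API.

```python
class _Node:
    """One trie node: outgoing edges plus the strings that pass through it."""
    __slots__ = ("children", "ids")

    def __init__(self):
        self.children = {}   # char -> _Node
        self.ids = set()     # indices of strings containing the path to this node


class NaiveSuffixTrie:
    """Illustrative, uncompressed suffix trie (quadratic space per string)."""

    def __init__(self, strings=()):
        self.root = _Node()
        self.strings = []    # inserted strings, in insertion order
        for s in strings:
            self.add_string(s)

    def add_string(self, s):
        idx = len(self.strings)
        self.strings.append(s)
        # Insert every suffix of s; each node on the way records idx.
        for start in range(len(s)):
            node = self.root
            for ch in s[start:]:
                node = node.children.setdefault(ch, _Node())
                node.ids.add(idx)

    def find_indices(self, pattern):
        """Return sorted indices of the strings that contain pattern."""
        if not pattern:
            return list(range(len(self.strings)))
        node = self.root
        for ch in pattern:
            node = node.children.get(ch)
            if node is None:
                return []
        return sorted(node.ids)


trie = NaiveSuffixTrie(["abc", "123", "321"])
trie.add_string("xyz")
trie.add_string("abcd")
print(trie.find_indices("1"))   # [1, 2]
print(trie.find_indices("a"))   # [0, 4]
```

A real suffix tree compresses chains of single-child nodes into edges and can be built in linear time, which is what makes a C++ implementation worthwhile for large inputs.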
Install and Build
Install from PyPI
Run pip install csuffixtree
Build
You can build this on either Linux or Windows.
Install Python Package
- On Windows, build the SuffixTreePyBinding solution.
- On Linux, go to the linux directory and run make python.
- Under the SuffixTreePy directory, run python setup.py install.
How to use
You can construct a suffix tree in the following way.
from suffixtree import *
tree = SuffixTree(True,["abc","123","321"])
tree.addStrings(["xyz","abcd"])
print(tree.findString("1"))
print(tree.findString("a"))
Once you have finished inserting strings into a suffix tree and want queries to be very fast, you can do:
# convert the tree to a query tree; this releases part of the memory
qtree = tree.createQueryTree()
qtree.cacheNodes() # take some time to cache intermediate nodes
qtree.findString("a")
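The source does not say what cacheNodes does internally. One plausible reading, sketched below in pure Python, is that each intermediate node stores the precomputed set of matching string indices, so a query no longer has to walk the subtree under the matched node. Everything here (Node, build, collect, cache_nodes, find) is a hypothetical illustration under that assumption, not the library's code or API.

```python
class Node:
    __slots__ = ("children", "end_ids", "cached")

    def __init__(self):
        self.children = {}     # char -> Node
        self.end_ids = set()   # strings with a suffix ending exactly here
        self.cached = None     # subtree result, filled in by cache_nodes()


def build(strings):
    """Build an uncompressed suffix trie that stores ids only at suffix ends."""
    root = Node()
    for idx, s in enumerate(strings):
        for start in range(len(s)):
            node = root
            for ch in s[start:]:
                node = node.children.setdefault(ch, Node())
            node.end_ids.add(idx)
    return root


def collect(node):
    """Slow path: gather ids from the whole subtree with an iterative DFS."""
    found, stack = set(), [node]
    while stack:
        cur = stack.pop()
        found |= cur.end_ids
        stack.extend(cur.children.values())
    return found


def cache_nodes(node):
    """Post-order pass that stores each node's subtree id set on the node.

    Recursive for brevity, so depth is bounded by the longest string.
    """
    ids = set(node.end_ids)
    for child in node.children.values():
        ids |= cache_nodes(child)
    node.cached = frozenset(ids)
    return node.cached


def find(root, pattern):
    """Return sorted indices of strings containing pattern."""
    node = root
    for ch in pattern:
        node = node.children.get(ch)
        if node is None:
            return []
    # Use the cached answer when present, otherwise fall back to traversal.
    ids = node.cached if node.cached is not None else collect(node)
    return sorted(ids)


root = build(["abc", "abcd", "123", "321"])
print(find(root, "a"))   # [0, 1], computed by walking the subtree
cache_nodes(root)        # one-off cost, paid once after all inserts
print(find(root, "a"))   # [0, 1], now read directly from the node
```

The trade-off in this sketch is the usual one: caching costs time and memory up front, and it only makes sense once no more strings will be added.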
For simplicity, you can also write:
# do not preserve string
qtree = SuffixQueryTree(False,["abc","abcd","123","321"])
idx = qtree.findStringIdx("abc") # you cannot use findString now
print(idx)
Serialization
SuffixQueryTree can be serialized to, and deserialized from, a file or a bytes object.