Skip to main content

A C implemented Suffix Tree package

Project description

Introduction

This is a Suffix Tree data structure lib implemented in C++, wrapped with python.

Expect for contructing the tree, this lib also provides to construct the query tree for speeding up querying strings.

Install and Build

Install from Pypi

run pip install csuffixtree

Build

You can build this either on linux or windows.

Install Python Package

  1. Build SuffixTreePyBinding solution in Windows
  2. On linux you should go to linux directory, and run make python
  3. under SuffixTreePy directory. run python setup.py install

How to use

You can construct a suffix tree in the following way.

from suffixtree import *
tree = SuffixTree(True,["abc","123","321"])
tree.addStrings(["xyz","abcd"])
print(tree.findString("1"))
print(tree.findString("a"))

If you finished inserting strings to a suffix tree, and you want the querying be very fast you can do:

# convert tree to query tree, this release a part of memory
qtree = tree.createQueryTree()
qtree.cacheNodes() # take some time to cache intermediate nodes
qtree.findString("a")

For simplicity you can also write

# do not preserve string
qtree = SuffixQueryTree(False,["abc","abcd","123","321"])
idx = qtree.findStringIdx("abc") # you cannot use findString now
print(idx)

Serialization

SuffixQueryTree can be serialized and deserialized to/from a file a bytes object. For example:

# do not preserve string
qtree = SuffixQueryTree(False,["abc","abcd","123","321"])
idx = qtree.findStringIdx("abc") # you cannot use findString now
print(idx)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

csuffixtree-0.2.1.tar.gz (5.0 kB view details)

Uploaded Source

Built Distribution

csuffixtree-0.2.1-py3-none-any.whl (546.3 kB view details)

Uploaded Python 3

File details

Details for the file csuffixtree-0.2.1.tar.gz.

File metadata

  • Download URL: csuffixtree-0.2.1.tar.gz
  • Upload date:
  • Size: 5.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.19.1 setuptools/40.2.0 requests-toolbelt/0.8.0 tqdm/4.26.0 CPython/3.7.0

File hashes

Hashes for csuffixtree-0.2.1.tar.gz
Algorithm Hash digest
SHA256 dca05c350921e6a2adb2fc6351f23b36d66d53bd931b630a831ae022bc2ea0ce
MD5 4f6cf44224ffecabf23e7aa480765c40
BLAKE2b-256 fa37eb01f6143466d0607e588a267b58d3a8a1b4764b226a279dc6f6297aad09

See more details on using hashes here.

File details

Details for the file csuffixtree-0.2.1-py3-none-any.whl.

File metadata

  • Download URL: csuffixtree-0.2.1-py3-none-any.whl
  • Upload date:
  • Size: 546.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.19.1 setuptools/40.2.0 requests-toolbelt/0.8.0 tqdm/4.26.0 CPython/3.7.0

File hashes

Hashes for csuffixtree-0.2.1-py3-none-any.whl
Algorithm Hash digest
SHA256 c0ce63b404f9c05be4ac18193de917c4cbb5d873014820c15eb4a86bb5d4f55f
MD5 b24404371e99489666d4f858ea5ea655
BLAKE2b-256 c550f5cbfc18898d571866a9372fb2eb3bd9d69fac320217284838417ddf0b43

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page