Suffix Array for all-substring search

These details have not been verified by PyPI

Project links

Homepage

Development Status
- 4 - Beta
License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
Topic
- Software Development :: Libraries

Project description

sufarray: implementation of suffix array

It uses prefix doubling for calculating the suffix array, which is O(n log n) time. When searching for all occurrences of a string, it takes O(log n) time. Further occurrences takes constant time.

At present the implementation is Python only. This has the drawback that the speed is not comparable with those of other languages.

Usage

To construct a suffix array of string s:

import sufarray
sarray = sufarray.SufArray(s)

After that, you can get the actual array with:

sarray.get_array()

This produces a permutation a of range(len(s)), where the values a[0], a[1], etc. are such that the suffixes s[a[i]:] is sorted.

Other than that, you can find the positions where a string n appears within s using:

sarray.find_all(n)

This is a generator function, yielding all the possible positions.

Implementations

Two implementations are provided. The simple version is called SufArrayBruteForce, and a version SufArrayPD is provided which calculates the suffix array using prefix doubling, and augment the result with a LCP (Longest common prefix) array for use with searching. The latter has better bounded performance guarantees, but with arrays of small text it is slower. The default is SufArrayPD. You can use the brute force version using sufarray.SufArrayBruteForce.

Project details

These details have not been verified by PyPI

Project links

Homepage

Development Status
- 4 - Beta
License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
Topic
- Software Development :: Libraries

Release history Release notifications | RSS feed

This version

0.1.1

Jan 28, 2020

0.1

Jan 28, 2020

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sufarray_kkto-0.1.1.tar.gz (5.3 kB view details)

Uploaded Jan 28, 2020 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

sufarray_kkto-0.1.1-py3-none-any.whl (5.8 kB view details)

Uploaded Jan 28, 2020 Python 3

File details

Details for the file sufarray_kkto-0.1.1.tar.gz.

File metadata

Download URL: sufarray_kkto-0.1.1.tar.gz
Upload date: Jan 28, 2020
Size: 5.3 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/1.15.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/45.1.0 requests-toolbelt/0.9.1 tqdm/4.42.0 CPython/3.5.3

File hashes

Hashes for sufarray_kkto-0.1.1.tar.gz
Algorithm	Hash digest
SHA256	`7705aa9b6ee6fc36bad4f8a725fa77d1468e85e5688f74574a447cee14ca80c0`
MD5	`1ead1f834230e0f90760ec9ad443acf0`
BLAKE2b-256	`a703f4393512c543a46d56b2c9e76f5ca177e82f68d8d05497fdd9533ef3efea`

See more details on using hashes here.

File details

Details for the file sufarray_kkto-0.1.1-py3-none-any.whl.

File metadata

Download URL: sufarray_kkto-0.1.1-py3-none-any.whl
Upload date: Jan 28, 2020
Size: 5.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/1.15.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/45.1.0 requests-toolbelt/0.9.1 tqdm/4.42.0 CPython/3.5.3

File hashes

Hashes for sufarray_kkto-0.1.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`07b689566ce3e0448318d21b27a97340aa2decf1a254bbae3805613283a7422f`
MD5	`9dd9b285ca7a91323cb371a6d3b6bfcf`
BLAKE2b-256	`dab72dfdbf65d52ce6ee0186df5ab358743321903d1b99c95d151e074f6f215f`

See more details on using hashes here.

sufarray-kkto 0.1.1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

sufarray: implementation of suffix array

Usage

Implementations

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes