Skip to main content

package for indexing text datasets using prime number factorisation for fast word frequency analysis

Project description

primetext

python package for indexing text datasets for fast word frequency analysis

Usage

from primetext import primetext

data = ["black cat on mat",
"black hat for you",
"cat sat on you"]

# initiate primetext
pt = primetext.primetext()

# indexing data
pt.index(data)

# finding words
recordsWithCat = pt.find(['cat'])
# returns boolean vector : [True,False,True]

recordsWithCatAndSat = pt.find(['cat','sat'])
# returns boolean vector : [False,False,True]

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

primetext-0.2.2.tar.gz (1.7 kB view hashes)

Uploaded Source

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page