A general purpose indexer written in Python.
Project description
A general purpose indexer written in Python. Licensed under the MIT license.
Features
The indexr.buildr
package is capable of constructing an inverted index.
The indexr.utils
package contains utilities, such as a tokenization method for converting a text to tokens.
Setup
This package can be installed using pip:
pip install indexr
Examples
The following code sample can be found in the demo directory.
# Get a demo file system
files = get_demo_filesystem()
# Build the index
index = buildr(files, INDEX_PATH, force_rebuild=True, indexer=BSB(show_progress=True))
# Find all files which contain the word "movie" and also the corresponding frequency
print('All found occurrences of "movie":')
print(index.find('movie', frequencies=True))
Documentation
Credits
Tools used in rendering this package:
History
0.1.10 (2015-12-05)
Add submodules.
0.1.9 (2015-12-05)
Add submodules.
0.1.8 (2015-12-05)
Add submodules.
0.1.7 (2015-12-05)
Add submodules.
0.1.6 (2015-12-05)
Add submodules.
0.1.5 (2015-12-05)
Bugfix.
0.1.4 (2015-12-05)
Bugfix.
0.1.3 (2015-12-05)
Bugfix.
0.1.2 (2015-12-05)
Moved the BSB algorithm.
0.1.1 (2015-12-05)
Block sort-based indexing implemented.
0.1.0 (2015-12-04)
First release on PyPI.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for indexr-0.1.10-py2.py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | ccdd3765c6ed0eaefdba8b70911eba588141d5b5b1db4ffa664b9bd33a97f736 |
|
MD5 | bddadc8893cfba6095b6b6feb17e553e |
|
BLAKE2b-256 | b4e5294be344f24e48b2697a512f84be5a3fb4454ba2de1bd9d31807839d2c25 |