Fast subsentence searching
Project description
lacbd
lacbd is a Python library written in Rust that implements the Aho Corasick algorithm for fast subsentence matching of many keywords against one string.
You can find the actual Rust library as nitros12/like-aho-corasick-but-different.
Features
- Supports arbitrary values associated with each keyword
- Operates on Unicode word bounds, rather than naïve substring matching
- Case insensitive
- 10× faster than an equivalent regex
None of the existing python libraries fit my needs.
License
This library is AGPLv3+ licensed. That may seem like an odd choice for a library. However, doing so ensures that users of this code must make their application open source, even if run as a service (such as in a Discord bot). If you want to use this to make proprietary software, look somewhere else.
Copyright © 2019 Ben Simms and Ben Mintz
This program is free software: you can redistribute it and/or modify it under the terms of the GNU Affero General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version.
This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Affero General Public License for more details.
You should have received a copy of the GNU Affero General Public License along with this program. If not, see https://www.gnu.org/licenses/.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distributions
Hashes for lacbd-0.1.4-cp37-cp37m-manylinux1_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 53f18ec9cd46c9a1867e6462c5775b955da66c1cc951820fcaad5c286803a31a |
|
MD5 | 912373b46e698ceed2d430a67f8c3223 |
|
BLAKE2b-256 | a47b4ba175401b2efefbe94fcdfe1b61bad311940aa7405f3d0b916b328f5b08 |
Hashes for lacbd-0.1.4-cp36-cp36m-manylinux1_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 08a526c942dd8171ac94bcf94d3a0bd6a2d13cb8795ec8339e61c56c4ebd203c |
|
MD5 | 97a14414ab3c2236a78e899be4a503e1 |
|
BLAKE2b-256 | 2eed7b860cede280ae135e154bbce29d6470cc4788aac1422f86dbf37f93d65c |