Fast subsentence searching
Project description
# lacbd
lacbd is a Python library written in Rust that implements the Aho Corasick algorithm for fast subsentence matching of many keywords against one string.
You can find the actual Rust library as [nitros12/like-aho-corasick-but-different](https://github.com/nitros12/like-aho-corasick-but-different).
## Features
Supports arbitrary values associated with each keyword
Operates on Unicode word bounds, rather than naïve substring matching
Case insensitive
10× faster than an equivalent regex
None of the existing python libraries fit my needs.
## License
This library is AGPLv3+ licensed. That may seem like an odd choice for a library. However, doing so ensures that users of this code must make their application open source, even if run as a service (such as in a Discord bot). If you want to use this to make proprietary software, look somewhere else.
Copyright © 2019 Ben Simms and Ben Mintz
This program is free software: you can redistribute it and/or modify it under the terms of the GNU Affero General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version.
This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Affero General Public License for more details.
You should have received a copy of the GNU Affero General Public License along with this program. If not, see <https://www.gnu.org/licenses/>.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distributions
Hashes for lacbd-0.1.6-cp37-cp37m-win_amd64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 18f1ed25a63cb8f730db31bfb9af489ab583eeafdcb3d6e0b4701a9ed6e98df0 |
|
MD5 | 09298ef8914dad24bff328a4fa7e3043 |
|
BLAKE2b-256 | cec088138d85b95bd22452c4a591d152504215890fc2a2b8373aec936fe50c63 |
Hashes for lacbd-0.1.6-cp37-cp37m-manylinux1_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 26083a761c16830f3d66f5811367a8d57e15302ff144daccee88163999045ab2 |
|
MD5 | 9e78faca6a3fad83786df835004e5a3e |
|
BLAKE2b-256 | 8c02873e86baf5116483a4b855ae6d3db7c71e3a6a18d2f09e4edea7790492f8 |
Hashes for lacbd-0.1.6-cp36-cp36m-win_amd64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 8b3d87af957b72c308bae43a7ea2d857ceb82e9c054f65beafdd1e2f39513c6e |
|
MD5 | e297f2856266661fa6ec491ea818b0f8 |
|
BLAKE2b-256 | b46d3dc1e8725c2f7508a57f407ceb660761b5e5c41380ed693db543e15b7b1f |
Hashes for lacbd-0.1.6-cp36-cp36m-manylinux1_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 3a61348dd1d88cfd1bc3dd99db34c620c2768d45810811a6374815e89bf02ea5 |
|
MD5 | c7041326566936e74fb55f918393adaa |
|
BLAKE2b-256 | 925361eacd92c615bbc9a3fc67b67ff6cb4fe26ab25eb0780ea92b448c6bc845 |