Skip to main content
Join the official 2019 Python Developers SurveyStart the survey!

Fast subsentence searching

Project description

lacbd

lacbd is a Python library written in Rust that implements the Aho Corasick algorithm for fast subsentence matching of many keywords against one string.

You can find the actual Rust library as nitros12/like-aho-corasick-but-different.

Features

  • Supports arbitrary values associated with each keyword
  • Operates on Unicode word bounds, rather than naïve substring matching
  • Case insensitive
  • 10× faster than an equivalent regex

None of the existing python libraries fit my needs.

License

This library is AGPLv3+ licensed. That may seem like an odd choice for a library. However, doing so ensures that users of this code must make their application open source, even if run as a service (such as in a Discord bot). If you want to use this to make proprietary software, look somewhere else.

Copyright © 2019 Ben Simms and Ben Mintz

This program is free software: you can redistribute it and/or modify it under the terms of the GNU Affero General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version.

This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Affero General Public License for more details.

You should have received a copy of the GNU Affero General Public License along with this program. If not, see https://www.gnu.org/licenses/.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Files for lacbd, version 0.2.0
Filename, size File type Python version Upload date Hashes
Filename, size lacbd-0.2.0-cp36-cp36m-manylinux1_x86_64.whl (201.9 kB) File type Wheel Python version cp36 Upload date Hashes View hashes
Filename, size lacbd-0.2.0-cp37-cp37m-macosx_10_13_x86_64.whl (98.7 kB) File type Wheel Python version cp37 Upload date Hashes View hashes
Filename, size lacbd-0.2.0-cp37-cp37m-manylinux1_x86_64.whl (201.9 kB) File type Wheel Python version cp37 Upload date Hashes View hashes
Filename, size lacbd-0.2.0-cp38-cp38-manylinux1_x86_64.whl (107.3 kB) File type Wheel Python version cp38 Upload date Hashes View hashes
Filename, size lacbd-0.2.0-cp38-cp38-manylinux2010_x86_64.whl (107.3 kB) File type Wheel Python version cp38 Upload date Hashes View hashes
Filename, size lacbd-0.2.0.tar.gz (16.1 kB) File type Source Python version None Upload date Hashes View hashes

Supported by

Elastic Elastic Search Pingdom Pingdom Monitoring Google Google BigQuery Sentry Sentry Error logging AWS AWS Cloud computing DataDog DataDog Monitoring Fastly Fastly CDN SignalFx SignalFx Supporter DigiCert DigiCert EV certificate StatusPage StatusPage Status page