Skip to main content

Finds the best matching string in a larger string.

Project description

Find Best String

Description

Finds the best matching string in a larger string. Returns either the first match or all matches, and their match value.

Installation

pip install find-best-string

or

pipenv install find-best-string

Arguments

  1. query - String of text to find matches of in corpus.
  2. corpus - String of text to scan for matches of query.
  3. step - Step size of first match-value scan through corpus. Can be thought of as a sort of "scan resolution". Should not exceed length of query.
  4. flex - Max left/right substring position adjustment value. Should not exceed length of query / 2.
  5. case_sensitive - If false, converts corpus and query to lowercase before fiding matches.

Usage

from find_best_string import find_best_string

corpus = "Lorem Ipsum is simply dummy text of the printing and typesetting industry. Lorem Ipsum has been the industry's standard dummy text ever since the 1500s, when an unknown printer took a galley of type and scrambled it to make a type specimen book. It has survived not only five centuries, but also the leap into electronic typesetting, remaining essentially unchanged. It was popularised in the 1960s with the release of Letraset sheets containing Lorem Ipsum passages, and more recently with desktop publishing software like Aldus PageMaker including versions of Lorem Ipsum."
query = "Ipsum"

first_match = find_best_string(query, corpus)

Project details


Release history Release notifications

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Filename, size & hash SHA256 hash help File type Python version Upload date
find_best_string-0.1.0-py3-none-any.whl (3.7 kB) Copy SHA256 hash SHA256 Wheel py3 Sep 11, 2018
find_best_string-0.1.0.tar.gz (13.1 kB) Copy SHA256 hash SHA256 Source None Sep 11, 2018

Supported by

Elastic Elastic Search Pingdom Pingdom Monitoring Google Google BigQuery Sentry Sentry Error logging AWS AWS Cloud computing DataDog DataDog Monitoring Fastly Fastly CDN SignalFx SignalFx Supporter DigiCert DigiCert EV certificate StatusPage StatusPage Status page