Utility to assist with string comparisons.
String Comparison
Text normalization for string comparison in Python.
The library maps each Unicode code point, keyed as an integer, to a corresponding ASCII replacement. Some characters, such as Hebrew niqqud and cantillation marks, are removed altogether, and certain punctuation is replaced with a space.
In addition, some characters, such as Unicode control characters, are removed prior to normalization.
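The mapping described above can be sketched with Python's built-in `str.translate`, which accepts a dictionary keyed by integer code points. This is only an illustration under stated assumptions: the table `SKETCH_TABLE` and the helpers `strip_control_chars` and `sketch_normalize` are hypothetical names, and the handful of entries shown here are not the table shipped with `string_comparison`.

```python
import unicodedata

# Hypothetical mapping table for illustration only; it is NOT the table
# shipped with string_comparison. Keys are integer code points, which is
# the form str.translate expects.
SKETCH_TABLE = {
    0x00E9: "e",    # LATIN SMALL LETTER E WITH ACUTE -> ASCII "e"
    0x00DF: "ss",   # LATIN SMALL LETTER SHARP S -> "ss"
    0x05B8: None,   # HEBREW POINT QAMATS (niqqud): removed
    0x0591: None,   # HEBREW ACCENT ETNAHTA (cantillation): removed
    0x2014: " ",    # EM DASH: replaced with a space
}


def strip_control_chars(text: str) -> str:
    """Drop characters in Unicode category Cc (control) before mapping."""
    return "".join(ch for ch in text if unicodedata.category(ch) != "Cc")


def sketch_normalize(text: str) -> str:
    """Remove control characters, then apply the code point table."""
    return strip_control_chars(text).translate(SKETCH_TABLE)
```

A value of `None` in the table deletes the character, while a string value replaces it, so removal and substitution are handled by the same lookup. Characters absent from the table pass through unchanged in this sketch.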
Requirements
This project requires Python 3.7 or later.
Installation
To install with pip:
pip install string_comparison
Usage
import string_comparison
normalized = string_comparison.normalize("é")
assert normalized == "e"
Hashes for string_comparison-1.1.0-py3-none-any.whl
Algorithm | Hash digest
---|---
SHA256 | 46cda70eaafcf1bdc43c081f0eddcd2d2985efe137e7f71e530fc0c6440b2401
MD5 | 293b5ce21c430cd0fb4d12ddd28121f9
BLAKE2b-256 | bb234338a943d6de7bc7c2dde669c08803e214eeeec12de9ca6a32b4592fe436