Skip to main content

A module for decomposing strings in the latin alphabet to the ascii subset.

Project description

Decompose

This module provides a method for decomposing strings containing latin diacretics, ligatures, and other unusual latin letters into their appropriate ascii counterparts generally for the purpose of data for submission into systems requiring the usage of the ascii subset.

Example

from decompose import decompose

decompose('Málaga') # returns 'Malaga'

Comments

decompose.decompose() depends upon unicodedata.normalize() for most of its heavy lifting. However, it also contains a lookup table (decompose.charmap) for those unicode characters that are not necessarily handled by normalize(). Any letter not handled by either normalize or charmap will be silently dropped.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

decompose-mrobison-0.0.2.tar.gz (1.9 kB view details)

Uploaded Source

File details

Details for the file decompose-mrobison-0.0.2.tar.gz.

File metadata

  • Download URL: decompose-mrobison-0.0.2.tar.gz
  • Upload date:
  • Size: 1.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/2.0.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.6.0 requests-toolbelt/0.9.1 tqdm/4.47.0 CPython/3.8.5

File hashes

Hashes for decompose-mrobison-0.0.2.tar.gz
Algorithm Hash digest
SHA256 5f718e48d07ef24ff933c12fd36ebb9c1ae96fd795657b2220e088d7fb734ba4
MD5 1bd523d67ebb5af00c8c79dccced6146
BLAKE2b-256 ad175454e84fbbf3ab1269886c4fd322b90e2c74bce9dfbe7ad132b1c4fc108d

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page