Skip to main content

Lightweight text processing utilities for normalization, transliteration, and deduplication

Project description

TextLoom

Lint Python Version License: MIT

Lightweight text processing utilities for Python.

TextLoom provides a small collection of reusable tools for:

  • Transliteration
  • Text normalization
  • Identifier generation
  • Filename normalization
  • Name deduplication

The library was extracted from larger data-processing projects to provide a focused and dependency-free toolkit for text handling.

Installation

pip install textloom

Usage

import textloom as tl

Transliteration

tl.transliterate("Привет Мир")
# "Privet Mir"

Text normalization

tl.normalize_text("  Привет   Мир  ")
# "privet mir"

Identifier normalization

tl.normalize_identifier("Цена товара")
# "tsena_tovara"

Filename normalization

tl.normalize_filename("Мой файл.xlsx")
# "moy-fayl.xlsx"

Deduplication

tl.deduplicate_names(["name", "name", "name"])
# ["name", "name_2", "name_3"]

Design Goals

  • Zero runtime dependencies
  • Predictable behavior
  • Small, intuitive API
  • Reusable across projects
  • Easy to extend

License

MIT License. See the LICENSE file for details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

textloom-0.1.0.tar.gz (6.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

textloom-0.1.0-py3-none-any.whl (6.7 kB view details)

Uploaded Python 3

File details

Details for the file textloom-0.1.0.tar.gz.

File metadata

  • Download URL: textloom-0.1.0.tar.gz
  • Upload date:
  • Size: 6.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.10.20

File hashes

Hashes for textloom-0.1.0.tar.gz
Algorithm Hash digest
SHA256 bac6288c3b3111863b3838c4f5b03cdf5971dedc6e2e548dab6f18f5cf7e4898
MD5 cf1ee3ed42a6ad442332e922559c1bcc
BLAKE2b-256 71a8e90951feec7384e68e879cbd1c93425fdd60d6c80f4a9479624b3c0e928e

See more details on using hashes here.

File details

Details for the file textloom-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: textloom-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 6.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.10.20

File hashes

Hashes for textloom-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 062bff005006fe5a1e29e5076703696df770e4f57f460162995097db3a6496f7
MD5 feacac24925534899516fdb9ab5886e6
BLAKE2b-256 8a2e95d07330537d7220c642c308f4ab243929492bf8d8d79de72f7e6293a4ec

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page