Lightweight preprocessing and reversible Modular Linear Tokenization (MLT) utilities for categorical and continuous data.

🧩 light-mlt


✨ Overview

light-mlt is a lightweight Python package that implements Modular Linear Tokenization (MLT) — a deterministic and fully reversible method for encoding high-cardinality categorical identifiers into compact numerical vectors.

Unlike hashing or one-hot encodings, MLT guarantees bijective mappings, offers explicit control of dimensionality, and integrates seamlessly with machine learning pipelines.
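To make the reversibility and dimensionality claims concrete, here is a minimal sketch of one way a modular linear encoding can work. It is an illustration under stated assumptions, not light-mlt's actual API: the function names `mlt_encode` and `mlt_decode` and the parameters `multiplier`, `base`, and `dims` are hypothetical. The identifier is multiplied by a constant that is coprime to `base ** dims`, reduced modulo `base ** dims`, and split into `dims` base-`base` digits. Because the multiplier has a modular inverse, every step can be undone exactly.

```python
from math import gcd


def mlt_encode(identifier: int, multiplier: int, base: int, dims: int) -> list[int]:
    """Map an integer ID to a vector of `dims` digits (hypothetical helper)."""
    modulus = base ** dims
    if not 0 <= identifier < modulus:
        raise ValueError("identifier must lie in [0, base ** dims)")
    if gcd(multiplier, modulus) != 1:
        raise ValueError("multiplier must be coprime to base ** dims")
    # Linear map modulo base ** dims; invertible because multiplier is a unit.
    mixed = (identifier * multiplier) % modulus
    digits = []
    for _ in range(dims):
        # Peel off the least significant base-`base` digit first.
        mixed, digit = divmod(mixed, base)
        digits.append(digit)
    return digits


def mlt_decode(digits: list[int], multiplier: int, base: int) -> int:
    """Recover the original ID from its digit vector (hypothetical helper)."""
    modulus = base ** len(digits)
    mixed = 0
    # Reassemble the integer, most significant digit first.
    for digit in reversed(digits):
        mixed = mixed * base + digit
    # pow(x, -1, m) returns the modular inverse (Python 3.8+).
    inverse = pow(multiplier, -1, modulus)
    return (mixed * inverse) % modulus


if __name__ == "__main__":
    vector = mlt_encode(1234, multiplier=7, base=10, dims=4)
    print(vector, "->", mlt_decode(vector, multiplier=7, base=10))
```

In this sketch, `dims` sets the vector length directly and the representable range is `base ** dims` identifiers, which is one way to read "explicit control of dimensionality". Distinct IDs in that range always produce distinct vectors, unlike a hash, which can collide.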

It was developed as part of applied research on scalable tokenization and efficient preprocessing for tabular and recommendation systems.


🚀 Installation

pip install light-mlt


Download files

Source Distribution

light_mlt-0.1.0.tar.gz (10.0 kB)

Uploaded Source

Built Distribution

light_mlt-0.1.0-py3-none-any.whl (8.1 kB)

Uploaded Python 3

File details

Details for the file light_mlt-0.1.0.tar.gz.

File metadata

  • Download URL: light_mlt-0.1.0.tar.gz
  • Upload date:
  • Size: 10.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.10.12

File hashes

Hashes for light_mlt-0.1.0.tar.gz
Algorithm Hash digest
SHA256 f1d5c245b3fc835475f2ae35af42e010451f4885b9a0d06d4619e2d625c46e6b
MD5 dfb157c07b94149a5e161077cf139fed
BLAKE2b-256 4aab9cd00855bc906c94ac9a7a614eb18737c3dff09f49aac5d5187e43ffd78b
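
A downloaded archive can be checked against the published digest with the standard library. The sketch below assumes the source distribution was saved in the current directory under its original file name. The helper names `sha256_of` and `matches_published_digest` are illustrative and not part of light-mlt.

```python
import hashlib
from pathlib import Path

# SHA256 digest published for light_mlt-0.1.0.tar.gz in the listing above.
EXPECTED_SHA256 = "f1d5c245b3fc835475f2ae35af42e010451f4885b9a0d06d4619e2d625c46e6b"


def sha256_of(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # iter() with a sentinel stops when read() returns empty bytes.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_digest(path, expected=EXPECTED_SHA256):
    """Compare a file's SHA256 digest with the expected hex string."""
    return sha256_of(path) == expected.lower()


if __name__ == "__main__":
    archive = Path("light_mlt-0.1.0.tar.gz")  # assumed local download path
    if archive.exists():
        print("digest matches:", matches_published_digest(archive))
    else:
        print("archive not found:", archive)
```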

File details

Details for the file light_mlt-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: light_mlt-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 8.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.10.12

File hashes

Hashes for light_mlt-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 2eff66e1c02bf99e84a1f56f458fc79be0a176784cfe6b3a3282ec1c97bb7506
MD5 24ede80bc1b70d303c299421fd42d5ed
BLAKE2b-256 3b869ddbbcb5f846c77ab993347793aa5972c5a3cab909f9dc3980f0bd88fbf2
