Skip to main content

Enhanced Lao NLP tokenizer, compatible with laonlp POS tagging

Project description

laonlp-enhanced

An enhanced tokenizer for the Lao language, built with a dictionary-based maximal matching approach and number handling. Integrates with laonlp for POS tagging.

Reference

Installation

pip install laonlp-enhanced

Usage

from laonlp_enhanced import tokenize

text = "ສະບາຍດີ 245,394"
tokens = tokenize(text)
print(tokens)  # Output: ['ສະບາຍດີ', '245,394']

POS Tagging with laonlp

Combine tokenization with POS tagging using the laonlp library:

from laonlp_enhanced import tokenize, pos_tag

text = "ສະບາຍດີ 245,394"
tokens = tokenize(text)
tagged = pos_tag(tokens)
print(tagged)  # Output: [('ສະບາຍດີ', 'NOUN'), ('245,394', 'NUM')]

License

Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at

   http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

laonlp_enhanced-0.1.1.tar.gz (748.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

laonlp_enhanced-0.1.1-py3-none-any.whl (770.2 kB view details)

Uploaded Python 3

File details

Details for the file laonlp_enhanced-0.1.1.tar.gz.

File metadata

  • Download URL: laonlp_enhanced-0.1.1.tar.gz
  • Upload date:
  • Size: 748.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.8

File hashes

Hashes for laonlp_enhanced-0.1.1.tar.gz
Algorithm Hash digest
SHA256 db845d7aa8ec72250c0de7fe74597bab16484f90c6cc6ed484424183c1ecbc1e
MD5 c38998fa5fd0919559f0b04f38fd0f27
BLAKE2b-256 516d4fc32a60cadc1eea01051c044e5ae46a4eb2eb218234bf9dd9cb941117ca

See more details on using hashes here.

File details

Details for the file laonlp_enhanced-0.1.1-py3-none-any.whl.

File metadata

File hashes

Hashes for laonlp_enhanced-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 6498cf6d35fc0ae7b9fb89cedb93efadc877678e94801e8259cc56ea6ed08e89
MD5 d8c5fc58a1a9b6e8cc515f018819495c
BLAKE2b-256 20d5f9c185d9b292430f9f475cf2742f2eb06731e1d85487d7639ffd78efe81c

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page