Collection of Khmer language toolkits
Project description
PyKhmerNLP
PyKhmerNLP is a library for processing and analyzing Khmer language data. It includes modules for working with addresses, dictionaries, and tokenization. This documentation will guide you through the functionality of each module and provide examples to help you get started.
Documentation
Check out our detailed documentation here
Installation
Install from Source
git clone https://github.com/MetythornPenn/pykhmernlp.git
cd pykhmernlp
pip install -e .
Install from PyPI
pip install pykhmernlp
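After either install route, one quick way to confirm that the package is visible to the interpreter is to ask the standard library for the installed distribution's version. The sketch below uses only `importlib.metadata`. The helper name `installed_version` is made up for illustration and is not part of PyKhmerNLP.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(dist_name: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return version(dist_name)
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    # "pykhmernlp" is the distribution name used in the pip command above.
    found = installed_version("pykhmernlp")
    print(found if found else "pykhmernlp is not installed in this environment")
```

With an editable install (`pip install -e .`) the reported version comes from the cloned source tree, so it may differ from the latest release on PyPI.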
Features
- Corpus
- Khmer words
- English words
- Khmer to Khmer Dictionary
- English to English Dictionary
- Khmer Address
- Tokenizer
- Pronounce
- Tha
Reference
This library wraps around other awesome Khmer libraries. Without these other libraries, this library wouldn't exist.
Libraries:
- khmercut: from seanghay
- khmerpronounce: from seanghay
- tha: from seanghay
Datasets:
- khmer words: from unicode-org/icu
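Because the library depends on the wrapped packages listed above, it can help to check that they are importable before debugging anything else. The following is a minimal sketch using only the standard library. It assumes each wrapped library is importable under the same name as it is listed here, which this page does not confirm, and `missing_modules` is a hypothetical helper rather than a PyKhmerNLP function.

```python
from importlib.util import find_spec
from typing import Iterable, List


def missing_modules(names: Iterable[str]) -> List[str]:
    """Return the top-level module names that cannot be located for import."""
    return [name for name in names if find_spec(name) is None]


if __name__ == "__main__":
    # Assumption: import names match the library names in the list above.
    wrapped = ["khmercut", "khmerpronounce", "tha"]
    absent = missing_modules(wrapped)
    print("all wrapped libraries found" if not absent else f"missing: {absent}")
```

`find_spec` only locates a module without importing it, so the check stays cheap even if a wrapped library loads large data files on import.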
Project details
Download files
Source Distribution
pykhmernlp-0.0.11.tar.gz (10.6 kB)
Built Distribution
pykhmernlp-0.0.11-py3-none-any.whl (9.0 MB)
File details
Details for the file pykhmernlp-0.0.11.tar.gz.
File metadata
- Download URL: pykhmernlp-0.0.11.tar.gz
- Upload date:
- Size: 10.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.12.3
File hashes
Algorithm | Hash digest
---|---
SHA256 | 2f1b2bdc17703d527ac1ae5ec96d3851a38c8e3472971d6a120dbfd5d560483d
MD5 | 9cc3059259d8a653849709cc768633ca
BLAKE2b-256 | 0419b3f061100249f717d530c19382e1e9a74b5900d555896fc8c72c086c13b7
File details
Details for the file pykhmernlp-0.0.11-py3-none-any.whl.
File metadata
- Download URL: pykhmernlp-0.0.11-py3-none-any.whl
- Upload date:
- Size: 9.0 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.12.3
File hashes
Algorithm | Hash digest
---|---
SHA256 | 008a607ed0a5c47e81b4b5792c05e8d76209bf2022fa95c1bc5e592812e54c21
MD5 | c5f34ca5b5f488ca96e97a85fa553bac
BLAKE2b-256 | bb6ae7d194d24958141959b6ec3c1720ce1b9ff6d02ffa176603508aac460745
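The published digests can be used to check a manually downloaded file before installing it. The sketch below compares a local file's SHA256 against the values listed in the "File hashes" tables above, using only `hashlib` and `pathlib`. The helper names and the assumption that the files sit in the current directory are illustrative.

```python
import hashlib
from pathlib import Path

# SHA256 digests copied from the "File hashes" tables above.
EXPECTED_SHA256 = {
    "pykhmernlp-0.0.11.tar.gz":
        "2f1b2bdc17703d527ac1ae5ec96d3851a38c8e3472971d6a120dbfd5d560483d",
    "pykhmernlp-0.0.11-py3-none-any.whl":
        "008a607ed0a5c47e81b4b5792c05e8d76209bf2022fa95c1bc5e592812e54c21",
}


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Hash a file in chunks so large wheels are not read into memory at once."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path: Path) -> bool:
    """True only if the file name is known and its SHA256 equals the listed value."""
    expected = EXPECTED_SHA256.get(path.name)
    return expected is not None and sha256_of(path) == expected


if __name__ == "__main__":
    # Assumption: the files were downloaded into the current directory.
    for name in EXPECTED_SHA256:
        target = Path(name)
        if target.exists():
            print(name, "OK" if matches_published_hash(target) else "MISMATCH")
        else:
            print(name, "not found, skipping")
```

Only SHA256 is checked here. MD5 is listed on the page as well, but it is not collision-resistant and is best treated as informational.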