Simple Thai Preprocess Functions
Project description
th-preprocessor
Simple Thai Preprocess Functions
Objectives
This repository provides simple preprocess techniques for Thai sentences/phrases
Supports
The module supports Python 3.6+
Installation
pip install th-simple-preprocessor
How to Use
from th_preprocessor.preprocess import preprocess
text = '"::::: อย่างไรก็ตามนูร์ ฮิชัม อับดุลเลาะห์ 21-09-2018 https://www.malaysiakini.com/news/444015"'
words = preprocess(text)
print(words)
# อย่างไรก็ตามนูร์ ฮิชัม อับดุลเลาะห์ WSNUMBER WSNUMBER WSNUMBER WSLINK
Package reference:
th_preprocessor.preprocess.normalize_linkth_preprocessor.preprocess.normalize_at_mentionth_preprocessor.preprocess.normalize_emailth_preprocessor.preprocess.normalize_hahath_preprocessor.preprocess.normalize_numth_preprocessor.preprocess.normalize_phoneth_preprocessor.preprocess.normalize_accented_charsth_preprocessor.preprocess.normalize_special_charsth_preprocessor.preprocess.remove_hashtagsth_preprocessor.preprocess.remove_tagth_preprocessor.preprocess.remove_dup_spacesth_preprocessor.preprocess.remove_emojith_preprocessor.preprocess.replace_dup_charsth_preprocessor.preprocess.replace_dup_emojisth_preprocessor.preprocess.insert_spacesth_preprocessor.preprocess.normalize_emojith_preprocessor.preprocess.remove_others_charth_preprocessor.preprocess.remove_stopwordsth_preprocessor.preprocess.preprocess
Copyright
All licenses in this repository are copyrighted by their respective authors. Everything else is released under CC0. See LICENSE for details.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file th-simple-preprocessor-0.10.1.tar.gz.
File metadata
- Download URL: th-simple-preprocessor-0.10.1.tar.gz
- Upload date:
- Size: 21.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.9.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
5900044680ab3681571216f043ace1b4e01e46dacb88ca55643c8293782817ba
|
|
| MD5 |
36eff3d66e62581ac7df541a5bb9ba98
|
|
| BLAKE2b-256 |
6c530dc483089e5efec48e0f3ed57fec4b1c18c360cc37256cb5ccffbe8e6e12
|
File details
Details for the file th_simple_preprocessor-0.10.1-py3-none-any.whl.
File metadata
- Download URL: th_simple_preprocessor-0.10.1-py3-none-any.whl
- Upload date:
- Size: 20.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.9.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
be0264d87aeed90612baea860cb09b2481f1127b8ff12fe655577e4e4b8ad486
|
|
| MD5 |
cd77ccd592261e505662887916ebbed4
|
|
| BLAKE2b-256 |
d73ea76ec468700d4fc2405c74d4bd733cd97b670d2c7cd6d1177add152dc975
|