Thai natural language processing in Python package.
Project description
PyThaiNLP 1.7
PyThaiNLP is a Python library for natural language processing (NLP) of Thai language.
What's new in PyThaiNLP 1.7 ?
- Deprecate Python 2 support
- Refactor pythainlp.tokenize.pyicu for readability
- Add Thai NER model to pythainlp.ner
- thai2vec v0.2 - larger vocab, benchmarking results on Wongnai dataset
- Sentiment classifier based on ULMFit and various product review datasets
- Add ULMFit utility to PyThaiNLP
- Add Thai romanization model thai2rom
- Retrain POS-tagging model
- Improve word tokenize (newmm,mm) and dict_word_tokenize
- Documentation added
Install
pip install pythainlp
Note for Windows: marisa-trie wheels can be obtained from https://www.lfd.uci.edu/~gohlke/pythonlibs/#marisa-trie , then install it with pip, for example: pip install marisa_trie‑0.7.5‑cp36‑cp36m‑win32.whl
Docs : https://thainlp.org/pythainlp/docs/1.7/
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
No source distribution files available for this release.See tutorial on generating distribution archives.
Built Distribution
File details
Details for the file pythainlp-1.7.0.1-py3-none-any.whl
.
File metadata
- Download URL: pythainlp-1.7.0.1-py3-none-any.whl
- Upload date:
- Size: 10.3 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.19.1 setuptools/39.1.0 requests-toolbelt/0.8.0 tqdm/4.26.0 CPython/3.6.5
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 49eb59441b3b699f26e01dd813a13fe7f53b4510ceb62f4ddc3adb46f40f2193 |
|
MD5 | 37ecf335f6d310e3a3158c6b1cab1602 |
|
BLAKE2b-256 | 928f94f2be15497094d81eb362fe89a754fcb124868020f629d0b9cffed4551c |