Skip to main content

Thai Natural Language Processing library

Project description

PyThaiNLP Logo

PyThaiNLP

PyThaiNLP is a Python library for natural language processing (NLP) of Thai language.

PyThaiNLP includes Thai word tokenizers, transliterators, soundex converters, part-of-speech taggers, and spell checkers.

📫 follow us on Facebook PyThaiNLP

What's new in 2.0 ?

Install

For stable version:

pip install pythainlp

For some advanced functionalities, like word vector, extra packages may be needed. Install them with these options during pip install:

pip install pythainlp[extra1,extra2,...]

where extras can be

  • artagger (to support artagger part-of-speech tagger)*
  • deepcut (to support deepcut machine-learnt tokenizer)
  • icu (for ICU support in transliteration and tokenization)
  • ipa (for International Phonetic Alphabet support in transliteration)
  • ml (to support fastai 1.0.22 ULMFiT models)
  • ner (for named-entity recognizer)
  • thai2fit (for Thai word vector)
  • thai2rom (for machine-learnt romanization)
  • full (install everything)

Note for Windows: marisa-trie wheels can be obtained from https://www.lfd.uci.edu/~gohlke/pythonlibs/#marisa-trie Install it with pip, for example: pip install marisa_trie‑0.7.5‑cp36‑cp36m‑win32.whl

Links

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pythainlp-2.0.4.tar.gz (55.0 kB view details)

Uploaded Source

Built Distribution

pythainlp-2.0.4-py3-none-any.whl (11.2 MB view details)

Uploaded Python 3

File details

Details for the file pythainlp-2.0.4.tar.gz.

File metadata

  • Download URL: pythainlp-2.0.4.tar.gz
  • Upload date:
  • Size: 55.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.19.1 setuptools/41.0.0 requests-toolbelt/0.8.0 tqdm/4.26.0 CPython/3.7.3

File hashes

Hashes for pythainlp-2.0.4.tar.gz
Algorithm Hash digest
SHA256 2234f82deb385e96bdb71bd81d0e35cf056d28408b2e3508ea756302ffcfe8a9
MD5 8d3eee573d44bc0cc3c1b1f982854a0d
BLAKE2b-256 f10330e82c07002abcfecdf3495bd5b16b98ce10e3ab013b1d5647def050445b

See more details on using hashes here.

File details

Details for the file pythainlp-2.0.4-py3-none-any.whl.

File metadata

  • Download URL: pythainlp-2.0.4-py3-none-any.whl
  • Upload date:
  • Size: 11.2 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.19.1 setuptools/41.0.0 requests-toolbelt/0.8.0 tqdm/4.26.0 CPython/3.7.3

File hashes

Hashes for pythainlp-2.0.4-py3-none-any.whl
Algorithm Hash digest
SHA256 28d70236a67975c1ce6c259762bca038a6fb4d86d9b96c42718bd5fbebc39bc1
MD5 1e04f2cd1dc2f53f8961e23691c30a6c
BLAKE2b-256 345b8e6981607ee4a24d3ba178e988feba2f361299778da201e2fe916a6ca31e

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page