Skip to main content

Python functions for working with the Thai language

Project description


Some basic python functions for working with the Thai language. For example:

import pythai

>>> u"การ ที่ ได้ ต้อง แสดง ว่า งาน ดี"

>>> 8

>>> False

>>> True

It's meant to be fast and efficient enough to handle large documents without breaking a sweat.


Currently the library supports these functions:

- Word segmentation (`split`)
- Word count (`word_count`) (faster than counting the result of `split`)
- Whether a string contains Thai or not (`contains_thai`)


PyThai equires `thailib` to work. You can install it quite easily:

sudo apt-get install thailib

And then you can simply install `pythai` through **pip**:

pip install pythai


Special thanks to Vee Satayamas for the original python bindings of libthai from C.

This library was written for use in [Gengo]( It's free and open-source under the GNU lesser public license. Any contributions are welcome!

Project details

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pythai-0.1.3.tar.gz (13.6 kB view hashes)

Uploaded source

Supported by

AWS AWS Cloud computing Datadog Datadog Monitoring Facebook / Instagram Facebook / Instagram PSF Sponsor Fastly Fastly CDN Google Google Object Storage and Download Analytics Huawei Huawei PSF Sponsor Microsoft Microsoft PSF Sponsor NVIDIA NVIDIA PSF Sponsor Pingdom Pingdom Monitoring Salesforce Salesforce PSF Sponsor Sentry Sentry Error logging StatusPage StatusPage Status page