Cantonese segmentation tool 粵語分詞工具
Project description
cantoseg ![](https://pypi-camo.freetls.fastly.net/bd9cc3b91dd5bf379a56187310a3d7f52951dcd7/68747470733a2f2f6769746875622e636f6d2f6179616b6131343733322f63616e746f7365672f776f726b666c6f77732f507974686f6e2532307061636b6167652f62616467652e737667)
Cantonese segmentation tool 粵語分詞工具
Install
$ pip install cantoseg
Usage
>>> import cantoseg
>>> cantoseg.cut('香港喺舊石器時代就有人住')
['香港', '喺', '舊石器時代', '就', '有人', '住']
A generator version is also available: cantoseg.lcut
.
Design
See article Cantonese Segmentation and Part-Of-Speech Tagging (in Chinese).
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
cantoseg-0.0.1.tar.gz
(3.3 kB
view hashes)