Cantonese segmentation tool 粵語分詞工具
cantoseg
Install
$ pip install cantoseg
Usage
>>> import cantoseg
>>> cantoseg.cut('香港喺舊石器時代就有人住')
['香港', '喺', '舊石器時代', '就', '有人', '住']
A generator version is also available: cantoseg.lcut.
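To segment several lines of text at once, a small wrapper along the following lines may help. This is only a sketch: `segment_lines` is a hypothetical helper, not part of cantoseg, and it takes the segmenter as a parameter so it works with either `cantoseg.cut` or `cantoseg.lcut`, whichever return type (list or generator) they produce.

```python
from typing import Callable, Iterable, List


def segment_lines(
    lines: Iterable[str],
    cut: Callable[[str], Iterable[str]],
) -> List[str]:
    """Segment each non-empty line and join its tokens with single spaces.

    `segment_lines` is a hypothetical helper, not part of cantoseg.
    `cut` is any function mapping a string to an iterable of tokens.
    """
    segmented = []
    for line in lines:
        line = line.strip()
        if not line:
            # Skip blank lines instead of emitting empty rows.
            continue
        # list(...) accepts both a list and a generator of tokens.
        tokens = list(cut(line))
        segmented.append(" ".join(tokens))
    return segmented


# Usage with cantoseg (requires `pip install cantoseg`):
#   import cantoseg
#   rows = segment_lines(["香港喺舊石器時代就有人住"], cantoseg.cut)
```

Passing the segmenter in as an argument keeps the helper independent of the package, so the same code can be tried with a different tokenizer.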
Design
See the article "Cantonese Segmentation and Part-Of-Speech Tagging" (in Chinese).
Download files
Source Distribution
cantoseg-0.0.1.tar.gz (3.3 kB)