A Cython wrapper for MeCab
fugashi
Fugashi is a Cython wrapper for MeCab. It doesn't attempt to cover all of the potential use cases of MeCab, instead dealing with only the most common ones.
- Only UniDic is supported; you can't use IPADic. UniDic Neologd is fine.
- Only UTF-8 is supported.
- Only Python 3 is supported.
Usage
from fugashi import Tagger
tagger = Tagger('-Owakati')
tagger.parse("麩菓子(ふがし)は、麩を主材料とした日本の菓子。")
# => '麩 菓子 ( ふ が し ) は 、 麩 を 主材 料 と し た 日本 の 菓子 。 \n'
for word in tagger.parseToNodeList("麩菓子(ふがし)は、麩を主材料とした日本の菓子。"):
    print(word, word.feature.lemma, word.pos, sep='\t')
# "feature" is the UniDic feature data as a named tuple
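The wakati output shown above is a single string of space-separated tokens ending in a space and a newline. As a minimal sketch, the string can be turned into a Python list with plain string handling. The helper name `split_wakati` is hypothetical and not part of fugashi, and the sample string is copied from the output above so the snippet runs without MeCab installed.

```python
from typing import List


def split_wakati(output: str) -> List[str]:
    """Split a wakati-style string (space-separated tokens) into a list.

    Hypothetical helper for illustration; it is not part of fugashi.
    """
    # str.split() with no arguments splits on runs of whitespace and
    # discards the trailing space and newline.
    return output.split()


# Sample copied from the Tagger('-Owakati') output shown above.
sample = '麩 菓子 ( ふ が し ) は 、 麩 を 主材 料 と し た 日本 の 菓子 。 \n'
tokens = split_wakati(sample)

for token in tokens:
    print(token)
```

Joining the tokens back together with an empty separator reproduces the original sentence, which is a quick sanity check that nothing was dropped.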
Download files
Source Distribution
fugashi-0.1.4.tar.gz (50.1 kB)
File details
Details for the file fugashi-0.1.4.tar.gz.
File metadata
- Download URL: fugashi-0.1.4.tar.gz
- Upload date:
- Size: 50.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/2.0.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.4.0 requests-toolbelt/0.9.1 tqdm/4.36.1 CPython/3.7.4
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 0dbb394b9d21bf48f3c1772fe247da9c8fe3a53d257a8d10e23941eed86b768d |
| MD5 | b9cd59ec1cc63fac97b9f0e77f7cf51e |
| BLAKE2b-256 | 4168a9c829e26a7d5c058a482c542f16f2e8b63e12a51110d55418cf6e1dbe67 |