A Cython wrapper for MeCab
Project description
fugashi
Fugashi is a Cython wrapper for MeCab. It doesn't attempt to cover all of the potential use cases of MeCab, instead dealing with only the most common ones.
- Only UniDic is supported, you can't use IPADic. UniDic Neologd is fine.
- Only UTF-8 is supported.
- Only Python3 is supported.
Usage
from fugashi import Tagger
tagger = Tagger('-Owakati')
tagger.parse("麩菓子(ふがし)は、麩を主材料とした日本の菓子。")
# => '麩 菓子 ( ふ が し ) は 、 麩 を 主材 料 と し た 日本 の 菓子 。 \n'
for word in tagger.parseToNodeList("麩菓子(ふがし)は、麩を主材料とした日本の菓子。"):
print(word, word.feature.lemma, word.pos, sep='\t')
# "feature" is the Unidic feature data as a named tuple
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
fugashi-0.1.3.tar.gz
(49.5 kB
view details)
File details
Details for the file fugashi-0.1.3.tar.gz.
File metadata
- Download URL: fugashi-0.1.3.tar.gz
- Upload date:
- Size: 49.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/2.0.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.4.0 requests-toolbelt/0.9.1 tqdm/4.36.1 CPython/3.7.4
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
6573c415079cdbe89fed534ea8a3baf75a0e3c0c5534d903046c8d2dd313a655
|
|
| MD5 |
d77b374655d732bcb206c95117f0c9af
|
|
| BLAKE2b-256 |
421eb154cf9f02124ca39b0c1e6e62dff72dc8fb0ea98cef6f66ba21e77bd19e
|