Skip to main content

UniDic packaged for Python

Project description

unidic-py

This is a version of UniDic packaged for use with pip.

This is based on UniDic 2.1.2, which is roughly 55MB zipped or 300MB unzipped. There are more recent versions of UniDic but they're significantly larger, which makes packaging difficult.

This package distributes only the files necessary for using UniDic with MeCab. The large files are gzipped for distribution and unzipped the first time the library is imported. It would be better for MeCab to unzip on the fly when reading from disk but it doesn't support that.

Example use with fugashi, though mecab-python3 works the same way:

import fugashi
import unidic
tagger = fugashi.Tagger('-d{}'.format(unidic.DICDIR))
# that's it!

License

The modern Japanese UniDic is available under the GPL, LGPL, or BSD license, see here or the included BSD license. UniDic is developed by NINJAL, the National Institute for Japanese Language and Linguistics.

The code in this repository is not written or maintained by NINJAL. The code is available under the MIT or WTFPL License, as you prefer.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

unidic-0.0.1.tar.gz (56.1 MB view details)

Uploaded Source

File details

Details for the file unidic-0.0.1.tar.gz.

File metadata

  • Download URL: unidic-0.0.1.tar.gz
  • Upload date:
  • Size: 56.1 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/44.0.0 requests-toolbelt/0.9.1 tqdm/4.41.1 CPython/3.8.0

File hashes

Hashes for unidic-0.0.1.tar.gz
Algorithm Hash digest
SHA256 798b332a44ff16a5d7269e7ffa81a1c01d6cf9f06be544b7c306438d61cb9d6f
MD5 d12256fc483373622f59b6b1a311bf12
BLAKE2b-256 d08b4f7d2012174d9c854c3116edb9ab5359a8f4c8557e69ea776036fca90ac8

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page