Skip to main content

A small version of UniDic packaged for Python

Project description

Current PyPI packages

Unidic Lite

This is a version of unidic-py that is designed to be installable with pip alone, not requiring any extra downloads.

At the moment it uses Unidic 2.1.2, from 2013, which is the most recent release of UniDic that's small enough to be distributed via PyPI.

Note this package takes roughly 250MB on disk after being installed.

In order to use this you will need to install a MeCab wrapper such as mecab-python3 or fugashi.

Differences from the Official UniDic Release

This has a few changes from the official UniDic release to make it easier to use.

  • entries for 令和 have been added
  • single-character numeric and alphabetic words have been deleted
  • unk.def has been modified so unknown punctuation won't be marked as a noun

License

This code is licensed under the MIT or WTFPL license, as you prefer. Unidic 2.1.2 is copyright the UniDic Consortium and distributed under the terms of the BSD license.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

unidic-lite-1.0.8.tar.gz (47.4 MB view details)

Uploaded Source

File details

Details for the file unidic-lite-1.0.8.tar.gz.

File metadata

  • Download URL: unidic-lite-1.0.8.tar.gz
  • Upload date:
  • Size: 47.4 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.3.0 pkginfo/1.7.0 requests/2.25.1 setuptools/51.0.0 requests-toolbelt/0.9.1 tqdm/4.56.0 CPython/3.9.1

File hashes

Hashes for unidic-lite-1.0.8.tar.gz
Algorithm Hash digest
SHA256 db9d4572d9fdd4d00a97949d4b0741ec480ee05a7e7e2e32f547500dae27b245
MD5 5a6b70b6532e61f3ab8685e11fc4c959
BLAKE2b-256 552b8cf7514cb57d028abcef625afa847d60ff1ffbf0049c36b78faa7c35046f

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page