Skip to main content

Python wrapper for the MeCab morphological analyzer for Japanese

Project description

This is a Python wrapper for the MeCab morphological analyzer for Japanese text. It works with Python 3.5 and greater, as well as Python 2.7. (Note: Python 3.5 is not supported on OSX, see this issue).

Note that Windows wheels require a Microsoft Visual C++ Redistributable, so be sure to install that.

Basic usage

>>> import MeCab
>>> wakati = MeCab.Tagger("-Owakati")
>>> wakati.parse("pythonが大好きです").split()
['python', 'が', '大好き', 'です']

>>> chasen = MeCab.Tagger("-Ochasen")
>>> print(chasen.parse("pythonが大好きです"))
python python  python 名詞-固有名詞-組織
          助詞-格助詞-一般
大好き ダイスキ 大好き 名詞-形容動詞語幹
です  デス   です  助動詞 特殊デス 基本形
EOS

The API for mecab-python3 closely follows the API for MeCab itself, even when this makes it not very “Pythonic.” Please consult the MeCab documentation for more information.

Installation

Binary wheels are available for MacOS X and Linux, and are installed by default when you use pip:

pip install mecab-python3

These wheels include an internal (statically linked) copy of the MeCab library, and a copy of the mecab-ipadic dictionary (using UTF-8 text encoding), which is automatically used by default. If you wish to use a different dictionary, you will need to install it yourself, write a mecabrc file directing MeCab to use it, and set the environment variable MECABRC to point to this file.

To build from source using pip,

pip install --no-binary :all: mecab-python3

Alternatively, you can use pip to download the source, then build it by hand:

pip download --no-binary :all: mecab-python3
tar zxf mecab-python3-{version}.tar.gz
cd mecab-python3-{version}
python3 setup.py build
# install as you like

When the module is built from source, it requires the system to provide the MeCab library and at least one dictionary. You must have SWIG, the MeCab library and headers, and a dictionary installed before running pip install or setup.py build. For instance, on Debian-based Linux,

sudo apt-get install swig libmecab-dev mecab-ipadic-utf8

Building wheels with a bundled library and dictionary is only supported in a sanitized CI environment. Consult the scripts in the scripts subdirectory of the source tree to see how it’s done.

Licensing

Like MeCab itself, mecab-python3 is copyrighted free software by Taku Kudo taku@chasen.org and Nippon Telegraph and Telephone Corporation, and is distributed under a 3-clause BSD license (see the file BSD). Alternatively, it may be redistributed under the terms of the GNU General Public License, version 2 (see the file GPL) or the GNU Lesser General Public License, version 2.1 (see the file LGPL).

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

mecab_python3-0.996.6rc3-cp38-cp38-win_amd64.whl (508.1 kB view details)

Uploaded CPython 3.8Windows x86-64

mecab_python3-0.996.6rc3-cp37-cp37m-win_amd64.whl (507.9 kB view details)

Uploaded CPython 3.7mWindows x86-64

mecab_python3-0.996.6rc3-cp36-cp36m-win_amd64.whl (507.9 kB view details)

Uploaded CPython 3.6mWindows x86-64

mecab_python3-0.996.6rc3-cp35-cp35m-win_amd64.whl (507.9 kB view details)

Uploaded CPython 3.5mWindows x86-64

File details

Details for the file mecab_python3-0.996.6rc3-cp38-cp38-win_amd64.whl.

File metadata

  • Download URL: mecab_python3-0.996.6rc3-cp38-cp38-win_amd64.whl
  • Upload date:
  • Size: 508.1 kB
  • Tags: CPython 3.8, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/42.0.2 requests-toolbelt/0.9.1 tqdm/4.45.0 CPython/3.8.2

File hashes

Hashes for mecab_python3-0.996.6rc3-cp38-cp38-win_amd64.whl
Algorithm Hash digest
SHA256 9abc7800eabe337b7c08cd8b07b0147db3d7c9db10c486d8658bc3f3f6a4c283
MD5 89093d42da02f27565a9062d44dcd9b7
BLAKE2b-256 42553227d58ad71d571d489e6777ebc4eb7110e3cd9d4c011538008185c6d0aa

See more details on using hashes here.

File details

Details for the file mecab_python3-0.996.6rc3-cp37-cp37m-win_amd64.whl.

File metadata

  • Download URL: mecab_python3-0.996.6rc3-cp37-cp37m-win_amd64.whl
  • Upload date:
  • Size: 507.9 kB
  • Tags: CPython 3.7m, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/42.0.2 requests-toolbelt/0.9.1 tqdm/4.45.0 CPython/3.8.2

File hashes

Hashes for mecab_python3-0.996.6rc3-cp37-cp37m-win_amd64.whl
Algorithm Hash digest
SHA256 4596f5c79d3a4aeb8231e6811afefcbb3b22444dfef05d088f399a35d6554818
MD5 15e044ca7b34b7c5f4e130d05abaef02
BLAKE2b-256 c75b43c832dc73b2e0668b15704e80a022b93f520e78ee0d8e604dd7887b92b7

See more details on using hashes here.

File details

Details for the file mecab_python3-0.996.6rc3-cp36-cp36m-win_amd64.whl.

File metadata

  • Download URL: mecab_python3-0.996.6rc3-cp36-cp36m-win_amd64.whl
  • Upload date:
  • Size: 507.9 kB
  • Tags: CPython 3.6m, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/42.0.2 requests-toolbelt/0.9.1 tqdm/4.45.0 CPython/3.8.2

File hashes

Hashes for mecab_python3-0.996.6rc3-cp36-cp36m-win_amd64.whl
Algorithm Hash digest
SHA256 6cebbd147f4a7ee6232c1480cc7f505f781eda97a65bda4079610fd309025378
MD5 cdc38f0eb32f80f5fa9f903fcf2389ad
BLAKE2b-256 14a676c834d283a3626861bb1ce7b11e0d0e37d3666e96646dde58f9cf204360

See more details on using hashes here.

File details

Details for the file mecab_python3-0.996.6rc3-cp35-cp35m-win_amd64.whl.

File metadata

  • Download URL: mecab_python3-0.996.6rc3-cp35-cp35m-win_amd64.whl
  • Upload date:
  • Size: 507.9 kB
  • Tags: CPython 3.5m, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/42.0.2 requests-toolbelt/0.9.1 tqdm/4.45.0 CPython/3.8.2

File hashes

Hashes for mecab_python3-0.996.6rc3-cp35-cp35m-win_amd64.whl
Algorithm Hash digest
SHA256 a27b7a316d0ae26c4e645b43245543d95a0a2baee46ef953faf02ad85121e4b0
MD5 35c6969f39d6fdbb091ee07273ea62fa
BLAKE2b-256 a457ccb0846e4f3099b001d3252cf908ed96ddb55f7566d27c56d27690342aea

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page