Skip to main content

Python wrapper for the MeCab morphological analyzer for Japanese

Project description

This is a Python wrapper for the MeCab morphological analyzer for Japanese text. It works with Python 3.6 and greater, as well as Python 2.7.

issueを英語で書く必要はありません。

Note that Windows wheels require a Microsoft Visual C++ Redistributable, so be sure to install that.

Basic usage

>>> import MeCab
>>> wakati = MeCab.Tagger("-Owakati")
>>> wakati.parse("pythonが大好きです").split()
['python', 'が', '大好き', 'です']

>>> tagger = MeCab.Tagger()
>>> print(tagger.parse("pythonが大好きです"))
python  python  python  python  名詞-普通名詞-一般
                        助詞-格助詞
大好き  ダイスキ        ダイスキ        大好き  形状詞-一般
です    デス    デス    です    助動詞  助動詞-デス     終止形-一般
EOS

The API for mecab-python3 closely follows the API for MeCab itself, even when this makes it not very “Pythonic.” Please consult the official MeCab documentation for more information.

Installation

Binary wheels are available for MacOS X, Linux, and Windows (64bit) are installed by default when you use pip:

pip install mecab-python3

These wheels include an internal (statically linked) copy of the MeCab library, but not dictionary. In order to use MeCab you'll need to install a dictionary. unidic-lite is a good one to start with:

pip install unidic-lite

To build from source using pip,

pip install --no-binary :all: mecab-python3

Common Issues

If you get a RuntimeError when you try to run MeCab, here are some things to check:

Windows Redistributable

You have to install this to use this package on Windows.

Installing a Dictionary

Run pip install unidic-lite and confirm that works. If that fixes your problem, you either don't have a dictionary installed, or you need to specify your dictionary path like this:

tagger = MeCab.Tagger('-r /dev/null -d /usr/local/lib/mecab/dic/mydic')

Note: on Windows, use nul instead of /dev/null. Alternately, if you have a mecabrc you can use the path after -r.

Specifying a mecabrc

If you specify a dictionary path, you also must have a mecabrc file, even if it's empty. You can specify a mecabrc with -r. This may be necessary on Debian or Ubuntu, where the mecabrc is in /etc/mecabrc.

Using Unsupported Output Modes like -Ochasen

Chasen output is not a built-in feature of MeCab, you must specify it in your dicrc or mecabrc. Notably, Unidic does not include Chasen output format. Please see the MeCab documentation.

Alternatives

  • fugashi is a Cython wrapper for MeCab with a Pythonic interface, by the current maintainer of this libray
  • SudachiPy is a modern tokenizer with a maintained dictionary, though it's slower than MeCab
  • KoNLPy is a library for Korean NLP that includes a MeCab wrapper

Licensing

Like MeCab itself, mecab-python3 is copyrighted free software by Taku Kudo taku@chasen.org and Nippon Telegraph and Telephone Corporation, and is distributed under a 3-clause BSD license (see the file BSD). Alternatively, it may be redistributed under the terms of the GNU General Public License, version 2 (see the file GPL) or the GNU Lesser General Public License, version 2.1 (see the file LGPL).

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Files for mecab-python3, version 1.0.1
Filename, size File type Python version Upload date Hashes
Filename, size mecab_python3-1.0.1-cp27-cp27m-macosx_10_9_x86_64.whl (302.2 kB) File type Wheel Python version cp27 Upload date Hashes View
Filename, size mecab_python3-1.0.1-cp27-cp27m-manylinux1_i686.whl (2.2 MB) File type Wheel Python version cp27 Upload date Hashes View
Filename, size mecab_python3-1.0.1-cp27-cp27m-manylinux1_x86_64.whl (2.3 MB) File type Wheel Python version cp27 Upload date Hashes View
Filename, size mecab_python3-1.0.1-cp27-cp27m-manylinux2010_i686.whl (2.5 MB) File type Wheel Python version cp27 Upload date Hashes View
Filename, size mecab_python3-1.0.1-cp27-cp27m-manylinux2010_x86_64.whl (3.5 MB) File type Wheel Python version cp27 Upload date Hashes View
Filename, size mecab_python3-1.0.1-cp27-cp27mu-manylinux1_i686.whl (2.2 MB) File type Wheel Python version cp27 Upload date Hashes View
Filename, size mecab_python3-1.0.1-cp27-cp27mu-manylinux1_x86_64.whl (2.3 MB) File type Wheel Python version cp27 Upload date Hashes View
Filename, size mecab_python3-1.0.1-cp27-cp27mu-manylinux2010_i686.whl (2.5 MB) File type Wheel Python version cp27 Upload date Hashes View
Filename, size mecab_python3-1.0.1-cp27-cp27mu-manylinux2010_x86_64.whl (3.5 MB) File type Wheel Python version cp27 Upload date Hashes View
Filename, size mecab_python3-1.0.1-cp35-cp35m-manylinux1_i686.whl (2.2 MB) File type Wheel Python version cp35 Upload date Hashes View
Filename, size mecab_python3-1.0.1-cp35-cp35m-manylinux1_x86_64.whl (2.3 MB) File type Wheel Python version cp35 Upload date Hashes View
Filename, size mecab_python3-1.0.1-cp35-cp35m-manylinux2010_i686.whl (2.5 MB) File type Wheel Python version cp35 Upload date Hashes View
Filename, size mecab_python3-1.0.1-cp35-cp35m-manylinux2010_x86_64.whl (3.5 MB) File type Wheel Python version cp35 Upload date Hashes View
Filename, size mecab_python3-1.0.1-cp35-cp35m-win_amd64.whl (509.8 kB) File type Wheel Python version cp35 Upload date Hashes View
Filename, size mecab_python3-1.0.1-cp36-cp36m-macosx_10_9_x86_64.whl (303.0 kB) File type Wheel Python version cp36 Upload date Hashes View
Filename, size mecab_python3-1.0.1-cp36-cp36m-manylinux1_i686.whl (2.2 MB) File type Wheel Python version cp36 Upload date Hashes View
Filename, size mecab_python3-1.0.1-cp36-cp36m-manylinux1_x86_64.whl (2.3 MB) File type Wheel Python version cp36 Upload date Hashes View
Filename, size mecab_python3-1.0.1-cp36-cp36m-manylinux2010_i686.whl (2.5 MB) File type Wheel Python version cp36 Upload date Hashes View
Filename, size mecab_python3-1.0.1-cp36-cp36m-manylinux2010_x86_64.whl (3.5 MB) File type Wheel Python version cp36 Upload date Hashes View
Filename, size mecab_python3-1.0.1-cp36-cp36m-win_amd64.whl (509.8 kB) File type Wheel Python version cp36 Upload date Hashes View
Filename, size mecab_python3-1.0.1-cp37-cp37m-macosx_10_9_x86_64.whl (303.0 kB) File type Wheel Python version cp37 Upload date Hashes View
Filename, size mecab_python3-1.0.1-cp37-cp37m-manylinux1_i686.whl (2.2 MB) File type Wheel Python version cp37 Upload date Hashes View
Filename, size mecab_python3-1.0.1-cp37-cp37m-manylinux1_x86_64.whl (2.3 MB) File type Wheel Python version cp37 Upload date Hashes View
Filename, size mecab_python3-1.0.1-cp37-cp37m-manylinux2010_i686.whl (2.5 MB) File type Wheel Python version cp37 Upload date Hashes View
Filename, size mecab_python3-1.0.1-cp37-cp37m-manylinux2010_x86_64.whl (3.5 MB) File type Wheel Python version cp37 Upload date Hashes View
Filename, size mecab_python3-1.0.1-cp37-cp37m-win_amd64.whl (509.8 kB) File type Wheel Python version cp37 Upload date Hashes View
Filename, size mecab_python3-1.0.1-cp38-cp38-macosx_10_9_x86_64.whl (303.2 kB) File type Wheel Python version cp38 Upload date Hashes View
Filename, size mecab_python3-1.0.1-cp38-cp38-manylinux1_i686.whl (2.2 MB) File type Wheel Python version cp38 Upload date Hashes View
Filename, size mecab_python3-1.0.1-cp38-cp38-manylinux1_x86_64.whl (2.3 MB) File type Wheel Python version cp38 Upload date Hashes View
Filename, size mecab_python3-1.0.1-cp38-cp38-manylinux2010_i686.whl (2.5 MB) File type Wheel Python version cp38 Upload date Hashes View
Filename, size mecab_python3-1.0.1-cp38-cp38-manylinux2010_x86_64.whl (3.5 MB) File type Wheel Python version cp38 Upload date Hashes View
Filename, size mecab_python3-1.0.1-cp38-cp38-win_amd64.whl (510.0 kB) File type Wheel Python version cp38 Upload date Hashes View
Filename, size mecab-python3-1.0.1.tar.gz (108.7 kB) File type Source Python version None Upload date Hashes View

Supported by

Pingdom Pingdom Monitoring Google Google Object Storage and Download Analytics Sentry Sentry Error logging AWS AWS Cloud computing DataDog DataDog Monitoring Fastly Fastly CDN DigiCert DigiCert EV certificate StatusPage StatusPage Status page