python-mecab-ko

A Python binding for mecab-ko.

Installation

Using pip:

$ pip install python-mecab-ko

This library currently provides binary wheels for Linux, macOS, and Windows. On any other platform, you need the following prerequisites to build from source:

  • C++14 compatible compiler
  • Python header files
  • mecab-ko
    • This repository provides a convenience script at scripts/install_mecab_ko.py to install mecab-ko

Usage

from mecab import MeCab

# Bind the tagger to a name without shadowing the `mecab` module itself.
mecab = MeCab()

mecab.morphs('영등포구청역에 있는 맛집 좀 알려주세요.')
# ['영등포구청역', '에', '있', '는', '맛집', '좀', '알려', '주', '세요', '.']

mecab.nouns('우리나라에는 무릎 치료를 잘하는 정형외과가 없는가!')
# ['우리', '나라', '무릎', '치료', '정형외과']

mecab.pos('자연주의 쇼핑몰은 어떤 곳인가?')
# [('자연주의', 'NNG'), ('쇼핑몰', 'NNG'), ('은', 'JX'), ('어떤', 'MM'), ('곳', 'NNG'), ('인가', 'VCP+EF'), ('?', 'SF')]

mecab.parse('즐거운 하루 보내세요!')
# [
#     ('즐거운', Feature(
#         pos='VA+ETM', semantic=None, has_jongseong=True, reading='즐거운',
#         type='Inflect', start_pos='VA', end_pos='ETM',
#         expression='즐겁/VA/*+ᆫ/ETM/*')),
#     ('하루', Feature(
#         pos='NNG', semantic=None, has_jongseong=False, reading='하루',
#         type=None, start_pos=None, end_pos=None,
#         expression=None)),
#     ('보내', Feature(
#         pos='VV', semantic=None, has_jongseong=False, reading='보내',
#         type=None, start_pos=None, end_pos=None,
#         expression=None)),
#     ('세요', Feature(
#         pos='EP+EF', semantic=None, has_jongseong=False, reading='세요',
#         type='Inflect', start_pos='EP', end_pos='EF',
#         expression='시/EP/*+어요/EF/*')),
#     ('!', Feature(
#         pos='SF', semantic=None, has_jongseong=None, reading=None,
#         type=None, start_pos=None, end_pos=None,
#         expression=None))
# ]
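
The (surface, tag) pairs returned by pos() are plain tuples, so they can be post-processed with ordinary Python. The following is a minimal sketch rather than part of the library: select_by_tag is a hypothetical helper, and the input is copied from the pos() output shown above, so the snippet runs without mecab-ko installed.

```python
from collections import Counter

# Sample data copied from the mecab.pos() output shown above.
tagged = [
    ("자연주의", "NNG"), ("쇼핑몰", "NNG"), ("은", "JX"), ("어떤", "MM"),
    ("곳", "NNG"), ("인가", "VCP+EF"), ("?", "SF"),
]


def select_by_tag(pairs, prefixes):
    """Hypothetical helper: keep surfaces whose leading tag starts with a prefix.

    Compound tags such as 'VCP+EF' are matched on their first component.
    """
    prefixes = tuple(prefixes)
    return [
        surface
        for surface, tag in pairs
        if tag.split("+")[0].startswith(prefixes)
    ]


# All tokens whose tag begins with "NN" (NNG, NNP, ...).
nouns = select_by_tag(tagged, ["NN"])

# How often each tag occurs in the sentence.
tag_counts = Counter(tag for _, tag in tagged)

print(nouns)
print(tag_counts.most_common(1))
```

In real use, tagged would come from mecab.pos(text) instead of the hard-coded list.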

CLI

The library also provides a proxy for several command-line interfaces provided by mecab. These interfaces can be used to add vocabulary to a user dictionary.

  • python3 -m mecab
  • python3 -m mecab dict-index
  • python3 -m mecab dict-gen
  • python3 -m mecab cost-train

User Dictionary

You can build a user dictionary using the above command-line interface:

$ python3 -m mecab dict-index \
    --userdic=user.dic \
    user.csv
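
If the dictionary has to be rebuilt from a script, the same invocation could be wrapped with subprocess. This is only a sketch: dict_index_command and build_user_dictionary are hypothetical helper names, and the arguments are exactly the ones shown in the command above. Nothing is executed until build_user_dictionary is called.

```python
import subprocess
import sys


def dict_index_command(csv_path, dic_path):
    """Return the argv list for the dict-index invocation shown above."""
    return [
        sys.executable, "-m", "mecab", "dict-index",
        f"--userdic={dic_path}",
        csv_path,
    ]


def build_user_dictionary(csv_path, dic_path):
    """Run dict-index with the current interpreter; raise if it exits non-zero."""
    subprocess.run(dict_index_command(csv_path, dic_path), check=True)


command = dict_index_command("user.csv", "user.dic")
# Print the arguments that follow the interpreter path.
print(" ".join(command[1:]))
```

Using sys.executable instead of a literal python3 keeps the call inside whichever environment has python-mecab-ko installed.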

NOTE

  • The CSV file must be in the following format:
    • <surface>,<left_context_id>,<right_context_id>,<cost>,<pos>,<semantic>,<has_jongseong>,<reading>,<type>,<start_pos>,<end_pos>,<expression>
  • Example:
    • 트위치,,,,NNP,*,F,트위치,*,*,*,*
    • 플랫폼,,,,NNG,*,T,플랫폼,*,*,*,*
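
Rows in this format can be generated rather than typed by hand. The sketch below rests on assumptions that are not stated in the format description: that has_jongseong is T when the last syllable ends in a final consonant and F otherwise (which matches the two example rows), that the reading equals the surface form, and that the remaining columns can stay empty or * as in the examples. The helper names are hypothetical.

```python
import csv
import io
import unicodedata

HANGUL_FIRST = 0xAC00  # '가'
HANGUL_LAST = 0xD7A3   # '힣'


def has_jongseong(word):
    """Return True if the last Hangul syllable ends in a final consonant."""
    last = unicodedata.normalize("NFC", word)[-1]
    code = ord(last)
    if not HANGUL_FIRST <= code <= HANGUL_LAST:
        raise ValueError(f"{last!r} is not a precomposed Hangul syllable")
    # Each initial+medial pair spans 28 code points; offset 0 means no final.
    return (code - HANGUL_FIRST) % 28 != 0


def user_dic_row(surface, pos):
    """Build one row; context IDs and cost are left empty as in the examples."""
    flag = "T" if has_jongseong(surface) else "F"  # assumed meaning of the column
    return [surface, "", "", "", pos, "*", flag, surface, "*", "*", "*", "*"]


def render_csv(entries):
    """Render (surface, pos) pairs as CSV text in the user dictionary format."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    for surface, pos in entries:
        writer.writerow(user_dic_row(surface, pos))
    return buffer.getvalue()


csv_text = render_csv([("트위치", "NNP"), ("플랫폼", "NNG")])
print(csv_text, end="")
```

Writing csv_text to user.csv yields the two example rows above. Entries whose surface ends in a non-Hangul character raise ValueError and need a hand-written row.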

Then, you can add the built user dictionary as follows:

from mecab import MeCab
mecab = MeCab(user_dictionary_path="user.dic")

# If there are multiple dictionaries:
mecab = MeCab(user_dictionary_path=["nnp.dic", "nng.dic"])

For more detailed information, please refer to examples/user_dictionary.py.
