Skip to main content

Python port of Igo Japanese morphological analyzer

Project description

About

Igo is a Japanese morphological analyzer written in Java and Common Lisp. This software is Python port of Igo(Java version).

Notice

Dictionary builder is not provided. You need to use Igo Java version to build the dictionary for Igo. From igo-python 0.9.7, pre-built IPA dictionary (2.7.0-20070801) is included for ease of use.

How To Use

You can use Igo Python easily:

>>> from igo.Tagger import Tagger
>>> t = Tagger() # use bundled dictionary
>>> for m in t.parse(u'すもももももももものうち'):
...     print m.surface, m.feature
...
すもも 名詞,一般,*,*,*,*,すもも,スモモ,スモモ
も 助詞,係助詞,*,*,*,*,も,モ,モ
もも 名詞,一般,*,*,*,*,もも,モモ,モモ
も 助詞,係助詞,*,*,*,*,も,モ,モ
もも 名詞,一般,*,*,*,*,もも,モモ,モモ
の 助詞,連体化,*,*,*,*,の,ノ,ノ
うち 名詞,非自立,副詞可能,*,*,*,うち,ウチ,ウチ
>>>

Using command line parser:

$ igo # or python -m igo.parse
私の名前は中野です
私      名詞,代名詞,一般,*,*,*,私,ワタシ,ワタシ
の      助詞,連体化,*,*,*,*,の,ノ,ノ
名前    名詞,一般,*,*,*,*,名前,ナマエ,ナマエ
は      助詞,係助詞,*,*,*,*,は,ハ,ワ
中野    名詞,固有名詞,地域,一般,*,*,中野,ナカノ,ナカノ
です    助動詞,*,*,*,特殊・デス,基本形,です,デス,デス
EOS

Changelog for Igo-Python

2018-09-22 – 1.0.0
  • dropped Python 3.3 from supported Python version.

2017-02-09 – 0.9.9
  • fixed few small issues and add unit test.

2016-07-08 – 0.9.8
  • fixed a surrogate pair character handling issue

2015-12-16 – 0.9.7.1
  • Tagger can be pickled on Python2 env

2015-12-14 – 0.9.7
  • include pre-built IPA dictionary

2015-07-03 – 0.9.6
  • fixed an issue of dictionary size in mmap mode

2015-04-14 – 0.9.5
  • use mmap to load files of a dictionary if it is possible

2013-07-05 – 0.9.3
  • support characters in Supplementary Planes

2012-02-27 – 0.9.2
  • apply changes in Igo(java) 0.4.4

2012-02-27 – 0.9.1
  • synchronize with Igo(java) 0.4.5

2011-09-01 – 0.9
  • compatibility with python3.2

  • update the code base to igo-0.4.3

  • add some enhances and clean up.

2011-08-29 – 0.4
  • add some enhances and clean up.

2010-11-28 – 0.3a
  • fix bugs(if a phrase ends with ‘た’ will causes error)

2010-11-27 – 0.3
  • drop mmap related code

  • reduce memory footprint

2010-11-27 – 0.2
  • support Google App Engine(maybe)

2010-11-27 – 0.1a
  • update package info only.

2010-11-25 – 0.1
  • first release.

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

igo-python-1.0.0.tar.gz (8.8 MB view details)

Uploaded Source

Built Distribution

igo_python-1.0.0-py2.py3-none-any.whl (8.9 MB view details)

Uploaded Python 2 Python 3

File details

Details for the file igo-python-1.0.0.tar.gz.

File metadata

  • Download URL: igo-python-1.0.0.tar.gz
  • Upload date:
  • Size: 8.8 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.19.1 setuptools/39.0.1 requests-toolbelt/0.8.0 tqdm/4.26.0 CPython/3.7.0

File hashes

Hashes for igo-python-1.0.0.tar.gz
Algorithm Hash digest
SHA256 8ea824ab31d8245757138fd9b9d6582bb7425f71ae4c58caf0e4497bd77a91c6
MD5 347cbf593deefd5e4bc8063dc54bfd4d
BLAKE2b-256 ced01c514e43fa8a6fdfb3d4f59a82237d6ed2e38e7e8d0655b2932325c24635

See more details on using hashes here.

File details

Details for the file igo_python-1.0.0-py2.py3-none-any.whl.

File metadata

  • Download URL: igo_python-1.0.0-py2.py3-none-any.whl
  • Upload date:
  • Size: 8.9 MB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.19.1 setuptools/39.0.1 requests-toolbelt/0.8.0 tqdm/4.26.0 CPython/3.7.0

File hashes

Hashes for igo_python-1.0.0-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 fa0e39e4d9fedabd2121d2593d3fc2d0513c7facc92cbbc24c932243cc4d8bc4
MD5 7b1a8d95c076672906d1baaa356bb9f8
BLAKE2b-256 5d6700b4083d9ef20a3063515c91d9f79838e2e78f637e062d944f2336071d6f

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page