Japanese morphological analysis engine.
Project description
Janome is a Japanese morphological analysis engine written in pure Python.
General documentation:
https://mocobeta.github.io/janome/en/ (English)
https://mocobeta.github.io/janome/ (Japanese)
Requirements
Python 3.7+ is required.
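A script that depends on Janome can check this requirement up front. The following is a minimal sketch; `MIN_PYTHON` and `python_is_supported` are hypothetical names chosen for illustration and are not part of Janome.

```python
import sys

# Minimum interpreter version stated in the requirements above.
MIN_PYTHON = (3, 7)


def python_is_supported(version_info=sys.version_info, minimum=MIN_PYTHON):
    """Return True if the (major, minor) version meets the minimum."""
    return tuple(version_info[:2]) >= minimum


if not python_is_supported():
    # Stop early with a readable message instead of failing later on import.
    raise SystemExit("Python %d.%d or newer is required." % MIN_PYTHON)
```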
Install
[Note] Installation consumes about 500 MB of memory for building.
(venv) $ pip install janome
Run
(venv) $ python
>>> from janome.tokenizer import Tokenizer
>>> t = Tokenizer()
>>> for token in t.tokenize('すもももももももものうち'):
...     print(token)
...
すもも 名詞,一般,*,*,*,*,すもも,スモモ,スモモ
も 助詞,係助詞,*,*,*,*,も,モ,モ
もも 名詞,一般,*,*,*,*,もも,モモ,モモ
も 助詞,係助詞,*,*,*,*,も,モ,モ
もも 名詞,一般,*,*,*,*,もも,モモ,モモ
の 助詞,連体化,*,*,*,*,の,ノ,ノ
うち 名詞,非自立,副詞可能,*,*,*,うち,ウチ,ウチ
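Each printed line consists of the surface form followed by a comma-separated feature string. As a rough sketch of how that output could be post-processed without Janome itself, the code below splits such lines into named fields. `ParsedToken` and `parse_token_line` are hypothetical helpers, not part of Janome, and the field labels assume the usual MeCab-IPADIC feature order (four part-of-speech levels, inflection type, inflection form, base form, reading, phonetic) with exactly nine features per line, as in the sample above.

```python
from typing import NamedTuple


class ParsedToken(NamedTuple):
    """Hypothetical container for one printed token line (not a Janome class)."""
    surface: str
    part_of_speech: str  # four comma-separated levels, e.g. "名詞,一般,*,*"
    infl_type: str
    infl_form: str
    base_form: str
    reading: str
    phonetic: str


def parse_token_line(line: str) -> ParsedToken:
    # The surface form and the feature string are separated by whitespace.
    surface, features = line.split(None, 1)
    fields = features.strip().split(",")
    if len(fields) != 9:
        # Assumption: every line carries nine features, as in the sample output.
        raise ValueError("expected 9 features, got %d: %r" % (len(fields), line))
    return ParsedToken(surface, ",".join(fields[:4]), *fields[4:])


lines = [
    "すもも 名詞,一般,*,*,*,*,すもも,スモモ,スモモ",
    "も 助詞,係助詞,*,*,*,*,も,モ,モ",
    "うち 名詞,非自立,副詞可能,*,*,*,うち,ウチ,ウチ",
]
tokens = [parse_token_line(line) for line in lines]

# Keep only the nouns (名詞) by checking the first part-of-speech level.
nouns = [t.surface for t in tokens if t.part_of_speech.startswith("名詞")]
print(nouns)  # ['すもも', 'うち']
```

This sketch does not handle surface forms that themselves contain whitespace or commas.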
License
Janome is licensed under the Apache License 2.0 and uses the MeCab-IPADIC dictionary/statistical model.
See LICENSE.txt and NOTICE.txt for license details.
Acknowledgement
Special thanks to @ikawaha, @takuyaa, @nakagami and @janome_oekaki.
Copyright
Copyright(C) 2015-2023, Tomoko Uchida. All rights reserved.
Download files
Source Distribution
Built Distribution
File details
Details for the file Janome-0.5.0.tar.gz.
File metadata
- Download URL: Janome-0.5.0.tar.gz
- Upload date:
- Size: 18.8 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.11.3
File hashes
Algorithm | Hash digest
---|---
SHA256 | ce4a3ed7a4635c2f80139639327d5b1e0381858ad74a3c4a61e8cc83f820400e
MD5 | 15b115aa752126fe4f003a52b151ffd9
BLAKE2b-256 | 084edc2b1a89a4ffafbf9bf49c8e11f69e28ba8b3ef81d11afe6e9f96caee6cc
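A downloaded file can be checked against these digests with Python's standard `hashlib` module. The following is a minimal sketch: `file_hexdigest` and `matches_published_digest` are hypothetical helper names, the local path `Janome-0.5.0.tar.gz` is an assumed download location, and BLAKE2b-256 is taken to mean BLAKE2b with a 32-byte digest.

```python
import hashlib
from pathlib import Path


def file_hexdigest(path, algorithm="sha256", chunk_size=1 << 20):
    """Return the hex digest of a file, reading it in chunks."""
    if algorithm == "blake2b-256":
        # BLAKE2b truncated to a 32-byte (256-bit) digest.
        hasher = hashlib.blake2b(digest_size=32)
    else:
        hasher = hashlib.new(algorithm)
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest()


def matches_published_digest(path, expected_hex, algorithm="sha256"):
    """Compare a file's digest with a published hex string, ignoring case."""
    return file_hexdigest(path, algorithm) == expected_hex.strip().lower()


# Assumed local download location; adjust to where the file was saved.
archive = Path("Janome-0.5.0.tar.gz")
published_sha256 = "ce4a3ed7a4635c2f80139639327d5b1e0381858ad74a3c4a61e8cc83f820400e"

if archive.exists():
    print("SHA256 match:", matches_published_digest(archive, published_sha256))
```

Reading in chunks keeps memory use small even though the archive is about 19 MB.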
File details
Details for the file Janome-0.5.0-py2.py3-none-any.whl.
File metadata
- Download URL: Janome-0.5.0-py2.py3-none-any.whl
- Upload date:
- Size: 19.7 MB
- Tags: Python 2, Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.11.3
File hashes
Algorithm | Hash digest
---|---
SHA256 | d098670394a77881ce2f6b7d696c0ea5ff74c0c8cf74a8a882159ec82c0e6dc7
MD5 | 2c03e9bdfef8963aa4536f8fc7d75414
BLAKE2b-256 | 737d70f4069f4bbf0fca023e82a1fbbade6f5216365d4fe259fee1950723eca5