Morphological analysis, dictionary lemmatization for Russian, German and English
Project description
Morphological analyzer for Russian, English and German languages.
This is a python wrapper for C++ morphology library that was created in 2000 (many years ago) for website aot.ru. The original library was written in C++, so we have to compile it and integrate with python using pybind11. The dictionaries are rather outdated, but I use them for some modern projects, that's why I decided to share the project with other people.
License is MIT.
The project is distributed in (sources)[https://github.com/sokirko74/pylem], you have to install cmake and modern C++ (17) compiler to your workstation.
Install
Normally you could install the project with
python -m pip install pylem
but if it does not work install it using the following steps:
- download and unpack sources
- run python setup.py install
Using the library
To search word mother
in the English dictionary one should
write the following code:
from pylem import MorphanHolder, MorphLanguage
h = MorphanHolder(MorphLanguage.English) #load dicts from disk
print(h.is_in_dictionary("mother"))
Other usages are covered by unittests
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Hashes for pylem-0.0.11.linux-x86_64.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | 0d83dc652e6fd5935274702e359a6d7410adb67c5bc4eb0bc75aaf4157318c42 |
|
MD5 | 0a5752fc79728cca004bb4e5b6b66c73 |
|
BLAKE2b-256 | d546a20007b9b6908e41a652b6992ed9c7de83e69055becc2d05a3d94f773fdb |