Skip to main content

A pure-python implementation of the Voikko library that provides Finnish morphological analysis

Project description

PyVoikko

PyVoikko is an implementation of Voikko, a free and open source morphological analyzer for Finnish. It aims to reimplement the Voikko library in pure Python, so that it can be used in environments in which native libraries like libvoikko cannot be installed. It contains a pre-built version of the voikko-fi transducer and uses the KFST library for lookups.

It is in early stages of development. The only thing implemented right now is a parser for the analysis strings produced by the Voikko transducer. As the analysis strings are quite complex, it is very likely that the parser does not yet parse all of them correctly. Please report bugs if you find them!

Other features of the Voikko library, such as spell checking and tokenising, are not implemented yet.

Installation

PyVoikko is available on PyPI and can be installed with pip:

pip install pyvoikko

Usage

import pyvoikko

print(pyvoikko.analyse("kissa"))

License

PyVoikko is licensed under the GNU GPL version 3 or later. See the LICENSE file for details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pyvoikko-0.5.tar.gz (1.1 MB view hashes)

Uploaded Source

Built Distribution

pyvoikko-0.5-py3-none-any.whl (1.1 MB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page