
Python morphological analyzer and lemmatizer for Turkish


Zeyrek is a partial port of the Zemberek library to Python for lemmatizing and analyzing Turkish words. It is in the alpha stage, and the API will probably change.

Basic Usage

To use Zeyrek, first create an instance of the MorphAnalyzer class:

>>> import zeyrek
>>> analyzer = zeyrek.MorphAnalyzer()

Then, you can call its analyze method on words or texts to get all possible analyses:

>>> print(analyzer.analyze('benim'))
Parse(word='benim', lemma='ben', pos='Noun', morphemes=['Noun', 'A3sg', 'P1sg'], formatted='[ben:Noun] ben:Noun+A3sg+im:P1sg')
Parse(word='benim', lemma='ben', pos='Pron', morphemes=['Pron', 'A1sg', 'Gen'], formatted='[ben:Pron,Pers] ben:Pron+A1sg+im:Gen')
Parse(word='benim', lemma='ben', pos='Verb', morphemes=['Noun', 'A3sg', 'Zero', 'Verb', 'Pres', 'A1sg'], formatted='[ben:Noun] ben:Noun+A3sg|Zero→Verb+Pres+im:A1sg')
Parse(word='benim', lemma='ben', pos='Verb', morphemes=['Pron', 'A1sg', 'Zero', 'Verb', 'Pres', 'A1sg'], formatted='[ben:Pron,Pers] ben:Pron+A1sg|Zero→Verb+Pres+im:A1sg')
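
Because a single word can have several analyses, it often helps to group them by part of speech. The following is a minimal sketch, not part of Zeyrek itself: group_by_pos is a hypothetical helper, and the Parse namedtuple is only a stand-in with the same field names as the objects printed above, filled with two of those analyses by hand so the example runs without the library installed.

```python
from collections import defaultdict, namedtuple

# Stand-in with the field names shown in the output above (an assumption;
# the real objects are produced by zeyrek).
Parse = namedtuple("Parse", ["word", "lemma", "pos", "morphemes", "formatted"])


def group_by_pos(parses):
    """Map each part-of-speech tag to the formatted analyses that carry it."""
    grouped = defaultdict(list)
    for parse in parses:
        grouped[parse.pos].append(parse.formatted)
    return dict(grouped)


# Two analyses of 'benim', copied by hand from the output above.
sample = [
    Parse("benim", "ben", "Noun", ["Noun", "A3sg", "P1sg"],
          "[ben:Noun] ben:Noun+A3sg+im:P1sg"),
    Parse("benim", "ben", "Pron", ["Pron", "A1sg", "Gen"],
          "[ben:Pron,Pers] ben:Pron+A1sg+im:Gen"),
]

by_pos = group_by_pos(sample)
for pos, analyses in by_pos.items():
    print(pos, analyses)
```

With real analyzer output, the same helper should work on any iterable of objects that expose pos and formatted attributes.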

If you only need the base forms of words (their lemmas), you can call lemmatize. It returns a list of tuples, each containing the word itself and a list of its possible lemmas:

>>> print(analyzer.lemmatize('benim'))
[('benim', ['ben'])]
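
The list-of-tuples shape is easy to post-process. As a rough sketch, the hypothetical helper unique_lemmas below (not part of Zeyrek) flattens such a result into a list of distinct lemmas in first-seen order. The input is written by hand in the shape shown above, so the example does not need the library installed.

```python
def unique_lemmas(lemmatized):
    """Collect distinct lemmas, in first-seen order, from (word, [lemmas]) pairs."""
    seen = []
    for _word, lemmas in lemmatized:
        for lemma in lemmas:
            if lemma not in seen:
                seen.append(lemma)
    return seen


# Hand-written sample in the documented shape; the entry is repeated
# only to show that duplicates are dropped.
result = [('benim', ['ben']), ('benim', ['ben'])]
lemmas = unique_lemmas(result)
print(lemmas)
```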

Credits

This package is a Python port of part of the Zemberek package by Ahmet A. Akın.

This package was created with Cookiecutter and the audreyr/cookiecutter-pypackage project template.
