Skip to main content

A NLP package for Portuguese Lemmatization.

Project description

This NLP package for Portuguese lemmatization is a powerful and advanced tool that can accurately transform words into their base forms or lemmas, taking into account the specific grammatical rules and variations of the Portuguese language. It is designed to handle various types of text input and supports multiple output formats, making it a versatile tool for applications such as information retrieval, machine translation, sentiment analysis, and text classification. Additionally, the package is customizable and user-friendly, allowing users to specify their own dictionaries and rules for lemmatization and providing features for error correction and word sense disambiguation. Whether you are a researcher, developer, or linguist working with Portuguese text data, this NLP package can help you save time and improve the accuracy and quality of your analyses. With its advanced algorithms and techniques in NLP, you can trust that this tool will provide high-quality results and make the lemmatization process more efficient.

A lemma is a word that stands at the head of a definition in a dictionary. Wikipedia

Example

from pt_lemmatizer import Lemmatizer

l = Lemmatizer()
l.lemmatize('apagou')  #all words must be unidecoded and lowercased
>> 'apagar'
l.lemmatize('nasalaram')
>> 'nasalar'



Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pt_lemmatizer-2.1.18.tar.gz (2.4 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pt_lemmatizer-2.1.18-py3-none-any.whl (2.5 MB view details)

Uploaded Python 3

File details

Details for the file pt_lemmatizer-2.1.18.tar.gz.

File metadata

  • Download URL: pt_lemmatizer-2.1.18.tar.gz
  • Upload date:
  • Size: 2.4 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.10.6

File hashes

Hashes for pt_lemmatizer-2.1.18.tar.gz
Algorithm Hash digest
SHA256 e498cc48ef4561f8344d7ec512bb1f049f825c7ea6eebcf63e6fe750b07a0c51
MD5 fd49e844c3d913db3bf4f857fa038e48
BLAKE2b-256 cb0726a7569d9e5d97f22fb551567d10daa97545a15538945c9892433ba3ab84

See more details on using hashes here.

File details

Details for the file pt_lemmatizer-2.1.18-py3-none-any.whl.

File metadata

  • Download URL: pt_lemmatizer-2.1.18-py3-none-any.whl
  • Upload date:
  • Size: 2.5 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.10.6

File hashes

Hashes for pt_lemmatizer-2.1.18-py3-none-any.whl
Algorithm Hash digest
SHA256 68583822696a6f6049df958d57bb51697be65f9188e69af4c23f513c86095e29
MD5 8f1eef9845a8e9cef91cf49bd901de5b
BLAKE2b-256 da02ef62c6ad2f2bee479f65814f0609667333f1b7d35702ab1c3f23f1882c9f

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page