Skip to main content

A natural language processing toolkit designed for the Azerbaijani language.

Project description

NLTKaz Banner

NLTKAZ 📚

NLTKAZ is a natural language processing toolkit designed for the Azerbaijani language making it easier to preprocess Azerbaijani text for NLP tasks.

Installation ⬇️

pip install nltkaz

Features 🧩

Currently following features are provided with the version 0.1.0:

  • Stemming: Reduce words to their root forms.
  • Stopword Removal: Easily remove common Azerbaijani stopwords from text.

Usage ⚙️

Stemming

from azstemmer import AzStemmer

# Initialize stemmer with the appropriate keyboard type
# Use 'az' for Azerbaijani text or 'en' if the text is typed using an English keyboard
stemmer = AzStemmer(keyboard="az") 

# Stem your string
stemmed_string = stemmer.stem("your_string")

Stopword Removal

from nltkaz.stopwords import load, remove

# load stopwords using
stopwords = load()
# remove stopwords from the given string
result = remove(stopwords=stopwords, sentence="your_string")

Author 🧑‍💻

  • Nagi Nagiyev

Contact 📧

Gmail: nagiyevnagi01@gmail.com.

Linkedin: https://www.linkedin.com/in/naginagiyev/

License 📜

MIT License


This project is in early development. Contributions and feedback are welcome! 🤝

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

nltkaz-0.1.0.tar.gz (415.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

nltkaz-0.1.0-py3-none-any.whl (394.8 kB view details)

Uploaded Python 3

File details

Details for the file nltkaz-0.1.0.tar.gz.

File metadata

  • Download URL: nltkaz-0.1.0.tar.gz
  • Upload date:
  • Size: 415.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.18

File hashes

Hashes for nltkaz-0.1.0.tar.gz
Algorithm Hash digest
SHA256 7b27bc4cb6e20ac1413a5faf0a7298b404713d801aae8c1e81f132da8010dd44
MD5 01f3983b784e6ae9d99d01976f704444
BLAKE2b-256 f34038d9bf62b874be636e12ca97545e443e35b60ad34b0f7019cf8c9f7cf070

See more details on using hashes here.

File details

Details for the file nltkaz-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: nltkaz-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 394.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.18

File hashes

Hashes for nltkaz-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 8da83b6a186b402e80eae37c702a89aef9b0203de2ad41d154f9c98893595533
MD5 cc98d72a32c494f8629c2e9b622d6909
BLAKE2b-256 76d00e720e3e27110c61800a555b9ccea799c4895b8f383733e59d3b23433dd7

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page