Skip to main content

A natural language processing toolkit designed for the Azerbaijani language.

Project description

NLTKaz Banner

NLTKAZ 📚

NLTKAZ is a natural language processing toolkit designed for the Azerbaijani language making it easier to preprocess Azerbaijani text for NLP tasks.

Installation ⬇️

pip install nltkaz

Features 🧩

Currently following features are provided with the version 0.1.1:

  • Stemming: Reduce words to their root forms.
  • Stopword Removal: Easily remove common Azerbaijani stopwords from text.

Usage ⚙️

Stemming

from azstemmer import AzStemmer

# Initialize stemmer with the appropriate keyboard type
# Use 'az' for Azerbaijani text or 'en' if the text is typed using an English keyboard
stemmer = AzStemmer(keyboard="az") 

# Stem your string
stemmed_string = stemmer.stem("your_string")

Stopword Removal

from nltkaz.stopwords import load, remove

# load stopwords using
stopwords = load()
# remove stopwords from the given string
result = remove(stopwords=stopwords, sentence="your_string")

Author 🧑‍💻

  • Nagi Nagiyev

Contact 📧

Gmail: nagiyevnagi01@gmail.com.

Linkedin: https://www.linkedin.com/in/naginagiyev/

License 📜

MIT License


This project is in early development. Contributions and feedback are welcome! 🤝

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

nltkaz-0.1.1.tar.gz (395.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

nltkaz-0.1.1-py3-none-any.whl (394.8 kB view details)

Uploaded Python 3

File details

Details for the file nltkaz-0.1.1.tar.gz.

File metadata

  • Download URL: nltkaz-0.1.1.tar.gz
  • Upload date:
  • Size: 395.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.18

File hashes

Hashes for nltkaz-0.1.1.tar.gz
Algorithm Hash digest
SHA256 8447445efe6fd9260f83712f7b09332e0dd7b2b697539659d1b4f7d969ab2618
MD5 eefe47e40b916eb777bc4fc4f992f758
BLAKE2b-256 ad626df5858c0cce9a43e9c9d410631a76a734e2b6a5b4163493b3ae4128b833

See more details on using hashes here.

File details

Details for the file nltkaz-0.1.1-py3-none-any.whl.

File metadata

  • Download URL: nltkaz-0.1.1-py3-none-any.whl
  • Upload date:
  • Size: 394.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.18

File hashes

Hashes for nltkaz-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 268575abbd626c6b5839892da54cec69bdac17dcbd60374fa342d7bfedd1a6dd
MD5 628e16a2fabf17b3753c009959ce7f66
BLAKE2b-256 2f663523cfb7b95611f260f5e726a57476954a8be506b481571f83500b779a97

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page