A natural language processing toolkit designed for the Azerbaijani language.
NLTKAZ 📚
NLTKAZ is a natural language processing toolkit for the Azerbaijani language. It makes it easier to preprocess Azerbaijani text for NLP tasks.
Installation ⬇️
pip install nltkaz
Features 🧩
Version 0.1.1 currently provides the following features:
- Stemming: Reduce words to their root forms.
- Stopword Removal: Easily remove common Azerbaijani stopwords from text.
Usage ⚙️
Stemming
from azstemmer import AzStemmer
# Initialize stemmer with the appropriate keyboard type
# Use 'az' for Azerbaijani text or 'en' if the text is typed using an English keyboard
stemmer = AzStemmer(keyboard="az")
# Stem your string
stemmed_string = stemmer.stem("your_string")
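To give a rough idea of what "reducing words to their root forms" means, here is a minimal sketch of naive suffix stripping. It is not the algorithm used by `AzStemmer`; the `strip_plural` function and the two-entry `PLURAL_SUFFIXES` list are hypothetical and only handle the Azerbaijani plural markers.

```python
# Illustrative only: NOT the nltkaz / AzStemmer implementation.
# Hypothetical, deliberately tiny suffix list (plural markers only).
PLURAL_SUFFIXES = ("lar", "lər")


def strip_plural(word, min_root_len=2):
    """Remove a trailing plural suffix if the remaining root is long enough."""
    for suffix in PLURAL_SUFFIXES:
        root_len = len(word) - len(suffix)
        if word.endswith(suffix) and root_len >= min_root_len:
            return word[:root_len]
    # No known suffix matched, so return the word unchanged.
    return word


print([strip_plural(w) for w in ["kitablar", "evlər", "şəhər"]])
# ['kitab', 'ev', 'şəhər']
```

A rule this simple also strips endings that only look like suffixes (for example, it would turn the loanword "dollar" into "dol"), which is one reason to prefer a dedicated stemmer over hand-written rules.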
Stopword Removal
from nltkaz.stopwords import load, remove
# Load the stopword list
stopwords = load()
# Remove stopwords from the given string
result = remove(stopwords=stopwords, sentence="your_string")
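For readers who want to see the idea behind stopword removal without installing anything, the following is a minimal sketch. It does not reproduce how `nltkaz.stopwords.remove` works internally; `SAMPLE_STOPWORDS` is a small hand-picked set for illustration, not the list shipped with the package, and `remove_stopwords` is a hypothetical helper.

```python
# Illustrative only: a tiny hand-picked set, NOT the list returned by nltkaz's load().
SAMPLE_STOPWORDS = {"və", "bu", "ki", "ilə", "üçün"}


def remove_stopwords(sentence, stopwords):
    """Drop whitespace-separated tokens that appear in `stopwords`."""
    kept = [tok for tok in sentence.split() if tok.lower() not in stopwords]
    return " ".join(kept)


print(remove_stopwords("Bu kitab və dəftər mənim üçün vacibdir", SAMPLE_STOPWORDS))
# kitab dəftər mənim vacibdir
```

Note that Python's `str.lower()` is not locale-aware, so the Azerbaijani letters "I" and "İ" are not lowercased to "ı" and "i" as a native speaker would expect. A real implementation would likely need to handle that case explicitly.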
Author 🧑‍💻
- Nagi Nagiyev
Contact 📧
Gmail: nagiyevnagi01@gmail.com
LinkedIn: https://www.linkedin.com/in/naginagiyev/
License 📜
MIT License
This project is in early development. Contributions and feedback are welcome! 🤝
File details
Details for the file nltkaz-0.1.1.tar.gz.
File metadata
- Download URL: nltkaz-0.1.1.tar.gz
- Upload date:
- Size: 395.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.10.18
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 8447445efe6fd9260f83712f7b09332e0dd7b2b697539659d1b4f7d969ab2618 |
| MD5 | eefe47e40b916eb777bc4fc4f992f758 |
| BLAKE2b-256 | ad626df5858c0cce9a43e9c9d410631a76a734e2b6a5b4163493b3ae4128b833 |
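If you download a file manually, you can compare its SHA256 digest with the published value. The sketch below uses only the standard library; the helper names `sha256_of_file` and `matches_published_hash` are hypothetical, and the usage comment assumes the archive was saved to the current directory.

```python
import hashlib


def sha256_of_file(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path, expected_hex):
    """Compare a file's digest with a published hex digest, ignoring case and whitespace."""
    return sha256_of_file(path) == expected_hex.strip().lower()


# Example usage (assumes the file is in the current directory):
# matches_published_hash(
#     "nltkaz-0.1.1.tar.gz",
#     "8447445efe6fd9260f83712f7b09332e0dd7b2b697539659d1b4f7d969ab2618",
# )
```

pip can perform the same check during installation when a requirements file pins hashes and `--require-hashes` is used.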
File details
Details for the file nltkaz-0.1.1-py3-none-any.whl.
File metadata
- Download URL: nltkaz-0.1.1-py3-none-any.whl
- Upload date:
- Size: 394.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.10.18
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 268575abbd626c6b5839892da54cec69bdac17dcbd60374fa342d7bfedd1a6dd |
| MD5 | 628e16a2fabf17b3753c009959ce7f66 |
| BLAKE2b-256 | 2f663523cfb7b95611f260f5e726a57476954a8be506b481571f83500b779a97 |