Skip to main content

The Uzbek Natural Language Toolkit (NLTK) is a Python package for natural language processing.

Project description

uznltk

uznltk — bu O‘zbek tiliga mo‘ljallangan yengil va qulay NLP (Natural Language Processing) kutubxonadir. Matn tozalash, morfologik tahlil, raqam va matn konversiyalari, bo‘g‘inlarga ajratish va boshqa ko‘plab funksiyalarni o‘z ichiga oladi.

🔗 Havolalar

👤 Mualliflar

🔧 O‘rnatish

pip install uznltk

🚀 Foydalanish

from uznltk import *

📚 Funksiyalar

clean_text(text)

O‘zbek tiliga xos belgilarni (g‘, o‘, ’) to‘g‘rilaydi.

clean_text("O'zbekistonda ta'lim kuchli rivojlanmoqda")
# Natija: "O‘zbekistonda ta’lim kuchli rivojlanmoqda"

solid_sign(text)

(’)-belgisiga ega so‘zlarni ro‘yxat ko‘rinishida qaytaradi.

solid_sign("ta'lim bo'lishi oldindan ma'lum edi")
# Natija: ['ta’lim', 'ma’lum']

lemmatize(text) va stem_word(text)

So‘z ildizini aniqlaydi.

lemmatize("mexanizatorlashtirilganlardan")
# Natija: "mexanizatorlashtirilgan"

number_to_text(number)

Raqamni o‘zbek tilidagi matnga aylantiradi.

number_to_text(54)
# Natija: "ellik to‘rt"

text_to_number(text)

Matndagi raqamni son shakliga o‘tkazadi.

text_to_number("yetmish olti")
# Natija: 76

download(name)

Turli resurslarni yuklab oladi (masalan: kitoblar, yangiliklar).

download("book")

clean_stopword(text)

Matndan stop-so‘zlarni olib tashlaydi.

clean_stopword("salom dunyo, biz sen va u bilan bugun maktabga bordik")
# Natija: "salom dunyo, bugun maktabga bordik"

syllables(text)

So‘zlarni bo‘g‘inlarga ajratadi.

syllables("Bizga ma’lum ishlar yuz bermoqda!")
# Natija: ['Biz-ga', 'ma’-lum', 'ish-lar', 'yuz', 'ber-moq-da!']

hyphenation(text)

Har bir so‘zni bo‘g‘inlarga ajratib ro‘yxat shaklida beradi.

hyphenation("salom dunyo")
# Natija: ['sa-lom dunyo', 'salom dun-yo']

count_syllable(text)

Matndagi bo‘g‘inlar sonini hisoblaydi.

count_syllable("Salom Dunyo")
# Natija: 4

count_text(text)

Matndagi so‘zlar sonini hisoblaydi.

count_text("Salom Dunyo")
# Natija: 2

split_sentences(text)

Matndagi gaplarni ro‘yxatga ajratadi.

split_sentences("Salom Dunyo. Bugun ob-havo qisman bulutli")
# Natija: ['Salom Dunyo', 'Bugun ob-havo qisman bulutli']

split_words(text)

Matndagi faqat so‘zlarni (IP, email, emoji, URLlarsiz) ro‘yxatga ajratadi.

split_words("sen 192.168.1.18 va helloworld@example.com elektron manzilidasan. Manba https://pypi.org")
# Natija: ['sen', 'va', 'elektron', 'manzilidasan', 'Manba']

💡 Foydali ma’lumot

  • Kutubxona butunlay O‘zbek tili uchun mo‘ljallangan.
  • Raqamlar bilan ishlash, lemmatizatsiya va bo‘g‘inlash kabi asosiy NLP komponentlarini o‘z ichiga oladi.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

uznltk-0.0.12.tar.gz (9.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

uznltk-0.0.12-py3-none-any.whl (9.2 kB view details)

Uploaded Python 3

File details

Details for the file uznltk-0.0.12.tar.gz.

File metadata

  • Download URL: uznltk-0.0.12.tar.gz
  • Upload date:
  • Size: 9.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.10

File hashes

Hashes for uznltk-0.0.12.tar.gz
Algorithm Hash digest
SHA256 318d2cdbabc3c10d588d20f4bfa3b226727759ea8a78931d47a91df48071c3db
MD5 e71cac7ba2e51bcfcc4989ed6c000342
BLAKE2b-256 73e5a0d598705980f3e0c4f9afd3051295832e6363030cef9880a50ac80a0988

See more details on using hashes here.

File details

Details for the file uznltk-0.0.12-py3-none-any.whl.

File metadata

  • Download URL: uznltk-0.0.12-py3-none-any.whl
  • Upload date:
  • Size: 9.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.10

File hashes

Hashes for uznltk-0.0.12-py3-none-any.whl
Algorithm Hash digest
SHA256 c6cbebf958eb2b4322212c1ba496e6d7eac58684450e3c81aae5fe38e1ed45f6
MD5 4b058672ee3f5b4805ea0e87b4487aeb
BLAKE2b-256 cdf1d95c194e783beb1f15e067beaf8a92dfa14da2beaeca23ae040ec0d37ee9

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page