A library for normalizing Indonesian text with an updated algorithm.

Nor Asro

Nor Asro is a Python library for normalizing Indonesian text. It is designed to help clean text by replacing slang words with their formal equivalents, removing noise, and stemming with Sastrawi.

Features

  • Normalizes slang words to their formal forms
  • Supports loading the slang-to-formal word list from an Excel file
  • Removes noise such as URLs, numbers, and punctuation
  • Stems words to obtain their root forms
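
The noise-removal feature can be illustrated with a small standalone sketch. This is not Nor Asro's implementation: the function name `remove_noise` and the regular expressions are assumptions chosen for illustration, using only the standard library.

```python
import re

# Hypothetical patterns, not taken from the nor_asro source.
URL_RE = re.compile(r"https?://\S+|www\.\S+")
NON_LETTER_RE = re.compile(r"[^a-z\s]")  # digits, punctuation, symbols
SPACE_RE = re.compile(r"\s+")


def remove_noise(text: str) -> str:
    """Lowercase the text, then drop URLs, digits, and punctuation."""
    text = text.lower()
    text = URL_RE.sub(" ", text)         # remove URLs before stripping symbols
    text = NON_LETTER_RE.sub(" ", text)  # keep only a-z and whitespace
    return SPACE_RE.sub(" ", text).strip()


print(remove_noise("Cek https://example.com sekarang, diskon 50%!!"))
# cek sekarang diskon
```

URLs are removed first so that their fragments do not survive as stray words once punctuation is stripped. The letter class is limited to ASCII a-z, which would need widening for text containing accented characters.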

Installation

pip install nor_asro


Usage

from nor_asro.normalizer import NorAsroNormalizer

# Initialize the normalizer with an Excel file
normalizer = NorAsroNormalizer("path_to_your_excel_file.xlsx")

# Example text
text = "Woww amin keberpa netaas met ya!"
result = normalizer.normalize_text(text)
print(result)  # Output: "wow amin keberapa menetas selamat ya"
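
To make the slang-replacement step in the example above more concrete, here is a minimal sketch of a dictionary lookup over tokens. It is an illustration only: the mapping `SLANG_TO_FORMAL` and the function `replace_slang` are hypothetical, the entries are inferred from the example text, and the real library loads its word list from the Excel file, whose layout is not described here. The sketch does not perform stemming.

```python
import re

# Hypothetical slang-to-formal mapping, inferred from the example above.
# In Nor Asro the word list comes from an Excel file instead.
SLANG_TO_FORMAL = {
    "woww": "wow",
    "keberpa": "keberapa",
    "netaas": "menetas",
    "met": "selamat",
}


def replace_slang(text: str, mapping: dict[str, str]) -> str:
    """Lowercase, split into alphabetic tokens, and swap known slang words."""
    tokens = re.findall(r"[a-z]+", text.lower())
    # Words missing from the mapping pass through unchanged.
    return " ".join(mapping.get(token, token) for token in tokens)


print(replace_slang("Woww amin keberpa netaas met ya!", SLANG_TO_FORMAL))
# wow amin keberapa menetas selamat ya
```

A plain dictionary keeps each lookup constant-time and makes the word list easy to extend, at the cost of matching only exact token spellings.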

Download files

Source Distribution

nor_asro-0.1.1.tar.gz (258.2 kB)

Uploaded Source

Built Distribution

nor_asro-0.1.1-py3-none-any.whl (257.0 kB)

Uploaded Python 3

File details

Details for the file nor_asro-0.1.1.tar.gz.

File metadata

  • Download URL: nor_asro-0.1.1.tar.gz
  • Upload date:
  • Size: 258.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.5

File hashes

Hashes for nor_asro-0.1.1.tar.gz
Algorithm Hash digest
SHA256 9f8b9f599d604a492c01e31ca8f7fcdc0c2aacb41092cc918903fad8ff212f57
MD5 c560138d2aa9e128fa78f1e8483187fb
BLAKE2b-256 b8e0566530c684e6e2ef29b9f040254e2e5d15f0f1f4d7ab4a7c85acd1d88642

File details

Details for the file nor_asro-0.1.1-py3-none-any.whl.

File metadata

  • Download URL: nor_asro-0.1.1-py3-none-any.whl
  • Upload date:
  • Size: 257.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.5

File hashes

Hashes for nor_asro-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 0ef989e02414882ff62298a71f484011fef705eee67d2bf44b43f06cd9650299
MD5 7c1b1399ea4bba7b0c74de68873a6c4f
BLAKE2b-256 acba5893a91d7c19c799b191467fc4fa359db41228e7be19d37d8c7e1bee4938
