Library untuk normalisasi teks bahasa Indonesia dengan algoritma pembaruan.
Project description
Nor Asro
Nor Asro adalah library Python untuk normalisasi teks dalam bahasa Indonesia. Library ini dirancang untuk membantu membersihkan teks dengan cara mengganti kata slang menjadi formal, menghapus noise, dan melakukan stemming menggunakan Sastrawi.
Fitur
- Normalisasi kata slang ke kata formal
- Mendukung file Excel untuk memuat daftar slang-formal
- Penghapusan noise seperti URL, angka, dan tanda baca
- Stemming untuk mendapatkan kata dasar
Instalasi
pip install nor_asro
from nor_asro.normalizer import NorAsroNormalizer
# Inisialisasi normalizer dengan file Excel
normalizer = NorAsroNormalizer("path_to_your_excel_file.xlsx")
# Contoh teks
text = "Woww amin keberpa netaas met ya!"
result = normalizer.normalize_text(text)
print(result) # Output: "wow amin keberapa menetas selamat ya"
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file nor_asro-0.1.1.tar.gz.
File metadata
- Download URL: nor_asro-0.1.1.tar.gz
- Upload date:
- Size: 258.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.12.5
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
9f8b9f599d604a492c01e31ca8f7fcdc0c2aacb41092cc918903fad8ff212f57
|
|
| MD5 |
c560138d2aa9e128fa78f1e8483187fb
|
|
| BLAKE2b-256 |
b8e0566530c684e6e2ef29b9f040254e2e5d15f0f1f4d7ab4a7c85acd1d88642
|
File details
Details for the file nor_asro-0.1.1-py3-none-any.whl.
File metadata
- Download URL: nor_asro-0.1.1-py3-none-any.whl
- Upload date:
- Size: 257.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.12.5
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
0ef989e02414882ff62298a71f484011fef705eee67d2bf44b43f06cd9650299
|
|
| MD5 |
7c1b1399ea4bba7b0c74de68873a6c4f
|
|
| BLAKE2b-256 |
acba5893a91d7c19c799b191467fc4fa359db41228e7be19d37d8c7e1bee4938
|