Silabel is a Python package that syllabelizes Indonesian words

Project description

Syllabelizer for Indonesian Language in Python

This is a rule-based syllabelizer for the Indonesian language. Please note that this project does not aim to replicate the rules in Ejaan yang Disempurnakan (EYD).

Installation :zap:

pip install silabel

Example Usage :robot:

from silabel import Syllabelizer

s = Syllabelizer()
s.syllabelize("menggunakan") # ['meng', 'gu', 'na', 'kan']
s.syllabelize("memperkirakan") # ['mem', 'per', 'ki', 'ra', 'kan']

Quick Check :mag:

Here are some sample results to help you check whether this package suits your needs:

BSD                       : b-s-d
SMP                       : s-m-p
main                      : ma-in
april                     : ap-ril
swasta                    : swas-ta
instan                    : in-stan
dengan                    : de-ngan
pandai                    : pan-dai
makhluk                   : makh-luk
saudara                   : sau-da-ra
menyapu                   : me-nya-pu
etiopia                   : e-ti-o-pi-a
masyhur                   : masy-hur
biografi                  : bi-o-gra-fi
instrumen                 : in-stru-men
pengarang                 : pe-nga-rang
reboisasi                 : re-boi-sa-si
musyawarah                : mu-sya-wa-rah
dramatisasi               : dra-ma-ti-sa-si
memproklamasikan          : mem-pro-kla-ma-si-kan
berkesinambungan          : ber-ke-si-nam-bu-ngan
mempertanggungjawabkan    : mem-per-tang-gung-ja-wab-kan
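
A listing in this layout can be produced with a small formatting helper. The sketch below is only an assumption about how such rows might be printed, not code from the package: `format_row` and its `width` parameter are hypothetical names, the width of 26 is read off the listing above, and the syllable lists are copied from the Example Usage comments rather than computed here.

```python
def format_row(word, syllables, width=26):
    """Left-align the word in a fixed-width column, then join syllables with '-'."""
    return f"{word:<{width}}: {'-'.join(syllables)}"


# Syllable lists copied from the Example Usage section (hard-coded for illustration).
rows = [
    ("menggunakan", ["meng", "gu", "na", "kan"]),
    ("memperkirakan", ["mem", "per", "ki", "ra", "kan"]),
]

for word, syllables in rows:
    print(format_row(word, syllables))
```

With the package installed, `s.syllabelize(word)` would supply each list in place of the hard-coded values.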

Known Issue :persevere:

Some words containing a vowel pair that looks like a diphthong are not syllabelized correctly. For example, reboisasi comes out as re-boi-sa-si because "oi" is treated as a diphthong. This is good enough for my use case, so I prefer to leave it as is.
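
If this limitation matters for your data, one possible workaround is to flag words that contain a diphthong-like vowel pair and review their output by hand. The sketch below is an assumption, not part of the package: `diphthong_candidates` is a hypothetical helper, and the pair list covers only "ai", "au", and "oi".

```python
# Vowel pairs that may or may not be true diphthongs in a given word (assumed list).
DIPHTHONG_CANDIDATES = ("ai", "au", "oi")


def diphthong_candidates(word):
    """Return the candidate vowel pairs that occur anywhere in the word."""
    lowered = word.lower()
    return [pair for pair in DIPHTHONG_CANDIDATES if pair in lowered]


# Words taken from the Quick Check list.
words = ["reboisasi", "pandai", "saudara", "main", "dengan"]

# Keep only the words whose syllabelized output deserves a manual look.
to_review = [w for w in words if diphthong_candidates(w)]
print(to_review)
```

This check only looks at spelling, so it will also flag words whose output is already fine.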

End notes :heart:

No reference was used in this project. I simply created a set of rules based on Indonesian vowels, consonants, and diphthongs to syllabelize Indonesian words.
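
To give a feel for what a rule set of this kind can look like, here is a minimal sketch. It is not the package's implementation: `tokenize` and `naive_syllabify` are hypothetical names, and the rules are a deliberately simplified assumption (consonant digraphs "ng", "ny", "sy", "kh" count as one unit, a single consonant between vowels starts the next syllable, and in a longer cluster the first consonant closes the previous syllable). It ignores diphthongs and onset clusters such as "gr", so it gives pan-da-i and bi-og-ra-fi where the package gives pan-dai and bi-o-gra-fi.

```python
VOWELS = set("aiueo")
DIGRAPHS = {"ng", "ny", "sy", "kh"}  # consonant pairs treated as a single unit


def tokenize(word):
    """Split a word into units, keeping consonant digraphs together."""
    units, i = [], 0
    while i < len(word):
        step = 2 if word[i:i + 2] in DIGRAPHS else 1
        units.append(word[i:i + step])
        i += step
    return units


def naive_syllabify(word):
    """Simplified rule-based split; see the assumptions listed above."""
    units = tokenize(word.lower())
    nuclei = [i for i, u in enumerate(units) if u in VOWELS]
    if not nuclei:
        # No vowel at all (e.g. an abbreviation): every unit stands alone.
        return units
    starts = [0]
    for prev, nxt in zip(nuclei, nuclei[1:]):
        gap = nxt - prev - 1  # number of consonant units between two vowels
        if gap <= 1:
            # V|V or V|CV: the lone consonant, if any, opens the next syllable.
            starts.append(nxt - gap)
        else:
            # VC|C...V: the first consonant closes the previous syllable.
            starts.append(prev + 2)
    ends = starts[1:] + [len(units)]
    return ["".join(units[s:e]) for s, e in zip(starts, ends)]


print(naive_syllabify("menggunakan"))
print(naive_syllabify("dengan"))
```

Even this small rule set agrees with the package on words such as menggunakan, dengan, main, and BSD, which suggests how much a handful of vowel and consonant rules can cover before diphthongs and clusters need special handling.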

I made this repo for educational purposes, so it might need further tweaking to reach production level.
Feel free to create an issue if you need help, and I will try to find the time to help you. Thank you.

:heart: from Indonesia


