Skip to main content

Silabel is a python package to syllabelize indonesian word

Project description

Syllabelizer for Indonesian Language in Python

This is a rule-based syllabelizer for Indonesian Language. Please note that this project does not aim to replicate rules in Ejaan yang Disempurnakan (EYD), but focuses on a simple way to split indonesian words into common sub-words because it is good enough for my purposes.

Installation

pip install silabel

Example Usage

from silabel import Syllabelizer

s = Syllabelizer()
s.syllabelize("menggunakan") # ['meng', 'gu', 'na', 'kan']
s.syllabelize("memperkirakan") # ['mem', 'per', 'ki', 'ra', 'kan']

End notes :heart:

There is no reference used in this project. I simply create a set of rules based on indonesian vocal, consonant, and diftong to syllabelize indonesian words.

I made this repo for educational purposes so it might need further tweaking to reach production level.
Feel free to create an issue if you need help, and I hope I'll have the time to help you. Thank you.

:heart: from Indonesia

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

silabel-0.4.tar.gz (3.2 kB view hashes)

Uploaded Source

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page