Library for stemming Indonesian (Bahasa) text
Project description
Sastrawi is a simple Python library that reduces inflected words in
the Indonesian language (Bahasa Indonesia) to their base form (stem).
This is a Python port of the original Sastrawi project, which is
written in PHP.
Installation
Sastrawi can be installed via pip by running the following command in a terminal or command prompt:
pip install Sastrawi
Example Usage
Run the following commands in a Python interactive terminal:
# import Sastrawi package
from Sastrawi.Stemmer.StemmerFactory import StemmerFactory
# create stemmer
factory = StemmerFactory()
stemmer = factory.create_stemmer()
# stem
sentence = 'Perekonomian Indonesia sedang dalam pertumbuhan yang membanggakan'
output = stemmer.stem(sentence)
print(output)
# ekonomi indonesia sedang dalam tumbuh yang bangga
print(stemmer.stem('Mereka meniru-nirukannya'))
# mereka tiru
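Because `stem` returns a plain string, its output can be passed on to ordinary text processing. The following is a minimal sketch, not part of Sastrawi itself: `count_stems` is a hypothetical helper name, and it assumes that the stems in the returned string are separated by whitespace, as in the example output above.

```python
from collections import Counter


def count_stems(stemmer, sentences):
    """Count how often each stem occurs across the given sentences.

    `stemmer` is any object with a `stem(text)` method that returns a
    string, such as the one returned by `factory.create_stemmer()`.
    """
    counts = Counter()
    for sentence in sentences:
        # Assumption: stems in the output are separated by whitespace,
        # as in the example output shown earlier.
        counts.update(stemmer.stem(sentence).split())
    return counts
```

With the stemmer created above, `count_stems(stemmer, [sentence])` would return a `Counter` keyed by stem. Creating the stemmer once and reusing it for every sentence avoids repeating the factory setup.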
Demo
Live demo: https://pysastrawi-demo.appspot.com/
Repository: https://github.com/har07/pystastrawi-demo
Download files
Source Distribution
Sastrawi-1.0.1.zip (211.7 kB)
Built Distribution
Sastrawi-1.0.1-py2.py3-none-any.whl (209.7 kB)
File details
Details for the file Sastrawi-1.0.1.zip.
File metadata
- Download URL: Sastrawi-1.0.1.zip
- Upload date:
- Size: 211.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
File hashes
Algorithm | Hash digest
---|---
SHA256 | 62badb523e6e273f92eb6d53f5f3d4c0aa9ffcf1ceba69ab3dba5b7a0da02c56
MD5 | a300c38adf8ab4ede1e28c4b066fce34
BLAKE2b-256 | 64e8abff41a085ec33b1a55f2065e0ebe1d6c9b6635559a94b3774c8c81e88aa
File details
Details for the file Sastrawi-1.0.1-py2.py3-none-any.whl.
File metadata
- Download URL: Sastrawi-1.0.1-py2.py3-none-any.whl
- Upload date:
- Size: 209.7 kB
- Tags: Python 2, Python 3
- Uploaded using Trusted Publishing? No
File hashes
Algorithm | Hash digest
---|---
SHA256 | ac3d1f442ae8f33fd0f59387eb33a1a5121241f028c318c60820f26bd032d93b
MD5 | 631ec7a9feedf8d1ecd49b5b26e19dd0
BLAKE2b-256 | 6f4bbab676953da3103003730b8fcdfadbdd20f333d4add10af949dd5c51e6ed
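
The SHA256 digests listed for each file can be compared against a manually downloaded copy. The following is a minimal sketch using only Python's standard `hashlib` module. The helper names `sha256_of_file` and `matches_expected` are illustrative and not part of Sastrawi or pip, and the file path passed in is assumed to point at a local download.

```python
import hashlib


def sha256_of_file(path, chunk_size=65536):
    """Return the hex SHA256 digest of the file at `path`."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        # Read in fixed-size chunks so large files need not fit in memory.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected_hex):
    """Compare a file's SHA256 digest with an expected hex string."""
    # Normalise the expected value so stray whitespace or upper-case
    # hex digits do not cause a false mismatch.
    return sha256_of_file(path) == expected_hex.strip().lower()
```

For a downloaded `Sastrawi-1.0.1.zip`, calling `matches_expected` with the file's path and the SHA256 value from the corresponding table should return `True` if the file is intact.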