Education toolkit for Bahasa Indonesia NLP

This repository is the PyPI version intended for production use. It is maintained for accuracy purposes. For example code, experimental or in-development features, and tutorial materials, use Elangdev instead.

Elang is an acronym that combines the phrases Education (E) and Language Understanding (Lang). It is an education-centric toolkit that demonstrates the ideas behind many Natural Language Processing strategies in commercial use today.

Elang also means "eagle" in Bahasa Indonesia, and the elang Jawa (Javan hawk-eagle) is the national bird of Indonesia, more commonly referred to as Garuda.

The package provides a collection of utility functions and tools that interface with gensim and scikit-learn, as well as curated negative lists for Bahasa Indonesia (kata kasar / vulgar words, stopwords, etc.) and useful preprocessing functions.
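
As a rough illustration of what filtering text against a negative list involves, here is a minimal sketch in plain Python. The tiny stopword set and the names `tokenize` and `remove_listed_words` are hypothetical stand-ins chosen for this example; they are not elang's API or its curated lists.

```python
import re

# Hypothetical, tiny stand-in for a curated list (not elang's actual data).
STOPWORDS = {"yang", "dan", "di", "ke", "dari", "ini", "itu"}


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def remove_listed_words(text, blocked=STOPWORDS):
    """Return the tokens of `text` that are not in the `blocked` set."""
    return [tok for tok in tokenize(text) if tok not in blocked]


tokens = remove_listed_words("Burung elang itu terbang dari gunung ke laut")
print(tokens)
```

The same function could be pointed at a list of vulgar words by passing a different set as `blocked`.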

Quick Demo

Install elang:

pip install elang

Visualize word embeddings in 4 lines of code:

from elang.plot.utils import plot2d
from gensim.models import Word2Vec

model = Word2Vec.load("path.to.model")
plot2d(model)
# output: [figure: 2D plot of the word embeddings]

It even looks like a soaring eagle with its outstretched wings!
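
To give a sense of what a 2D embedding plot involves, the sketch below projects high-dimensional vectors onto their first two principal components using NumPy. PCA is used here purely for illustration: how `plot2d` reduces dimensions internally is not stated above, so this should not be read as its implementation. The function `project_2d` and the random stand-in vectors are hypothetical.

```python
import numpy as np


def project_2d(vectors):
    """Project row vectors onto their first two principal components."""
    centered = vectors - vectors.mean(axis=0)
    # Rows of vt are the principal directions, ordered by variance explained.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return centered @ vt[:2].T


# Random stand-in for word vectors: 5 "words", 50 dimensions each.
rng = np.random.default_rng(0)
words = ["burung", "elang", "gunung", "laut", "langit"]
vectors = rng.normal(size=(len(words), 50))

coords = project_2d(vectors)
for word, (x, y) in zip(words, coords):
    print(f"{word}: ({x:.2f}, {y:.2f})")
```

With a trained gensim model, the rows of `vectors` could instead be looked up per word via `model.wv[word]`, and the resulting coordinates passed to any plotting library.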

Download files

Files for elang, version 0.0.5:

elang-0.0.5-py3-none-any.whl (342.6 kB), file type: Wheel, Python version: py3
elang-0.0.5.tar.gz (335.9 kB), file type: Source, Python version: None