A high-performance library to filter Banglish stopwords from text.

Project description

Banglish Stopwords

A high-performance, lightweight Python library for filtering Banglish stopwords, where Banglish means Bengali written in Latin (English) script. It helps clean text data for NLP, sentiment analysis, and machine learning projects.

Features

  • Comprehensive Dataset: Includes 350+ Bengali words and their popular Banglish variations.
  • Lazy Typing Support: Handles repeated characters automatically (e.g., naaaa -> na, hbeee -> hbe).
  • High Performance: Stores stopwords in sets, so each lookup takes O(1) time on average.
  • Punctuation Aware: Cleans text while respecting common punctuation.
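The features above can be illustrated with a minimal sketch. It is not the library's actual API or implementation: the names SAMPLE_STOPWORDS, squeeze_repeats, and remove_stopwords are hypothetical, and the tiny word list stands in for the real dataset. It assumes repeated characters are handled by collapsing each run to a single character, which matches the naaaa -> na and hbeee -> hbe examples.

```python
import re
import string

# Hypothetical sample list; the real package ships a much larger dataset.
SAMPLE_STOPWORDS = {"ami", "tumi", "na", "hbe", "ar", "kintu"}

# Matches any character followed by one or more copies of itself.
_REPEAT = re.compile(r"(.)\1+")


def squeeze_repeats(token: str) -> str:
    """Collapse each run of a repeated character to one character."""
    return _REPEAT.sub(r"\1", token)


def remove_stopwords(text: str, stopwords=SAMPLE_STOPWORDS) -> str:
    """Drop stopwords from whitespace-separated text (illustrative only)."""
    kept = []
    for raw in text.split():
        # Ignore surrounding punctuation and case when matching.
        core = raw.strip(string.punctuation).lower()
        # Try the exact form first, then the squeezed form, so that
        # legitimate double letters are only collapsed as a fallback.
        if core in stopwords or squeeze_repeats(core) in stopwords:
            continue
        kept.append(raw)
    return " ".join(kept)


print(remove_stopwords("ami naaaa, bhalo achi!"))
```

Membership tests against a Python set are hash lookups, which is where the average O(1) cost per token comes from. Collapsing runs can also merge words that genuinely contain double letters, so a real implementation may need a more careful rule than this sketch uses.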

Installation

You can install the library using pip:

pip install banglish-stopwords

Download files

Source Distribution

banglish_stopwords-0.1.0.tar.gz (6.0 kB)

Uploaded Source

Built Distribution

banglish_stopwords-0.1.0-py3-none-any.whl (6.4 kB)

Uploaded Python 3

File details

Details for the file banglish_stopwords-0.1.0.tar.gz.

File metadata

  • Download URL: banglish_stopwords-0.1.0.tar.gz
  • Size: 6.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.14

File hashes

Hashes for banglish_stopwords-0.1.0.tar.gz

Algorithm      Hash digest
SHA256         9930105af477a10c23aa92b24336e64abaf2d34b9a59de96123e8d106dfade6b
MD5            18ac54c5feb46bc82ed682340480f058
BLAKE2b-256    13efc504b014fca23c06e9efd5445af15686228638d2df6300adbb4bba9f06a5
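
One way to use the digests listed above is to hash a downloaded file locally and compare the result. The sketch below uses only Python's standard hashlib module. The helper name sha256_of and the local file path are assumptions for illustration; the expected digest is the SHA256 value listed for the source distribution.

```python
import hashlib


def sha256_of(path: str, chunk_size: int = 65536) -> str:
    """Return the hex SHA256 digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        # iter() with a sentinel keeps reading until read() returns b"".
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


# SHA256 listed for banglish_stopwords-0.1.0.tar.gz.
EXPECTED_SDIST_SHA256 = (
    "9930105af477a10c23aa92b24336e64abaf2d34b9a59de96123e8d106dfade6b"
)

# Example usage, assuming the archive was saved in the current directory:
# assert sha256_of("banglish_stopwords-0.1.0.tar.gz") == EXPECTED_SDIST_SHA256
```

A mismatch means the local file differs from the one that was uploaded and should not be installed.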

File details

Details for the file banglish_stopwords-0.1.0-py3-none-any.whl.

File hashes

Hashes for banglish_stopwords-0.1.0-py3-none-any.whl

Algorithm      Hash digest
SHA256         d53a63e338e23073dcab5ec85b32b7ca598e77bc4fa817dfc3f1a38afcc35693
MD5            5a795a0e2a34888ca4a32ac6d278f234
BLAKE2b-256    e2b9b05dc849e5db70cbb7b21d27f22a0728ad3483c7b546628f789c36d12c85
