Skip to main content

A simple stopword cleaner

Project description

Stopwordz

Stopwordz is a Python library designed to efficiently remove stopwords from text data. It comes with a pre-defined set of stopwords ideal for various text preprocessing tasks, especially in natural language processing applications.

Installation

You can easily install Stopwordz from PyPI:

pip install stopwordz

Usage

Using Stopwordz is straightforward. Import the clean_text function from the package and pass your text data to it:

from stopwordz import clean_text

# Example text
text = "This is a sample sentence, showing off the stop words filtration."
cleaned_text = clean_text(text)
print(cleaned_text)

Features

  • Pre-defined comprehensive list of stopwords.
  • Easy to integrate into text preprocessing pipelines.
  • Lightweight and fast.

Contributing

Contributions are welcome! Please fork the repository and open a pull request with your additions. For major changes, please open an issue first to discuss what you would like to change.

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

stopwordz-0.2.tar.gz (5.9 kB view details)

Uploaded Source

Built Distribution

stopwordz-0.2-py3-none-any.whl (6.1 kB view details)

Uploaded Python 3

File details

Details for the file stopwordz-0.2.tar.gz.

File metadata

  • Download URL: stopwordz-0.2.tar.gz
  • Upload date:
  • Size: 5.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.11.8

File hashes

Hashes for stopwordz-0.2.tar.gz
Algorithm Hash digest
SHA256 bce78a1b7cc794d2cd39dd522517b1982df9d8978213c96ac7de142154f55b6a
MD5 d75e2bcf948bd92467f07ce23ee257b7
BLAKE2b-256 b782380587c9e92bf5e6288ead0353bd3c401521b7ad42ff97b1c462aae203ee

See more details on using hashes here.

File details

Details for the file stopwordz-0.2-py3-none-any.whl.

File metadata

  • Download URL: stopwordz-0.2-py3-none-any.whl
  • Upload date:
  • Size: 6.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.11.8

File hashes

Hashes for stopwordz-0.2-py3-none-any.whl
Algorithm Hash digest
SHA256 d705b0e35fa76ce0e359601167d1ae0881b0276eee943a9501eaaa9e8bb80f43
MD5 85aff71d044b92ce75575e642c0fce94
BLAKE2b-256 851b84b90a876b255fe1db8188fd0f20c98714fbc4cb02c85c8d63c4453934e6

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page