Skip to main content

A simple stopword cleaner

Project description

Stopwordz

Stopwordz is a Python library designed to efficiently remove stopwords from text data. It comes with a pre-defined set of stopwords ideal for various text preprocessing tasks, especially in natural language processing applications.

Installation

You can easily install Stopwordz from PyPI:

pip install stopwordz

Usage

Using Stopwordz is straightforward. Import the clean_text function from the package and pass your text data to it:

from stopwordz import clean_text

# Example text
text = "This is a sample sentence, showing off the stop words filtration."
cleaned_text = clean_text(text)
print(cleaned_text)

Features

  • Pre-defined comprehensive list of stopwords.
  • Easy to integrate into text preprocessing pipelines.
  • Lightweight and fast.

Contributing

Contributions are welcome! Please fork the repository and open a pull request with your additions. For major changes, please open an issue first to discuss what you would like to change.

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

stopwordz-0.2.tar.gz (5.9 kB view hashes)

Uploaded Source

Built Distribution

stopwordz-0.2-py3-none-any.whl (6.1 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page