A simple stopword cleaner
Project description
Stopwordz
Stopwordz is a Python library designed to efficiently remove stopwords from text data. It comes with a pre-defined set of stopwords ideal for various text preprocessing tasks, especially in natural language processing applications.
Installation
You can easily install Stopwordz from PyPI:
pip install stopwordz
Usage
Using Stopwordz is straightforward. Import the clean_text
function from the package and pass your text data to it:
from stopwordz import clean_text
# Example text
text = "This is a sample sentence, showing off the stop words filtration."
cleaned_text = clean_text(text)
print(cleaned_text)
Features
- Pre-defined comprehensive list of stopwords.
- Easy to integrate into text preprocessing pipelines.
- Lightweight and fast.
Contributing
Contributions are welcome! Please fork the repository and open a pull request with your additions. For major changes, please open an issue first to discuss what you would like to change.
License
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
stopwordz-0.1.tar.gz
(5.2 kB
view hashes)