A simple stopword cleaner
Stopwordz
Stopwordz is a Python library for removing stopwords from text data. It ships with a predefined set of stopwords intended for text preprocessing tasks, especially in natural language processing applications.
Installation
You can easily install Stopwordz from PyPI:
pip install stopwordz
Usage
Using Stopwordz is straightforward. Import the clean_text function from the package and pass your text data to it:
from stopwordz import clean_text
# Example text
text = "This is a sample sentence, showing off the stop words filtration."
cleaned_text = clean_text(text)
print(cleaned_text)
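To make the idea concrete, here is a minimal sketch of what stopword filtering of this kind typically does. It is not Stopwordz's actual implementation: the `remove_stopwords` function, the `SAMPLE_STOPWORDS` set, and the tokenization rule are all assumptions made for illustration, and the real `clean_text` may tokenize, handle punctuation, or format its result differently.

```python
import re

# Hypothetical, deliberately tiny stopword set for illustration only.
# Stopwordz ships its own predefined list, which is not reproduced here.
SAMPLE_STOPWORDS = frozenset({"this", "is", "a", "the", "off"})


def remove_stopwords(text, stopwords=SAMPLE_STOPWORDS):
    """Return the text with stopwords dropped.

    Tokens are runs of letters and apostrophes, so punctuation is discarded.
    Matching against the stopword set is case-insensitive.
    """
    tokens = re.findall(r"[A-Za-z']+", text)
    kept = [token for token in tokens if token.lower() not in stopwords]
    return " ".join(kept)


text = "This is a sample sentence, showing off the stop words filtration."
print(remove_stopwords(text))
# prints: sample sentence showing stop words filtration
```

A set is used for the stopword list because membership checks on a set take roughly constant time, which keeps the filter fast on long inputs.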
Features
- Pre-defined comprehensive list of stopwords.
- Easy to integrate into text preprocessing pipelines.
- Lightweight and fast.
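As a rough sketch of pipeline integration, the helper below applies a cleaning function to a batch of documents after normalizing whitespace. The `preprocess` helper and `demo_cleaner` stand-in are hypothetical names introduced here, and the sketch assumes the cleaner takes a string and returns a string. If `clean_text` behaves that way, it could be passed in place of `demo_cleaner`.

```python
from typing import Callable, Iterable, List


def preprocess(docs: Iterable[str], cleaner: Callable[[str], str]) -> List[str]:
    """Collapse runs of whitespace in each document, then apply the cleaner."""
    results = []
    for doc in docs:
        normalized = " ".join(doc.split())
        results.append(cleaner(normalized))
    return results


def demo_cleaner(text: str) -> str:
    """Hypothetical stand-in so the sketch runs without the package installed."""
    stop = {"the", "is", "a"}
    return " ".join(word for word in text.split() if word.lower() not in stop)


docs = ["The  cat is   on a mat", "A dog   barks"]
print(preprocess(docs, demo_cleaner))
# prints: ['cat on mat', 'dog barks']
```

Passing the cleaner as an argument keeps the pipeline independent of any one stopword library, so the cleaning step can be swapped or tested in isolation.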
Contributing
Contributions are welcome! Please fork the repository and open a pull request with your additions. For major changes, please open an issue first to discuss what you would like to change.
License
File details
Details for the file stopwordz-0.2.tar.gz.
File metadata
- Download URL: stopwordz-0.2.tar.gz
- Upload date:
- Size: 5.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.11.8
File hashes
Algorithm | Hash digest
---|---
SHA256 | bce78a1b7cc794d2cd39dd522517b1982df9d8978213c96ac7de142154f55b6a
MD5 | d75e2bcf948bd92467f07ce23ee257b7
BLAKE2b-256 | b782380587c9e92bf5e6288ead0353bd3c401521b7ad42ff97b1c462aae203ee
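A published digest like the ones above can be checked against a downloaded file with Python's standard `hashlib` module. The following is a minimal sketch: the helper names `sha256_of_file` and `matches_published_digest` are hypothetical, and the local path assumes the archive was saved under its original file name in the current directory.

```python
import hashlib
import os


def sha256_of_file(path, chunk_size=65536):
    """Compute the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_digest(path, expected_hex):
    """Return True if the file's SHA256 digest equals the published value."""
    return sha256_of_file(path) == expected_hex.strip().lower()


# Assumed local file name; the check is skipped if the file is not present.
archive = "stopwordz-0.2.tar.gz"
published = "bce78a1b7cc794d2cd39dd522517b1982df9d8978213c96ac7de142154f55b6a"
if os.path.exists(archive):
    print(matches_published_digest(archive, published))
```

Reading in chunks keeps memory use flat regardless of file size, which matters little for a 5.9 kB archive but makes the helper reusable for larger downloads.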
File details
Details for the file stopwordz-0.2-py3-none-any.whl.
File metadata
- Download URL: stopwordz-0.2-py3-none-any.whl
- Upload date:
- Size: 6.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.11.8
File hashes
Algorithm | Hash digest
---|---
SHA256 | d705b0e35fa76ce0e359601167d1ae0881b0276eee943a9501eaaa9e8bb80f43
MD5 | 85aff71d044b92ce75575e642c0fce94
BLAKE2b-256 | 851b84b90a876b255fe1db8188fd0f20c98714fbc4cb02c85c8d63c4453934e6