bag-of-words implementer and tools
Project description
textbag
textbag is a work-in-progress python module for implementing a bag-of-words transformation on various types of text data.
Features
- Function to clean text (remove stop-words and punctuation).
- Function to create a set containing all words contained in whatever text data is passed.
- Function to create a bag of words representation of text data.
Installation
pip3 install textbag
Authors
- Jacob Der
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
textbag-1.1.4.tar.gz
(3.3 kB
view hashes)