Stop word lists in many languages
Project description
Simple Python package that provides a single function for loading sets of stop words for different languages.
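As an illustration of what such a set is typically used for, here is a minimal sketch that filters stop words out of a sentence. The name of the package's loading function is not given in this description, so the sketch uses a small hand-written set as a stand-in; remove_stop_words is a hypothetical helper, not part of the package.

```python
# Hypothetical stand-in for a set loaded by the package; these few English
# words are written by hand for illustration only.
stop_words = {"a", "an", "the", "of", "in", "on", "and", "is"}


def remove_stop_words(text, stop_words):
    """Return the lowercased, whitespace-split tokens of `text` that are not stop words."""
    tokens = text.lower().split()
    return [token for token in tokens if token not in stop_words]


filtered = remove_stop_words("The cat is on the mat", stop_words)
print(filtered)
```

Using a set rather than a list keeps each membership check at constant time on average, which matters when filtering large texts.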
Stop words in English, Italian, Portuguese and Spanish were retrieved from the following sources:
Wiktionary lists of prepositions in the respective languages
NLTK
The directory called orig contains the original files used to compile the stop word lists. The directory called not_used contains raw data for creating stop word lists for languages that are not yet available in many_stop_words.available_languages.
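Compiling one list from several sources might look roughly like the sketch below. The format of the files in orig is not described here, so the sketch assumes one word per line and works on in-memory lists; merge_word_lists and the sample words are hypothetical and do not reflect the package's actual build process.

```python
def merge_word_lists(*sources):
    """Merge several iterables of words into one sorted, de-duplicated list.

    Assumes one word per entry; surrounding whitespace is stripped, words are
    lowercased, and blank entries are skipped.
    """
    merged = set()
    for source in sources:
        for line in source:
            word = line.strip().lower()
            if word:
                merged.add(word)
    return sorted(merged)


# Hand-written sample data standing in for two raw source lists.
wiktionary_prepositions = ["of", "in", "On ", ""]
nltk_words = ["the", "of", "and\n"]

compiled = merge_word_lists(wiktionary_prepositions, nltk_words)
print(compiled)
```

The same function accepts open file objects, since iterating over a text file yields its lines.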