Orange3 TextMining add-on.
Project description
Orange3 Text
Orange add-on for text mining. It provides access to publicly available data sources such as NY Times, Twitter and PubMed. It also provides tools for preprocessing, for constructing vector spaces (bag-of-words, topic modeling and word2vec), and for visualizations such as word cloud and geo map. All features can be combined with the data mining techniques of the Orange data mining framework.
See documentation.
Features
Access to data
- Load a corpus of text documents
- Access publicly available data (The Guardian, NY Times, Twitter, Wikipedia, PubMed)
Text analysis
- Preprocess corpus
- Generate bag of words
- Embed documents into vector space
- Perform sentiment analysis
- Detect emotions in tweets
- Discover topics in the text
- Compute document statistics
- Visualize frequent words in the word cloud
- Find words that enrich selected documents
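To make the "Preprocess corpus" and "Generate bag of words" items concrete, here is a minimal sketch in plain Python. It does not use the add-on's own API; the function names `preprocess` and `bag_of_words` are hypothetical and chosen only for illustration, and the tokenizer (lowercasing plus a simple regular expression) is an assumption standing in for real preprocessing.

```python
import re
from collections import Counter


def preprocess(text):
    """Lowercase the text and split it into alphabetic tokens (illustrative only)."""
    return re.findall(r"[a-z]+", text.lower())


def bag_of_words(documents):
    """Return (vocabulary, matrix); matrix[i][j] counts term j in document i."""
    tokenized = [preprocess(doc) for doc in documents]
    # Sorted vocabulary gives every term a stable column index.
    vocabulary = sorted({tok for doc in tokenized for tok in doc})
    index = {term: j for j, term in enumerate(vocabulary)}
    matrix = []
    for tokens in tokenized:
        row = [0] * len(vocabulary)
        for term, count in Counter(tokens).items():
            row[index[term]] = count
        matrix.append(row)
    return vocabulary, matrix


docs = ["Text mining with Orange", "Orange makes text mining visual"]
vocab, counts = bag_of_words(docs)
print(vocab)
print(counts)
```

Each row of `counts` is one document and each column one vocabulary term, which is the kind of document-term table that downstream data mining methods can consume.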
Download files
Source Distribution
Orange3-Text-1.8.0.tar.gz (37.1 MB)
Built Distribution
Hashes for Orange3_Text-1.8.0-py3-none-any.whl
Algorithm | Hash digest
---|---
SHA256 | dbfef53b7ce90920e9032893cb60ab839cb60c434b42f84504306fec6a673bc8
MD5 | 51dd9826136a0645b2c4fa50aacba072
BLAKE2b-256 | 50a95f9515b04428bbc757f50ec0dd33ae3043ed34b4163e8686ad629f7a93bb
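A downloaded file can be checked against the published SHA256 digest with Python's standard `hashlib` module. The following is a sketch under stated assumptions: the helper names `sha256_of_file` and `matches_published_hash` are hypothetical, and the wheel is assumed to have been saved locally under the file name shown above.

```python
import hashlib

# SHA256 digest published above for Orange3_Text-1.8.0-py3-none-any.whl.
EXPECTED_SHA256 = "dbfef53b7ce90920e9032893cb60ab839cb60c434b42f84504306fec6a673bc8"


def sha256_of_file(path, chunk_size=65536):
    """Compute the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path, expected=EXPECTED_SHA256):
    """Return True if the file's SHA256 digest equals the expected one."""
    return sha256_of_file(path) == expected.lower()
```

For example, `matches_published_hash("Orange3_Text-1.8.0-py3-none-any.whl")` should return `True` for an intact download of that wheel; the local path is an assumption about where the file was saved.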