用于文本挖掘的蓝鲸附加组件.
Project description
BlueWhale3 Text
Orange add-on for text mining. It provides access to publicly available data, like NY Times, Twitter and PubMed. Further, it provides tools for preprocessing, constructing vector spaces (like bag-of-words, topic modeling and word2vec) and visualizations like word cloud end geo map. All features can be combined with powerful data mining techniques from the Orange data mining framework.
See documentation.
Features
Access to data
- Load a corpus of text documents
- Access publicly available data (The Guardian, NY Times, Twitter, Wikipedia, PubMed)
Text analysis
- Preprocess corpus
- Generate bag of words
- Embed documents into vector space
- Perform sentiment analysis
- Detect emotions in tweets
- Discover topics in the text
- Compute document statistics
- Visualize frequent words in the word cloud
- Find words that enrich selected documents
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
BlueWhale3-Text-1.5.0.tar.gz
(24.9 MB
view details)
Built Distribution
File details
Details for the file BlueWhale3-Text-1.5.0.tar.gz
.
File metadata
- Download URL: BlueWhale3-Text-1.5.0.tar.gz
- Upload date:
- Size: 24.9 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.4.2 importlib_metadata/4.8.1 pkginfo/1.7.1 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.8.5
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 3e72b5bf2a8c8b3a01b08e79141532cd515a0ede272f5734cbdae3dab72337ae |
|
MD5 | 4b54382cc959f688035cb93daa48448a |
|
BLAKE2b-256 | ad294d5019ef4b5f8d110ec454f458d6b9fc0289461215759674b3949940b97d |
File details
Details for the file BlueWhale3_Text-1.5.0-py3-none-any.whl
.
File metadata
- Download URL: BlueWhale3_Text-1.5.0-py3-none-any.whl
- Upload date:
- Size: 17.6 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.4.2 importlib_metadata/4.8.1 pkginfo/1.7.1 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.8.5
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | c2dccda31fda13dae706dd26cbc104cb3103da365f3848bc318d02765d97d561 |
|
MD5 | a096cb1e02aaac42d606935f0efb630f |
|
BLAKE2b-256 | 6a50477633c38326ba1d32209cd530c5b5b6cc902d2c8d1f349c3deff573b9c8 |