Text-related datasets
This datamaestro plugin covers text-related datasets:
- Information Retrieval
- Natural Language Processing tasks
The list of available datasets and usage instructions can be found in the documentation.
List of available datasets
Below is the list of available datasets along with their ids. Some datasets have several versions; in that case, the dataset id is suffixed with the version.
Documents
- Aquaint: edu.upenn.ldc.aquaint
- TIPSTER: gov.nist.trec.tipster
- WikiText-2 and WikiText-103: io.metamind.research.wikitext
Word embeddings
- GloVe: edu.stanford.glove
Sentiment analysis
- IMDB: edu.stanford.aclimdb
Information Retrieval
TREC
- TREC-1 to TREC-8, Robust 2004 and 2005: gov.nist.trec.adhoc
Source Distributions
No source distribution files are available for this release.
Built Distribution
File details
Details for the file datamaestro_text-2020.2.25-py3-none-any.whl.
File metadata
- Download URL: datamaestro_text-2020.2.25-py3-none-any.whl
- Upload date:
- Size: 45.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/42.0.2 requests-toolbelt/0.9.1 tqdm/4.43.0 CPython/3.7.6
File hashes
Algorithm | Hash digest |
---|---|
SHA256 | da8bf04b2e946e191a53badf099e8f7c6cdbcc23763f003dae3b0c50734ed1f5 |
MD5 | 8b487df243bb0ddddee86465a5447469 |
BLAKE2b-256 | 0c9ceb6568be54be516fd14703e4703533a4ea5fb9c314c834e1200e6208a069 |
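The published digests can be used to check a downloaded wheel before installing it. The following is a minimal sketch using only the Python standard library; the helper names `sha256_of_file` and `matches_expected` are illustrative and not part of datamaestro, and the local file path is an assumption about where the wheel was saved.

```python
import hashlib
from pathlib import Path

# SHA256 digest listed above for datamaestro_text-2020.2.25-py3-none-any.whl
EXPECTED_SHA256 = "da8bf04b2e946e191a53badf099e8f7c6cdbcc23763f003dae3b0c50734ed1f5"


def sha256_of_file(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with Path(path).open("rb") as handle:
        # Read fixed-size chunks until read() returns an empty bytes object.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_SHA256):
    """Compare the file's digest with the expected one, ignoring letter case."""
    return sha256_of_file(path) == expected.lower()


if __name__ == "__main__":
    # Hypothetical location: assumes the wheel sits in the current directory.
    wheel = Path("datamaestro_text-2020.2.25-py3-none-any.whl")
    if wheel.exists():
        print("digest matches" if matches_expected(wheel) else "digest mismatch")
    else:
        print(f"{wheel} not found; download it first")
```

Reading in chunks keeps memory use constant regardless of file size, which matters little for a 45.9 kB wheel but makes the helper reusable for larger archives.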