Skip to main content

Text related datasets

Project description

CircleCI PyPI version

Text-related datasets

This datamaestro plugin covers text-related datasets:

  • Information Retrieval
  • Natural Language Processing tasks

The list of available datasets and usage instruction can be found in the documentation.

List of available datasets

Below is the list of available datasets along with ids. Some datasets have several versions; in this case, the dataset id is suffixed with this information.

Documents

  • Aquaint edu.upenn.ldc.aquaint
  • TIPSTER gov.nist.trec.tipster
  • WikiText-2 and WikiText-103 io.metamind.research.wikitext

Word embeddings

  • Glove edu.stanford.glove

Sentiment analysis

  • IMDB edu.stanford.aclimdb

Information Retrieval

TREC

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

datamaestro_text-2019.12.5.tar.gz (13.1 kB view details)

Uploaded Source

File details

Details for the file datamaestro_text-2019.12.5.tar.gz.

File metadata

  • Download URL: datamaestro_text-2019.12.5.tar.gz
  • Upload date:
  • Size: 13.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/40.6.2 requests-toolbelt/0.9.1 tqdm/4.40.0 CPython/3.6.9

File hashes

Hashes for datamaestro_text-2019.12.5.tar.gz
Algorithm Hash digest
SHA256 ef8ea9ca1660327a21a5aedfc8fa8a3fb630437b494357fe396ab5683c634ae1
MD5 5af4146a0f149b73bf27f439be2f167a
BLAKE2b-256 cf94e7b37bb2750bf4e24d2e77582a03f220f87f22286b4b2487c9abd81f8595

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page