Skip to main content

19,000 unique words from all publically available PubMed abstracts for use in NLP and spell-checking.

Project description

Pubword

This is a collection of 19,000 unique words scraped from publically available PubMed abstracts.

INFORMATION

  1. This repository contains 19,000 unique words which have mostly been cleaned -- spot check as needed.
  2. This repository contains words in lower case only, at this time.

HOW TO USE

  1. Install pip install pubwords
  2. Import from pubwords.words import word_list
  3. Get the word list your_variable = word_list

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pubwords-1.0.6.tar.gz (140.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pubwords-1.0.6-py3-none-any.whl (139.9 kB view details)

Uploaded Python 3

File details

Details for the file pubwords-1.0.6.tar.gz.

File metadata

  • Download URL: pubwords-1.0.6.tar.gz
  • Upload date:
  • Size: 140.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/32.0 requests/2.26.0 requests-toolbelt/0.9.1 urllib3/1.26.7 tqdm/4.62.3 importlib-metadata/4.11.1 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.10.1

File hashes

Hashes for pubwords-1.0.6.tar.gz
Algorithm Hash digest
SHA256 5b9bdb736bfa958507b18622344adf167969e867bce7980495955cb8fec048da
MD5 7283ca753e17bbd383d0971ad5c685fd
BLAKE2b-256 eb62a328638f29a842eeeb622771dffb710066a173ccd8ab0c24177c89178ddf

See more details on using hashes here.

File details

Details for the file pubwords-1.0.6-py3-none-any.whl.

File metadata

  • Download URL: pubwords-1.0.6-py3-none-any.whl
  • Upload date:
  • Size: 139.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/32.0 requests/2.26.0 requests-toolbelt/0.9.1 urllib3/1.26.7 tqdm/4.62.3 importlib-metadata/4.11.1 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.10.1

File hashes

Hashes for pubwords-1.0.6-py3-none-any.whl
Algorithm Hash digest
SHA256 f670e5e58037348b5545683ca419fec2174efca5882b51969c9a5a4b9df1f850
MD5 92e27baf63b9e029929b1fc2ffdbd023
BLAKE2b-256 8f4379dbf00df5fff9a34da3a5af7a3be8baac60c2595e03e3b223b21107af49

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page