Skip to main content

19,000 unique words from all publically available PubMed abstracts for use in NLP and spell-checking.

Project description

Pubword

This is a collection of 19,000 unique words scraped from publically available PubMed abstracts.

INFORMATION

  1. This repository contains 19,000 unique words which have mostly been cleaned -- spot check as needed.
  2. This repository contains words in lower case only, at this time.

HOW TO USE

  1. Install pip install pubwords
  2. Import from pubwords.words import word_list
  3. Get the word list your_variable = word_list

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pubwords-1.0.5.tar.gz (145.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pubwords-1.0.5-py3-none-any.whl (149.1 kB view details)

Uploaded Python 3

File details

Details for the file pubwords-1.0.5.tar.gz.

File metadata

  • Download URL: pubwords-1.0.5.tar.gz
  • Upload date:
  • Size: 145.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/32.0 requests/2.26.0 requests-toolbelt/0.9.1 urllib3/1.26.7 tqdm/4.62.3 importlib-metadata/4.11.1 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.10.1

File hashes

Hashes for pubwords-1.0.5.tar.gz
Algorithm Hash digest
SHA256 38154eec440a2ba1d7e2854c6ba5aa197c2a5482f935936d0fad669b5c78e562
MD5 a2345ab35ee97cab0628f078259c66ba
BLAKE2b-256 9c66c2b8544103eb6c51050b0c11aafbabc131f5343eeb1e17e042a6842a4c1b

See more details on using hashes here.

File details

Details for the file pubwords-1.0.5-py3-none-any.whl.

File metadata

  • Download URL: pubwords-1.0.5-py3-none-any.whl
  • Upload date:
  • Size: 149.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/32.0 requests/2.26.0 requests-toolbelt/0.9.1 urllib3/1.26.7 tqdm/4.62.3 importlib-metadata/4.11.1 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.10.1

File hashes

Hashes for pubwords-1.0.5-py3-none-any.whl
Algorithm Hash digest
SHA256 264cb470bd98a07087970eb07196f4d72a0f2dde909cb8a20286437fff3b5697
MD5 877ef290a09d239940770e8a9dba0ccf
BLAKE2b-256 ba519011002dbb5ff3cea0b69882e64edbc479e8e8a965c5d2e3319c1ce19d2c

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page