19,000 unique words from all publicly available PubMed abstracts for use in NLP and spell-checking.
Project description
Pubword
This is a collection of 19,000 unique words scraped from publicly available PubMed abstracts.
INFORMATION
- This repository contains 19,000 unique words which have mostly been cleaned -- spot check as needed.
- This repository contains words in lowercase only, at this time.
HOW TO USE
- Install
pip install pubwords
- Import
from pubwords.words import word_list
- Get the word list
your_variable = word_list
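As a rough illustration of the spell-checking use case, the sketch below flags tokens that are missing from a vocabulary. The helper `find_unknown_words` and the tiny `sample_vocabulary` are hypothetical names introduced here so the example runs without the package. It also assumes `word_list` behaves as an iterable of lowercase strings, which the description above suggests but does not state outright.

```python
import re


def find_unknown_words(text, vocabulary):
    """Return lowercase tokens from text that are not in vocabulary.

    Tokens are reported once each, in order of first appearance.
    """
    vocab = set(vocabulary)  # set membership is fast for large lists
    seen = set()
    unknown = []
    # Lowercase the text first, since the word list is lowercase only.
    for token in re.findall(r"[a-z]+", text.lower()):
        if token not in vocab and token not in seen:
            seen.add(token)
            unknown.append(token)
    return unknown


# Stand-in vocabulary for illustration only. With the package installed,
# one would presumably pass the real list instead:
#   from pubwords.words import word_list
sample_vocabulary = ["protein", "kinase", "inhibits", "the", "receptor"]

print(find_unknown_words("The protien kinase inhibits the Receptor", sample_vocabulary))
# prints ['protien']
```

Because the list has "mostly been cleaned", a flagged token is best treated as a candidate for review rather than a confirmed misspelling.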
Download files
Source Distribution
pubwords-1.0.6.tar.gz (140.8 kB)
Built Distribution
pubwords-1.0.6-py3-none-any.whl (139.9 kB)
File details
Details for the file pubwords-1.0.6.tar.gz.
File metadata
- Download URL: pubwords-1.0.6.tar.gz
- Upload date:
- Size: 140.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/32.0 requests/2.26.0 requests-toolbelt/0.9.1 urllib3/1.26.7 tqdm/4.62.3 importlib-metadata/4.11.1 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.10.1
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 5b9bdb736bfa958507b18622344adf167969e867bce7980495955cb8fec048da |
| MD5 | 7283ca753e17bbd383d0971ad5c685fd |
| BLAKE2b-256 | eb62a328638f29a842eeeb622771dffb710066a173ccd8ab0c24177c89178ddf |
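A downloaded archive can be compared against the SHA256 digest listed above. The following is a minimal sketch using Python's standard `hashlib`. The helper name `sha256_of_file` is hypothetical, and the archive is assumed to sit in the current working directory under its published file name.

```python
import hashlib
import os

# SHA256 digest listed above for pubwords-1.0.6.tar.gz.
EXPECTED_SHA256 = "5b9bdb736bfa958507b18622344adf167969e867bce7980495955cb8fec048da"


def sha256_of_file(path, chunk_size=65536):
    """Return the hex SHA256 digest of the file at path, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # iter() with a sentinel keeps reading until read() returns b"".
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


archive = "pubwords-1.0.6.tar.gz"  # assumed location: current directory
if os.path.exists(archive):
    if sha256_of_file(archive) == EXPECTED_SHA256:
        print("digest matches")
    else:
        print("digest MISMATCH")
```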
File details
Details for the file pubwords-1.0.6-py3-none-any.whl.
File metadata
- Download URL: pubwords-1.0.6-py3-none-any.whl
- Upload date:
- Size: 139.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/32.0 requests/2.26.0 requests-toolbelt/0.9.1 urllib3/1.26.7 tqdm/4.62.3 importlib-metadata/4.11.1 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.10.1
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | f670e5e58037348b5545683ca419fec2174efca5882b51969c9a5a4b9df1f850 |
| MD5 | 92e27baf63b9e029929b1fc2ffdbd023 |
| BLAKE2b-256 | 8f4379dbf00df5fff9a34da3a5af7a3be8baac60c2595e03e3b223b21107af49 |