19,000 unique words from all publicly available PubMed abstracts for use in NLP and spell-checking.
Pubword
This is a collection of 19,000 unique words scraped from publicly available PubMed abstracts.
INFORMATION
- This repository contains 19,000 unique words, most of which have been cleaned; spot-check as needed.
- This repository contains words in lowercase only, at this time.
HOW TO USE
- Install
  pip install pubwords
- Import
  from pubwords.words import word_list
- Get the word list
  your_variable = word_list
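As a rough illustration of the spell-checking use case, the sketch below flags words in a piece of text that are missing from the word list. It assumes `word_list` is an iterable of lowercase strings, which matches the description above but is not confirmed here. The helper `find_unknown_words` and the fallback sample list are hypothetical names introduced only for this example.

```python
import re


def find_unknown_words(text, vocabulary):
    """Return words in `text` that are absent from `vocabulary`, in order of first appearance."""
    known = set(vocabulary)  # a set gives fast membership checks
    seen = set()
    unknown = []
    # Lowercase the text first, since the word list is lowercase only.
    for token in re.findall(r"[a-z]+", text.lower()):
        if token not in known and token not in seen:
            seen.add(token)
            unknown.append(token)
    return unknown


try:
    # Import path taken from the usage steps above.
    from pubwords.words import word_list
except ImportError:
    # Hypothetical stand-in data so the sketch still runs without the package installed.
    word_list = ["the", "kinase", "inhibits", "protein"]

print(find_unknown_words("The kinase inhibts the protein", word_list))
```

Converting the list to a set once keeps each lookup cheap, which may matter when checking long documents. Words that are merely absent from the list (rare terms, proper nouns) will be flagged too, so the result is a set of candidates for review rather than confirmed misspellings.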
Download files
Source Distribution
pubwords-1.0.5.tar.gz (145.7 kB)
Built Distribution
pubwords-1.0.5-py3-none-any.whl (149.1 kB)
File details
Details for the file pubwords-1.0.5.tar.gz.
File metadata
- Download URL: pubwords-1.0.5.tar.gz
- Upload date:
- Size: 145.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/32.0 requests/2.26.0 requests-toolbelt/0.9.1 urllib3/1.26.7 tqdm/4.62.3 importlib-metadata/4.11.1 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.10.1
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 38154eec440a2ba1d7e2854c6ba5aa197c2a5482f935936d0fad669b5c78e562 |
| MD5 | a2345ab35ee97cab0628f078259c66ba |
| BLAKE2b-256 | 9c66c2b8544103eb6c51050b0c11aafbabc131f5343eeb1e17e042a6842a4c1b |
File details
Details for the file pubwords-1.0.5-py3-none-any.whl.
File metadata
- Download URL: pubwords-1.0.5-py3-none-any.whl
- Upload date:
- Size: 149.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/32.0 requests/2.26.0 requests-toolbelt/0.9.1 urllib3/1.26.7 tqdm/4.62.3 importlib-metadata/4.11.1 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.10.1
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 264cb470bd98a07087970eb07196f4d72a0f2dde909cb8a20286437fff3b5697 |
| MD5 | 877ef290a09d239940770e8a9dba0ccf |
| BLAKE2b-256 | ba519011002dbb5ff3cea0b69882e64edbc479e8e8a965c5d2e3319c1ce19d2c |