Web Scraper package for generating wordclouds from research paper abstracts.
Project description
CloudsOfArx
An automated webscraper package to make wordcloud images out of the abstracts of your first-author papers.
Installation
To use this package just install via pip
pip install CloudsOfArx
Usage
This package is simple and straightforward. To use it simply run the following lines in your python environment of choice after installation via pip.
import CloudsOfArx
CloudsOfArx.create_wordcloud(ADS_TOKEN, author, image_file, orcid=None, save_name=None)
The ADS_TOKEN is required to use the NASA ADS API. Make an account on NASA ADS to acquire an API token key, then copy and paste the key as a string for that argument. The author argument is the name of the first-author in a "LastName, FirstName" formatted string. image_file is a string pointing to the desired image for masking the wordcloud into. I also include the optional orcid parameter for authors who wish to use their ORCID to ensure the papers used are their own work. The save_name argument is an optional argument for naming the saved wordcloud file.
An example of this is shown below
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
File details
Details for the file CloudsOfArx-0.4.0.tar.gz.
File metadata
- Download URL: CloudsOfArx-0.4.0.tar.gz
- Upload date:
- Size: 6.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.9.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
5814f48f1f7df8baebd937e06bef964248f993413344698b4265400d715110e4
|
|
| MD5 |
3d26edf5f9ad241a343d62b715668ce4
|
|
| BLAKE2b-256 |
e1788f32db1a060a27ab1b8a84a4a39dd8c606c0f6ffd8356dc03e0d11b08e9b
|