Skip to main content

Web Scraper package for generating wordclouds from research paper abstracts.

Project description

CloudsOfArx

codecov GitHub Workflow Status PyPI

An automated webscraper package to make wordcloud images out of the abstracts of your first-author papers.

Installation

To use this package just install via pip

pip install CloudsOfArx

Usage

This package is simple and straightforward. To use it simply run the following lines in your python environment of choice after installation via pip.

import CloudsOfArx

CloudsOfArx.create_wordcloud(ADS_TOKEN, author, image_file, orcid=None, save_name=None)

The ADS_TOKEN is required to use the NASA ADS API. Make an account on NASA ADS to acquire an API token key, then copy and paste the key as a string for that argument. The author argument is the name of the first-author in a "LastName, FirstName" formatted string. image_file is a string pointing to the desired image for masking the wordcloud into. I also include the optional orcid parameter for authors who wish to use their ORCID to ensure the papers used are their own work. The save_name argument is an optional argument for naming the saved wordcloud file.

An example of this is shown below This is an image

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

CloudsOfArx-0.4.0.tar.gz (6.3 kB view details)

Uploaded Source

File details

Details for the file CloudsOfArx-0.4.0.tar.gz.

File metadata

  • Download URL: CloudsOfArx-0.4.0.tar.gz
  • Upload date:
  • Size: 6.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.12

File hashes

Hashes for CloudsOfArx-0.4.0.tar.gz
Algorithm Hash digest
SHA256 5814f48f1f7df8baebd937e06bef964248f993413344698b4265400d715110e4
MD5 3d26edf5f9ad241a343d62b715668ce4
BLAKE2b-256 e1788f32db1a060a27ab1b8a84a4a39dd8c606c0f6ffd8356dc03e0d11b08e9b

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page