getarticle

Description
getarticle is a package based on SciHub and Google Scholar that can download articles given DOI, website address or keywords.
Install
Install with pip:
# for latest version (developer)
pip install git+https://github.com/HTian1997/getarticle.git
# for last release (stable)
pip install getarticle
Features & Usage
getarticle can be imported in Python or used from the command line.
To use from the command line:
usage: getarticle [-h] [-i INPUT] [-o OUTPUT] [-sd SETDOWNLOAD]
getarticle CLI
optional arguments:
-h, --help show this help message and exit
-i INPUT, --input INPUT
article DOI or website
-s SEARCH [SEARCH ...], --search SEARCH [SEARCH ...]
search keywords
-o OUTPUT, --output OUTPUT
download direction
-sd SETDIRECTION, --setdirection SETDIRECTION
set default download direction
Example:
getarticle -i 10.1126/science.abc7424 -o /Users/haotian/Desktop
Replace the download directory with your own. By default, articles are downloaded to the terminal's current working directory. To change the default download directory, use the -sd option.
Tip: If the DOI contains parentheses, wrap the DOI in double quotes.
Example:
getarticle -sd /Users/haotian/Downloads
# will download to /Users/haotian/Downloads folder
getarticle -i 10.1126/science.abc7424
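When the command is assembled by a script rather than typed by hand, the quoting mentioned in the tip above can be delegated to Python's standard library. The following is a minimal sketch, assuming getarticle is on the PATH and Python 3.8 or later; the helper name build_command and the parenthesised DOI are hypothetical and not part of the package.

```python
import shlex


def build_command(doi, output_dir=None):
    """Return a shell-safe getarticle command line (hypothetical helper)."""
    args = ["getarticle", "-i", doi]
    if output_dir is not None:
        args += ["-o", output_dir]
    # shlex.join quotes every argument that contains shell-special
    # characters, such as the parentheses found in some DOIs.
    return shlex.join(args)


# Placeholder DOI, used only to show the quoting.
print(build_command("10.1000/example(2020)1"))
```

shlex uses single quotes rather than double quotes; in POSIX shells both protect parentheses.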
getarticle can also download the article on the current webpage (only supported in Safari on macOS).
Example:
# current Safari webpage:
# https://www.nature.com/articles/s41467-020-16670-2
# download the article on the current webpage to the default directory
getarticle
To use in Python:
- Initialization
from getarticle import GetArticle
ga = GetArticle()
- Download a single article given DOI or website address.
ga.input_article("10.1126/science.abc7424")
ga.download()
Notes:
- Once downloaded, all stored articles will be cleared;
- The direction argument of the download function defaults to the current working directory;
- The downloaded article is named "article title.pdf" if the title is retrieved successfully, or "year-month-day-hour-minute-second.pdf" otherwise.
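The naming rule in the last note can be illustrated with a short sketch. This is not the package's implementation: the function name pdf_filename, the zero-padded timestamp fields, and the replacement of characters that are invalid in file names are all assumptions made for the example.

```python
import re
from datetime import datetime


def pdf_filename(title, now=None):
    """Return "<title>.pdf", or a timestamp-based name when no title is known."""
    if title and title.strip():
        # Assumption: replace characters that common file systems reject.
        safe = re.sub(r'[\\/:*?"<>|]', "_", title.strip())
        return f"{safe}.pdf"
    now = now or datetime.now()
    # year-month-day-hour-minute-second, zero-padded (padding is an assumption)
    return now.strftime("%Y-%m-%d-%H-%M-%S") + ".pdf"


print(pdf_filename("Deep Dive into Machine Learning Models for Protein Engineering"))
print(pdf_filename(None))
```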
- Download multiple articles.
ga.input_article("https://www.nature.com/articles/s41594-020-0468-7#article-info")
ga.input_article("10.1038/s41893-020-0581-y")
ga.download()
Notes:
- Calling input_article repeatedly stores multiple articles. getarticle will not store or download duplicate articles;
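The duplicate handling described in the note can be pictured as an ordered store that ignores repeated identifiers. The sketch below shows that idea only; the class ArticleStore and its methods are hypothetical and say nothing about how getarticle stores articles internally.

```python
class ArticleStore:
    """Ordered collection of article identifiers that ignores duplicates."""

    def __init__(self):
        self._items = []

    def add(self, identifier):
        """Store a DOI or URL; return False if it was already stored."""
        key = identifier.strip()
        if key in self._items:
            return False
        self._items.append(key)
        return True

    def items(self):
        """Return the stored identifiers in insertion order."""
        return list(self._items)


store = ArticleStore()
store.add("10.1038/s41893-020-0581-y")
store.add("10.1038/s41893-020-0581-y")  # ignored: already stored
print(store.items())
```

This sketch compares identifiers as exact strings, so a DOI and a URL pointing to the same article would count as two entries.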
- Download related articles given keywords. Keywords can be article names, research fields or author names.
ga.search("Deep Dive into Machine Learning Models for Protein Engineering")
ga.search("SARS, Computation", num_of_page=2)
ga.search("Roberta Croce")
ga.download()
Notes:
- num_of_page is the number of Google Scholar result pages to search, 1 by default.
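To make the meaning of num_of_page concrete, the sketch below lists the Google Scholar result-page URLs that a given page count would cover. It rests on assumptions: that Scholar serves 10 results per page and paginates with the q and start query parameters, and the helper name scholar_page_urls is hypothetical. It does not describe how getarticle performs the search.

```python
from urllib.parse import urlencode

RESULTS_PER_PAGE = 10  # assumption: Google Scholar's default page size


def scholar_page_urls(keywords, num_of_page=1):
    """Return one result-page URL per requested page (hypothetical helper)."""
    base = "https://scholar.google.com/scholar"
    return [
        f"{base}?{urlencode({'q': keywords, 'start': page * RESULTS_PER_PAGE})}"
        for page in range(num_of_page)
    ]


for url in scholar_page_urls("SARS, Computation", num_of_page=2):
    print(url)
```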
- Show currently stored articles / delete article by index.
ga.cur_articles()
ga.remove_article()
License
MIT