A command line application for downloading scientific papers
Project description
Overview
papers-dl is a command line application for downloading scientific papers.
Usage
# parse DOI identifiers from a file:
papers-dl parse -m doi --path pages/my-paper.html
# parse ISBN identifiers from a file, output matches as CSV:
papers-dl parse -m isbn --path pages/my-paper.html -f csv
# fetch paper with given identifier from any known provider:
papers-dl fetch "10.1016/j.cub.2019.11.030"
# fetch paper from any known Sci-Hub URL with verbose logging on, and store in "papers" directory:
papers-dl -v fetch -p "scihub" -o "papers" "10.1107/s0907444905036693"
# fetch paper from specific Sci-Hub URL:
papers-dl fetch -p "sci-hub.ee" "10.1107/s0907444905036693"
# fetch paper from SciDB (Anna's Archive):
papers-dl fetch -p "scidb" "10.1107/s0907444905036693"
About
papers-dl attempts to be a comprehensive tool for gathering research papers from popular open libraries. There are other solutions for this (see "Other tools" below), but papers-dl is trying to fill its own niche:
- comprehensive: other tools usually work with a single library, while
papers-dlis trying to support a collection of popular libraries. - performant:
papers-dltries to improve search and retrieval times by making use of concurrency where possible.
That said, papers-dl may not be the best choice for your specific use case right now. For example, if you require features supported by a specific library, one of the more mature and specialized tools listed below may be a better option.
papers-dl was initially created to serve as an extractor for ArchiveBox, a powerful solution for self-hosted web archiving.
This project started as a fork of scihub.py.
Other tools
Roadmap
papers-dl's CLI is not yet stable.
Short-term roadmap:
parsing
- add support for parsing more identifier types, like PMID, ISSN, and arXiv identifiers
fetching
- add arXiv as a provider
- add support for downloading formats other than PDFs, like HTML or epub
searching
- add a CLI command for searching libraries for papers and metadata
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file papers_dl-0.0.22.tar.gz.
File metadata
- Download URL: papers_dl-0.0.22.tar.gz
- Upload date:
- Size: 12.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.12.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
283588b4008f55ee306f561d5c39fc5143fa91529bf58b5ca68a86514f3ff5f2
|
|
| MD5 |
6911e63e045c783626d5e86f86f853fc
|
|
| BLAKE2b-256 |
511841a75af18809b62c0041bfbb7e51173382a3d43ba0c892cdfd13de17c115
|
File details
Details for the file papers_dl-0.0.22-py3-none-any.whl.
File metadata
- Download URL: papers_dl-0.0.22-py3-none-any.whl
- Upload date:
- Size: 12.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.12.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
d102b73f02d2aa51b4195c8b16d5aa21e7ccaba90fe57f6d6faaf50327e8f6d3
|
|
| MD5 |
3fe9038ab5d7b1f244513278662d446b
|
|
| BLAKE2b-256 |
70c7952742ff030d99ad317a0c74422d9ae60f2f8db0a20954ad0aded75e1fed
|