Download and (optionally) parse APTNotes quickly and easily
Project description
Download APTNotes
Download and (optionally) parse APTNotes quickly and easily
Installation
pip install download-aptnotes
To enable parsing the downloaded PDFs you need to install the extra tika
. This
will try to install the Apache Tika Server which depends on Java 7+. Make sure
that you have an adequate version of Java installed before you try to install it
Without this extra, the only output format available is pdf
.
pip install download-aptnotes[tika]
Usage
Usage: download-aptnotes [OPTIONS]
Download and (optionally) parse APTNotes quickly and easily
Options:
-f, --format [pdf|sqlite|json|csv]
Output format [required]
-o, --output PATH Output path of file or directory [required]
-l, --limit INTEGER Number of files to download
-p, --parallel INTEGER Number of parallell downloads [default: 10]
--install-completion Install completion for the current shell.
--show-completion Show completion for the current shell, to
copy it or customize the installation.
--help Show this message and exit.
Download all documents, parse them and store them in an SQLite database:
download-aptnotes -f sqlite -o aptnotes.sqlite
Download the first 10 documents in the source list, parse them and store them in an SQLite database:
download-aptnotes -f sqlite -o aptnotes.sqlite -l 10
Download all documents and store them as individual files in a directory:
download-aptnotes -f pdf -o aptnotes/
Contributing
Dependencies:
- Java 7+
- Poetry
Clone this repository and install all dependencies:
git clone https://github.com/nikstur/download-aptnotes.git
cd download-aptnotes
poetry install
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for download_aptnotes-0.1.0-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 964c1239493f52794317d44bd44a2995d88cc3fc9898f400eb02211c0e302b72 |
|
MD5 | 9c7dc88892a61a3a9404c55e5de50f95 |
|
BLAKE2b-256 | 3702bfcbaaa6c4e4000818606f365089761e058d7c07c1871cf66a26607cb0da |