Skip to main content

Command line tool to handle arXiv articles and their BibTeX entries.

Project description

arXiv-script v0.2

The arXiv is the most important open-access repository for preprints in various sciences, e.g. Computer Science, Mathematics and Physics. Each preprint has its unique arXiv identifier (often called arXiv number). The arXiv script (axs) is a minimal command line tool to interact with the preprint of an arXiv identifier:

  • show print its title, authors and abstract to the terminal.
  • get download the preprint and save it with a uniform file name.
  • bib create a BibTeX entry for the preprint (and optionally add it to a .bib-file) to easily cite the preprint in LaTeX documents.

example

Given the arXiv identifier math/0211159, let's see what we can do with it (after the installation which is explained below). Let's take a look at the preprint via

axs show math/0211159 

This command prints the title, author(s), abstract and arXiv subject to the terminal. If we like the article, simply change show to get in the above command. Then the article is downloaded (to the default directory, see below) in the convenient formate AUTHOR(S)-TITLE-YEAR.pdf. In this example:

Perelman-The_entropy_formula_for_the_Ricci_flow_and_its_geometric_applications-2002.pdf

If we decide to cite this preprint in some LaTeX document using BibTeX, simply modify the command to axs bib math/0211159. This yields the following BibTeX-entry:

@article{Perelman-EntropyFormulaFor-math/0211159,
	Author = {Perelman, Grisha},
	Title = {{The} entropy formula for the {Ricci} flow and its geometric applications},
	Year = {2002},
	Note = {\href{https://arxiv.org/abs/math/0211159}{arXiv:math/0211159}}
}

We are then asked if we want to automatically add this entry to our (default) .bib-file enabling us to cite the preprint in LaTeX right away.

installation

At this stage, the axs is still under development. Hence we require Python (=>3.6) and pip for its installation. It is also recommended to install it in a virtual environment. With these requirements in place, do the follwing:

  • Pull this repository to your preferred directory via git or simply download all files.
  • In the terminal, change to the directory where you downloaded this repository to. Then create a new virtual environment, e.g. with the name venv, and activate it:
virtualenv venv
. venv/bin/activate
  • Finally, install the arXiv script via
pip install --editable .

This also installs the required packages. We will soon release the axs on pypi making the installation easier.

NOTE: In the following we will run all our commands in this virtual environment.

setup

After installation it is recommended to set a default directory, which needs to exist already, where articles are downloaded to. This is done via

axs --set-directory PATH_TO_DIR

where PATH_TO_DIR is our chosen directory path. Alternatively, we can give a directory for each download, see below. To set a default .bib-file, where BibTeX-entries are added to, simply use

axs --set-bib-file PATH_TO_FILE

Here PATH_TO_FILE is our chosen default .bib-file where our BibTeX-entries will be added to. As before, we can alternatively choose a .bib-file for each BibTeX-entry individually, see below.

the commands in detail

The basic usage is the following

axs cmd flag ax_id  

where cmd one of the commands below, flag is an (optional) flag and ax_id is an arXiv identifier. In fact, you can combine several flags.

The flag --help provides help for each command. Note that axs --help gives quick general help.

show

This command prints the title, (some of) the authors, the abstract and the main arXiv subject of the corresponding arXiv preprint. The flag -f gives a full version, i.e. additionally all authors and the main arXiv subject.

get

Simply downloads the article to your default directory (if it was already set as explained above) under the file name AUTHOR(S)-TITLE-YEAR.pdf. Two comments on the file name:

  • For => 3 authors, we use the first author and append 'et al'.
  • The title name is shortened to 15 words to prevent too long file names.

Before the download, the title and author(s) are printed to the terminal and there is a short countdown so that we can still cancel if we accidentally entered the wrong arXiv identifier.

With the flag -d (or --directory) we can download the article to another directory, i.e.

axs ax_id get -dir PATH_TO_DIR

downloads the article to the directory at PATH_TO_DIR. The flag -o (or --open) opens the preprint after the download.

bib

Prints a BibTeX-entry of the article to the terminal and asks if it should be added to our default .bib-file (if it has been set before). Alternatively, use the flag -a (or --add-to) combined with the path to another .bib-file to which we want to add the BibTeX-entry. Note that at the moment, the BibTeX-entry is simply added to the end of the corresponding .bib-file, so it is not (yet) sorted e.g. alphabetically.

Three comments on the BibTeX-entry:

  • The BibTeX-key, which is used to cite the preprint in a LaTeX document (Perelman-EntropyFormulaFor-math/0211159 in our example above), is created is in the formate AUTHOR(S)-SHORT_TITLE-AX_ID where ++ AUTHOR(S): as for the file name but without white spaces. ++ SHORT_TITLE: created from the title by removing all articles & most common prepositions and then taking the first three words. Finally, remove all white spaces. ++ AX_ID: the arXiv identifier which is added to make the BibTeX-entry unique.
  • The BibTeX-key is reasonably concise but contains enough information so that it can be easily found with the auto-completion for citations in any modern LaTeX editor.
  • We only put curly braces around capital words to make it compatible with as many citation styles in LaTeX as possible (see for example this discussion).

planned features

In the future we plan to implement the following:

  • Option to automatically download articles of different arXiv main subjects to different folders.
  • Many arXiv preprints have already been published. Give an option to search for the BibTeX-entry of the published version (e.g. in zbMATH for Mathematics).
  • More convenient installation without requiring an installed version of Python.
  • 'Browsing' arXiv in the terminal?

background

Even though there are great tools to manage scientific articles (e.g. Mendeley or Zotero), I realized - after using them for a while - that I saved way too many arXiv articles. Eventually, I manually downloaded only the important ones to one and the same directory. However, one problem, e.g. when trying to find an article, was that I did not stick to a systematic file name. So the idea for the arXiv script was born. Since I was not that satisfied with the available common BibTeX-entries of arXiv articles, I've automated them myself.

Altogether I hope that this script is useful for others as well who prefer a minimalistic management of (arXiv) articles.

main changes from v0.1 to v0.2

  • In v0.1 the syntax was quite unconventional (axs AX_ID CMD instead of axs CMD AX_ID).
  • BUG FIX: The axs now works 'anywhere' in the terminal not just in its directory. To do so, we now use use environment variables (via the dotenv package to make them last) to store the default directory and bib-file. Thanks @r-raymond!

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

arxiv-script-0.2.0.tar.gz (14.2 kB view hashes)

Uploaded Source

Built Distribution

arxiv_script-0.2.0-py3-none-any.whl (13.0 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page