Skip to main content

Find preprints relevant to your individual research

Project description

arxiv-scan

Scan abstract listings on ArXiV for keywords and favorite authors in your categories to distill a list of papers most relevant for you. Keywords can be typed in manually or be found by ranking word occurrences in a provided file (e.g., a .bib file).

arxiv-scan was created by Robert Glas, Simeon Doetsch, and Martin Schlecker.

Installation

Requirements: Python >=3.5

Using pip

We recommend to install the latest stable version of arxiv-scan using pip (or pipx for an isolated environment):

pip install arxiv-scan

:information_source: Depending on your Python installation, you might instead need pip3 install arxiv-scan or python3 -m pip install arxiv-scan.

From source

arxiv-scan is being developed on github. If you feel like hacking, feel free to install the latest version from there:

pip install --upgrade git+https://github.com/matiscke/arxiv-scan.git

Usage

Query today's arXiV listing for relevant papers

First setup your keywords and authors (see configuration section), then just run arxiv-scan (or python -m arxiv_scan) to get the relevant listings.

Command line reference

usage: arxiv-scan [-h] [--config /path/to/config] [--default-config [/path/to/config]]
                  [--config-convert [/path/to/config]] [--edit] [-d DATE] [-l LENGTH]
                  [-v RATING] [-c CATEGORIES] [--reverse] [--show-resubmissions]
                  [--ignore-cross-lists] [--ignore-abstract] [--log {info,debug}]
                  [--version]

optional arguments:
  -h, --help            show this help message and exit
  --config /path/to/config
                        Path to configuration file (check README for defaults)
  --default-config [/path/to/config]
                        Write default config to default location (or specified path)
  --config-convert [/path/to/config]
                        Convert authors and keywords config from legacy format
  --edit                Edit config in default text editor
  -d DATE, --date DATE  "new", or "recent", number of days in the past, "YYYY-MM" or
                        "YYYY-MM-DD". Defaults to "new"
  -l LENGTH, --len LENGTH
                        length of result list, all is -1
  -v RATING, --rating RATING
                        minimum rating for result list
  -c CATEGORIES, --categories CATEGORIES
                        arXiv subjects to scan, comma separated list
  --reverse             reverse list (lowest ranked paper on top)
  --show-resubmissions  Include resubmissions
  --ignore-cross-lists  Include cross-lists
  --ignore-abstract     Ignore abstract in rating
  --log {info,debug}    Set loglevel
  --version             show program's version number and exit

Configuration

In the configuration file all the keywords and authors have to be set, as well as other optional configuration.

The easiest way to get started is to run arxiv-scan --edit, this will open the configuration file in the default text editor.

Alternatively create a default configfile with arxiv-scan --default-config, and edit it manually.

arXiv topics can be selected with the categories option, it accepts a comma-separated list of topics. List of topics

Configuration format:

[authors]
# author = rating
Alpher = 1
Bethe = 2
Gamov = 3

[keywords]
# keyword = rating
star = 1
planet = 2
habitable = 3

[options]
# other options (can also be set on CLI)
# default is used if ommited
categories = astro-ph.EP
date = new
length = -1
minimum_rating = 10 
reverse_list = False
show_resubmissions = False
show_cross_lists = True

Automatically extract keywords from a file (e.g. one with bibtex entries):

  • Run arxiv-scan.wordcounter file_to_scan (or python -m scan_astroph.wordcounter file_to_scan). It scans the text file and extracts words with 4-12 characters from it, sorted by occurrence in the file.
  • You will be asked to rank these suggested keywords. For each word shown, press 'Enter' to reject it or provide an integer rating, e.g., from 1 to 5 (higher=more relevant). Conclude by pressing C.
  • Manually insert particularly important authors into the config file (e.g. with arxiv-scan --edit)

Configuration locations:

arxiv-scan searches the these paths for the config file, and loads the first found:

  • from environment variable: $ARXIV_SCAN_CONF
  • from home directory: ~/.arxiv-scan.conf
  • default path (platform dependent):
    • on Linux / Unix (except MacOS): $XDG_CONFIG_HOME/arxiv-scan/arxiv-scan.conf (XDG_CONFIG_HOME defaults to ~/.config)
    • on MacOS: ~/Library/Application Support/arxiv-scan/arxiv-scan.conf
    • on Windows: $HOME/Documents/arxiv-scan/arxiv-scan.conf

Feedback

All feedback, including bug reports, feature requests, pull requests, etc., is welcome. arxiv-scan is being actively developed in an open repository; if you have any trouble please raise an issue.


License: MIT License

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

arxiv-scan-1.0b0.tar.gz (15.7 kB view details)

Uploaded Source

Built Distribution

arxiv_scan-1.0b0-py3-none-any.whl (15.3 kB view details)

Uploaded Python 3

File details

Details for the file arxiv-scan-1.0b0.tar.gz.

File metadata

  • Download URL: arxiv-scan-1.0b0.tar.gz
  • Upload date:
  • Size: 15.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.7.1 importlib_metadata/4.8.2 pkginfo/1.8.2 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.9.9

File hashes

Hashes for arxiv-scan-1.0b0.tar.gz
Algorithm Hash digest
SHA256 eb8286f5744508b2073b5ae192644d057fffc95c51f86c9bfcac4bc12c484633
MD5 1106f84b845504868b7357b09e19d85a
BLAKE2b-256 162d2ab36497f4f9bd39b80532695dd92467610bd1211574d81cf96f060b9153

See more details on using hashes here.

File details

Details for the file arxiv_scan-1.0b0-py3-none-any.whl.

File metadata

  • Download URL: arxiv_scan-1.0b0-py3-none-any.whl
  • Upload date:
  • Size: 15.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.7.1 importlib_metadata/4.8.2 pkginfo/1.8.2 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.9.9

File hashes

Hashes for arxiv_scan-1.0b0-py3-none-any.whl
Algorithm Hash digest
SHA256 d799bf2306aea57f505b1e43382ee27df25396753d66dd3b7a9985423b093ade
MD5 3ed8c1332cb20768eed6f8af219b6af7
BLAKE2b-256 f0443079bc7e5acbf7b6a27670c590151450879e488a9d0d48768d2ddd21d77b

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page