Skip to main content

A tool for downloading academic paper references

Project description

PaperChaser

A Python tool for extracting and downloading academic paper references. Supports both single and two-column PDF formats.

Features

  • Extract references from PDF files with support for:
    • Single and two-column layouts
    • Multiple reference formats (numbered, bracketed, author-year)
    • Smart reference section detection
  • Download papers using:
    • DOI lookup
    • Title-based search
    • Multiple sources (Sci-Hub, PubMed Central, arXiv, etc.)
  • Automatic PDF renaming with metadata
  • Beautiful command-line interface with:
    • Progress bars
    • Colored status indicators
    • Formatted tables
    • Detailed error reporting

Installation

pip install paperchaser

Usage

Basic usage:

# Extract and download references from a PDF
paperchaser path/to/paper.pdf

# Extract references without downloading (debug mode)
paperchaser path/to/paper.pdf --debug

# Specify output directory
paperchaser path/to/paper.pdf -o downloaded_papers

# Force re-download of existing papers
paperchaser path/to/paper.pdf -f

Advanced options:

Options:
  -o, --output-dir TEXT      Directory to save downloaded papers
  --email TEXT              Email for Unpaywall API (optional)
  -f, --force              Force download even if paper already exists
  --rename/--no-rename     Rename downloaded PDFs using metadata (default: True)
  --debug                  Show debug information without downloading
  --help                   Show this message and exit

Output Format

The tool provides detailed information about the extraction and download process:

  • Reference extraction progress
  • Found references with numbering
  • Download status for each paper
  • Color-coded success/failure indicators
  • Detailed error messages for failed downloads

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

paperchaser-0.1.1.tar.gz (10.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

paperchaser-0.1.1-py3-none-any.whl (10.3 kB view details)

Uploaded Python 3

File details

Details for the file paperchaser-0.1.1.tar.gz.

File metadata

  • Download URL: paperchaser-0.1.1.tar.gz
  • Upload date:
  • Size: 10.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.0.1 CPython/3.12.2

File hashes

Hashes for paperchaser-0.1.1.tar.gz
Algorithm Hash digest
SHA256 cd51d7c78cc0b4a1abbfef26aefb22c196925cc78a7298aa4e299cb5eaa2c330
MD5 bfb9c2659ac9d2a3b6ff2978a31e4136
BLAKE2b-256 42cfe4635a7a9f6465e0765269375a32787bf1e2241c0b15c7473d172bcf1b0d

See more details on using hashes here.

File details

Details for the file paperchaser-0.1.1-py3-none-any.whl.

File metadata

  • Download URL: paperchaser-0.1.1-py3-none-any.whl
  • Upload date:
  • Size: 10.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.0.1 CPython/3.12.2

File hashes

Hashes for paperchaser-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 21115880be90f94278894fff479be861e175d279714c64a59d5aeff7c56c63b6
MD5 814e72c22e6a60531cdf9c6c2ac4934e
BLAKE2b-256 4c9a88a6285307c118dd8f19eab98d1e2debc7d4b883d9be787fd2a40cf49909

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page