
"Automate your ArXiv paper search, retrieval, and summarization process."

Description

arxivcli is a lightweight command-line tool designed to automate the retrieval of computer science papers from ArXiv. The retrieval can be done using specified ArXiv computer science archive categories or using the full or partial title of a specific paper, if available. Paper retrieval can be refined by author.

This tool is built with Python. It uses the Typer library for the command-line interface and Python's ElementTree XML package to parse XML responses from the arXiv API. It can be useful for researchers, engineers, or students who want to quickly retrieve an arXiv paper or keep abreast of the latest research in their field without leaving their terminal/workstation.
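As a rough illustration of the XML parsing step, here is a minimal sketch of reading an Atom feed with ElementTree. The arXiv API returns Atom XML, but the sample feed below is hand-written for illustration, and the `parse_feed` function and its dictionary keys are hypothetical names, not arxivcli's actual code.

```python
import xml.etree.ElementTree as ET

# Atom namespace used by arXiv API responses.
ATOM = {"atom": "http://www.w3.org/2005/Atom"}

# Hand-written sample shaped like an arXiv API response (illustrative only).
SAMPLE_FEED = """<feed xmlns="http://www.w3.org/2005/Atom">
  <entry>
    <id>http://arxiv.org/abs/0000.00000v1</id>
    <published>2024-01-01T00:00:00Z</published>
    <title>An Example
      Paper Title</title>
    <author><name>A. Author</name></author>
    <author><name>B. Author</name></author>
  </entry>
</feed>"""


def parse_feed(xml_text):
    """Return a list of dicts with title, authors, published date, and link."""
    root = ET.fromstring(xml_text)
    papers = []
    for entry in root.findall("atom:entry", ATOM):
        raw_title = entry.findtext("atom:title", default="", namespaces=ATOM)
        papers.append({
            # Titles may wrap across lines, so collapse runs of whitespace.
            "title": " ".join(raw_title.split()),
            "authors": [
                author.findtext("atom:name", default="", namespaces=ATOM)
                for author in entry.findall("atom:author", ATOM)
            ],
            "published": entry.findtext("atom:published", default="", namespaces=ATOM),
            "link": entry.findtext("atom:id", default="", namespaces=ATOM),
        })
    return papers


if __name__ == "__main__":
    for paper in parse_feed(SAMPLE_FEED):
        print(paper["title"], "|", ", ".join(paper["authors"]), "|", paper["published"])
```

Passing the namespace map to every lookup is needed because all elements in the feed live in the Atom namespace.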

Although my current focus in building arxivcli is the computer science archive, it can easily be used with categories from other arXiv archives, e.g., math.CO.

Features [more coming soon--see Notion page below for more info]

  • Fetches the most recent papers from ArXiv by specified categories
  • Fetches papers from ArXiv by title
  • Refines fetch and search results by author for more precise results
  • Displays paper details including title, authors, publication date, and link to paper's page
  • Easy-to-use command-line interface built with Typer
  • Configurable number of results to fetch
  • Built using only the standard library and tried and tested packages.

Installation

  1. Clone the repository:
    git clone https://github.com/MimicTester1307/arxiv_retriever.git
    cd arxiv_retriever  
    
  2. Install the package and dependencies:
    pip install .
    

Usage

To retrieve the most recent computer science papers by categories, use the fetch command followed by the categories and options:

axiv fetch <categories> [--limit]

This outputs up to limit papers (the value passed to --limit), sorted by submittedDate in descending order.

To filter results by author(s) use:

  axiv fetch <categories> [--limit] [--authors]

This outputs up to limit papers, sorted by submittedDate in descending order and filtered by the given authors.
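For readers curious how such a fetch could map onto the arXiv API, here is a minimal sketch that builds a query URL. The `search_query`, `sortBy`, `sortOrder`, and `max_results` parameters and the `cat:` and `au:` field prefixes come from the arXiv API. The `build_fetch_url` function is a hypothetical name, and combining categories with OR and authors with AND is an assumption, so arxivcli's own query construction may differ.

```python
from urllib.parse import urlencode

# Public arXiv API query endpoint.
ARXIV_API = "http://export.arxiv.org/api/query"


def build_fetch_url(categories, limit=10, authors=None):
    """Build a URL for the newest papers in the given categories."""
    # Assumption: a paper matches if it belongs to any listed category.
    query = " OR ".join(f"cat:{category}" for category in categories)
    if authors:
        # Assumption: any listed author is enough to match.
        author_query = " OR ".join(f'au:"{author}"' for author in authors)
        query = f"({query}) AND ({author_query})"
    params = {
        "search_query": query,
        "sortBy": "submittedDate",
        "sortOrder": "descending",
        "max_results": limit,
    }
    return f"{ARXIV_API}?{urlencode(params)}"


if __name__ == "__main__":
    print(build_fetch_url(["cs.AI", "cs.GL"], limit=5))
```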

To retrieve limit papers matching a specified title, use the search command followed by a title and options:

axiv search <title> [--limit]

This outputs up to limit papers, sorted by relevance in descending order.

To filter results by author(s) use:

  axiv search <title> [--limit] [--authors]

This outputs up to limit papers, sorted by relevance in descending order and filtered by the given authors.
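A title search could be expressed against the arXiv API in a similar way. The sketch below uses the API's `ti:` field prefix with a quoted phrase and `sortBy=relevance`. The `build_search_url` function is a hypothetical name, and requiring every listed author with AND is an assumption rather than arxivcli's confirmed behavior.

```python
from urllib.parse import urlencode

# Public arXiv API query endpoint.
ARXIV_API = "http://export.arxiv.org/api/query"


def build_search_url(title, limit=10, authors=None):
    """Build a URL for papers whose title matches a full or partial title."""
    # Quote the title so it is matched as a phrase.
    query = f'ti:"{title}"'
    for author in authors or []:
        # Assumption: each listed author must appear on the paper.
        query += f' AND au:"{author}"'
    params = {
        "search_query": query,
        "sortBy": "relevance",
        "sortOrder": "descending",
        "max_results": limit,
    }
    return f"{ARXIV_API}?{urlencode(params)}"


if __name__ == "__main__":
    print(build_search_url("Attention is all you need", limit=5, authors=["Ashish"]))
```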

Example

Fetch the latest 5 papers in the cs.AI and cs.GL categories:

axiv fetch cs.AI cs.GL --limit 5

Search for up to 5 papers matching the title "Attention is all you need", filtered by the author "Ashish":

axiv search "Attention is all you need" --limit 5 --authors "Ashish"

Contributing

Contributions are welcome! Please fork the repository and submit a pull request for any features, bug fixes, or enhancements.

License

This project is licensed under the MIT license. See the LICENSE file for more details.

Acknowledgements

  • Typer for the command-line interface
  • ElementTree for XML parsing
  • arXiv API for providing access to paper metadata
  • Notion for helping me track my progress and document my learning.
