Skip to main content

A package to download arXiv papers and interact with PDFs using Ollama LLM

Project description

Academicagent

Academicagent is a Python package that integrates downloading papers from arXiv and evaluating them using a local large model (Ollama).

English | 简体中文


Features

  • Download arXiv Papers
    Search for and download a specified number of PDF papers from arXiv based on the keywords provided by the user.

  • Large Model Q&A
    For each downloaded paper, extract the first page from the PDF and use the local large model to generate a Chinese summary along with an evaluation score for the paper.


Installation

  1. Download Ollama and Start the Local Large Model Service

    Download Ollama from: https://ollama.com/

    For example, to download the deepseek-r1:1.5b model, run:

    ollama pull deepseek-r1:1.5b
    
  2. Install paperagent

Install using pip:

pip install paperagent

Usage Example

from academicagent.agent import run_agent

run_agent(paper_keyword="object detection", total_count=1, save_path="papers", model_name="deepseek-r1:1.5b")

Input Parameters

  • paper_keyword (string): The keyword used to search for papers on arXiv.
    Example: "object detection"

  • total_count (integer): The total number of papers to download.
    Example: 5

  • save_path (string, default "papers"): The folder path where the downloaded PDFs will be saved. If not provided, it defaults to "papers".
    Example: "papers"

  • question (string, optional): The question to provide to the large model. If not specified, the default question is:
    "Please generate a Chinese summary of this paper and evaluate its value based on originality, effectiveness, and scope, on a scale of 0 to 10, then provide your score after the summary is generated."

  • model_name (string, default "deepseek-r1:1.5b"):
    The name of the local large model to be used for invoking Ollama.

Output

  • PDF Download: The PDFs of the papers are downloaded from arXiv into the specified save_path folder based on the provided keyword and count.

  • Evaluation File: The title of each paper and the large model's response are written into a Markdown file.


Version History

  • v0.1.0
    Initial release, implementing arXiv paper downloading and large model Q&A functionality.

Contact

If you have any questions or suggestions, please feel free to submit an issue or pull request.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

academicagent-0.1.2.tar.gz (6.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

academicagent-0.1.2-py3-none-any.whl (6.6 kB view details)

Uploaded Python 3

File details

Details for the file academicagent-0.1.2.tar.gz.

File metadata

  • Download URL: academicagent-0.1.2.tar.gz
  • Upload date:
  • Size: 6.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.16

File hashes

Hashes for academicagent-0.1.2.tar.gz
Algorithm Hash digest
SHA256 c24a65d09b7a0e105c94b8b461876a68a164a479fdfa9c7eb4e2385a091bbb7f
MD5 5572420517d9bcf6dd308461873830b6
BLAKE2b-256 543fd0dd41d9dd3972262a283d4d99ccb69b0ac386b3813b47f092fbe751f4f4

See more details on using hashes here.

File details

Details for the file academicagent-0.1.2-py3-none-any.whl.

File metadata

  • Download URL: academicagent-0.1.2-py3-none-any.whl
  • Upload date:
  • Size: 6.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.16

File hashes

Hashes for academicagent-0.1.2-py3-none-any.whl
Algorithm Hash digest
SHA256 29e4dcb07aeb8e05d0f73ab69262f8c437d9e7fe75c672fffe3290bcb1bafef8
MD5 7d12a5dbc3f349ae3fda67d1e148f780
BLAKE2b-256 edc8f7c8f8a156609a8ea09cbb179b589e1b017f2ddf38db31c554f762a5672b

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page