Export arXiv papers to PDF format

Project description

Arxiv Export

Arxiv Export is a Python library that allows you to search, download, and manage scientific articles from arXiv.org. It is useful for automating paper downloads and obtaining structured information about articles.

Installation

pip install arxiv-export-documents

Usage Example

import asyncio
from arxiv_export_documents import export_papers


async def main():
    search_query = "quantum computing"
    download_path = "./arxiv_papers"
    max_results = 5

    async for paper in export_papers(
        search=search_query,
        path_download=download_path,
        max_results=max_results
    ):
        print(f"Downloaded paper: {paper.title}")
        print(f"Authors: {', '.join(paper.authors)}")
        print(f"Summary: {paper.summary}")
        print(f"Link: {paper.link}")
        print(f"Path: {paper.path}")
        print(f"Documents: {len(paper.documents)}")
        print("-" * 80)


if __name__ == "__main__":
    asyncio.run(main())

Features

  • Search for articles on arXiv using keywords.
  • Automatically download article PDFs.
  • Access metadata such as title, authors, abstract, link, and local path.
  • Manage multiple results with a single command.

Main Parameters

  • search: search string (e.g., "quantum computing").
  • path_download: path to save the PDFs.
  • max_results: maximum number of articles to download.
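As a rough sketch of how these three parameters fit together, the helper below drains the async iterator into a list of plain dictionaries. The names collect_papers and fake_export_papers are hypothetical and not part of the library; fake_export_papers is only a stand-in that mimics the attributes shown in the usage example, so the snippet runs without network access. In real use you would presumably pass export_papers from arxiv_export_documents instead.

```python
import asyncio
from types import SimpleNamespace


async def collect_papers(export_fn, search, path_download, max_results):
    """Drain the async iterator returned by export_fn into a list of dicts."""
    rows = []
    async for paper in export_fn(
        search=search,
        path_download=path_download,
        max_results=max_results,
    ):
        rows.append(
            {
                "title": paper.title,
                "authors": list(paper.authors),
                "link": paper.link,
                "path": paper.path,
            }
        )
    return rows


async def fake_export_papers(search, path_download, max_results):
    """Hypothetical stand-in for export_papers; it downloads nothing."""
    for i in range(max_results):
        yield SimpleNamespace(
            title=f"{search} paper {i}",
            authors=["A. Author"],
            link=f"https://example.org/{i}",
            path=f"{path_download}/{i}.pdf",
        )


rows = asyncio.run(
    collect_papers(fake_export_papers, "quantum computing", "./arxiv_papers", 2)
)
```

Passing the export function as an argument keeps the collection logic testable offline; swapping in the real export_papers should only require changing that one argument, assuming it accepts the keyword arguments listed above.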

Vector Database for LLMs

Each paper's documents property provides a list of Document files intended for ingestion into a vector database. These files are commonly used to supply structured data to large language models (LLMs), supporting semantic search and advanced analysis.
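To illustrate what ingestion for semantic search can look like, here is a toy in-memory index. It is a sketch under loud assumptions: the attributes of the library's Document type are not described here, so the example takes plain strings as if the text had already been extracted from each document, and it uses word counts in place of a real embedding model. TinyVectorIndex, embed, and cosine are hypothetical names, not part of the library.

```python
import math
import re
from collections import Counter


def embed(text):
    """Toy embedding: lowercase word counts (stand-in for a real embedding model)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class TinyVectorIndex:
    """Hypothetical in-memory index; a real vector database would replace this."""

    def __init__(self):
        self._items = []

    def add(self, text, metadata):
        # Store the vector alongside the original text and its metadata.
        self._items.append((embed(text), text, metadata))

    def search(self, query, k=1):
        # Rank stored items by similarity to the query, highest first.
        q = embed(query)
        ranked = sorted(self._items, key=lambda item: cosine(q, item[0]), reverse=True)
        return [(text, meta) for _, text, meta in ranked[:k]]


index = TinyVectorIndex()
index.add("Qubits and quantum error correction codes", {"title": "Paper A"})
index.add("Convolutional networks for image classification", {"title": "Paper B"})
best_text, best_meta = index.search("quantum error correction")[0]
```

A production setup would swap embed for a learned embedding model and the list for an actual vector store; the add/search shape is the part this sketch is meant to show.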

License

This library is distributed under the MIT license.

Download files

Source Distribution

arxiv_export_documents-0.1.5.tar.gz (6.0 kB)

Uploaded Source

Built Distribution

arxiv_export_documents-0.1.5-py3-none-any.whl (6.5 kB)

Uploaded Python 3

File details

Details for the file arxiv_export_documents-0.1.5.tar.gz.

File metadata

  • Download URL: arxiv_export_documents-0.1.5.tar.gz
  • Upload date:
  • Size: 6.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.2

File hashes

Hashes for arxiv_export_documents-0.1.5.tar.gz

Algorithm    Hash digest
SHA256       1e39eddb83c0279996b6642361dc2bc96d976e55f8e737aae862f3f6124796b5
MD5          59892822b5907712c532585aee6e7dcf
BLAKE2b-256  59f57834508350d69abff1fb920672f8a2f71c56329d66daa1a8bef157913604
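
A published digest is only useful if it is checked against the downloaded file. The following minimal sketch uses Python's standard hashlib module to compute a file's SHA256 and compare it with an expected value. The function names sha256_of_file and matches_published_hash are illustrative, and the demonstration runs on a temporary file rather than the actual archive.

```python
import hashlib
import os
import tempfile


def sha256_of_file(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, read in fixed-size chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for block in iter(lambda: handle.read(chunk_size), b""):
            digest.update(block)
    return digest.hexdigest()


def matches_published_hash(path, expected_hex):
    """Compare the file's digest with a published hex digest, ignoring case."""
    return sha256_of_file(path) == expected_hex.strip().lower()


# Demonstration on a temporary file so the snippet runs without a download.
with tempfile.NamedTemporaryFile(delete=False) as tmp:
    tmp.write(b"hello")
    demo_path = tmp.name

demo_ok = matches_published_hash(demo_path, hashlib.sha256(b"hello").hexdigest())
demo_bad = matches_published_hash(demo_path, "0" * 64)
os.remove(demo_path)
```

To check the source distribution, you would call matches_published_hash with the path to the downloaded arxiv_export_documents-0.1.5.tar.gz and the SHA256 value listed above.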


File details

Details for the file arxiv_export_documents-0.1.5-py3-none-any.whl.

File hashes

Hashes for arxiv_export_documents-0.1.5-py3-none-any.whl

Algorithm    Hash digest
SHA256       2ef6c21a6964f70c43926c0bd6af0369f231d902e348cc1633403b10e37031b5
MD5          ab95cb077a4987bc92edfbbbc23c326f
BLAKE2b-256  8ce0eebf264f842d8fb2d31bc868a99b0b55ca4a5487c6994a9d51afce19db28

