Skip to main content

Export arxiv papers to pdf formats

Project description

Arxix Export

Arxiv Export is a Python library that allows you to search, download, and manage scientific articles from arXiv.org. It is useful for automating paper downloads and obtaining structured information about articles.

Installation

pip install arxiv-export

Usage Example

from arxiv_export import export_papers

def main():
    search_query = "quantum computing"
    download_path = "./arxiv_papers"
    max_results = 5

    papers = export_papers(
        search=search_query,
        path_download=download_path,
        max_results=max_results
    )

    for paper in papers:
        print(f"Title: {paper.title}")
        print(f"Authors: {', '.join(paper.authors)}")
        print(f"Summary: {paper.summary}")
        print(f"Link: {paper.link}")
        print(f"Path: {paper.path}")
        print(f"Documents: {len(paper.documents)}")
        print("-" * 80)

if __name__ == "__main__":
    main()

Features

  • Search for articles on arXiv using keywords.
  • Automatically download article PDFs.
  • Access metadata such as title, authors, abstract, link, and local path.
  • Manage multiple results with a single command.

Main Parameters

  • search: search string (e.g., "quantum computing").
  • path_download: path to save the PDFs.
  • max_results: maximum number of articles to download.

Vector Database for LLMs

The documents property provides a list of Document files intended for ingestion into a vector database. These files are commonly used to supply structured data to language models (LLMs), supporting semantic search and advanced analysis.

License

This library is distributed under the MIT license.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

arxiv_export_documents-0.1.1.tar.gz (6.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

arxiv_export_documents-0.1.1-py3-none-any.whl (6.4 kB view details)

Uploaded Python 3

File details

Details for the file arxiv_export_documents-0.1.1.tar.gz.

File metadata

  • Download URL: arxiv_export_documents-0.1.1.tar.gz
  • Upload date:
  • Size: 6.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.2

File hashes

Hashes for arxiv_export_documents-0.1.1.tar.gz
Algorithm Hash digest
SHA256 2bf1df17e4c0d229f312bf3998dcdf60d300157de2c2f83b0b87bf50b919d6b6
MD5 b7babf04ce7b584f409a404d6f4fb471
BLAKE2b-256 3dae5b3d50759829e5b1d2872516911d680818dc3105ee3faf376fde84a39661

See more details on using hashes here.

File details

Details for the file arxiv_export_documents-0.1.1-py3-none-any.whl.

File metadata

File hashes

Hashes for arxiv_export_documents-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 b21abdd43bf63ce0eb789c14c7c0ad8bd4f2c8b7e2bf40e7aaaf3d5b0cd6160c
MD5 f34fbc543161db349302f7cc4b43d30c
BLAKE2b-256 4f480a62b6dcb3fe19e0aed92b2b6a9cb179899e479d49c7d3d6c5cf997e4e8b

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page