Python wrapper for the arXiv API: https://arxiv.org/help/api/

These details have not been verified by PyPI

Project links

Homepage

Project description

arxiv.py

PyPI - Python Version

Python wrapper for the arXiv API.

arXiv is a project by the Cornell University Library that provides open access to 1,000,000+ articles in Physics, Mathematics, Computer Science, Quantitative Biology, Quantitative Finance, and Statistics.

Usage

Installation

$ pip install arxiv

In your Python script, include the line

import arxiv

Examples

Fetching results

import arxiv

# Construct the default API client.
client = arxiv.Client()

# Search for the 10 most recent articles matching the keyword "quantum."
search = arxiv.Search(
  query = "quantum",
  max_results = 10,
  sort_by = arxiv.SortCriterion.SubmittedDate
)

results = client.results(search)

# `results` is a generator; you can iterate over its elements one by one...
for r in client.results(search):
  print(r.title)
# ...or exhaust it into a list. Careful: this is slow for large results sets.
all_results = list(results)
print([r.title for r in all_results])

# For advanced query syntax documentation, see the arXiv API User Manual:
# https://arxiv.org/help/api/user-manual#query_details
search = arxiv.Search(query = "au:del_maestro AND ti:checkerboard")
first_result = next(client.results(search))
print(first_result)

# Search for the paper with ID "1605.08386v1"
search_by_id = arxiv.Search(id_list=["1605.08386v1"])
# Reuse client to fetch the paper, then print its title.
first_result = next(client.results(search))
print(first_result.title)

Downloading papers

To download a PDF of the paper with ID "1605.08386v1," run a Search and then use Result.download_pdf():

import arxiv

paper = next(arxiv.Client().results(arxiv.Search(id_list=["1605.08386v1"])))
# Download the PDF to the PWD with a default filename.
paper.download_pdf()
# Download the PDF to the PWD with a custom filename.
paper.download_pdf(filename="downloaded-paper.pdf")
# Download the PDF to a specified directory with a custom filename.
paper.download_pdf(dirpath="./mydir", filename="downloaded-paper.pdf")

The same interface is available for downloading .tar.gz files of the paper source:

import arxiv

paper = next(arxiv.Client().results(arxiv.Search(id_list=["1605.08386v1"])))
# Download the archive to the PWD with a default filename.
paper.download_source()
# Download the archive to the PWD with a custom filename.
paper.download_source(filename="downloaded-paper.tar.gz")
# Download the archive to a specified directory with a custom filename.
paper.download_source(dirpath="./mydir", filename="downloaded-paper.tar.gz")

Fetching results with a custom client

import arxiv

big_slow_client = arxiv.Client(
  page_size = 1000,
  delay_seconds = 10.0,
  num_retries = 5
)

# Prints 1000 titles before needing to make another request.
for result in big_slow_client.results(arxiv.Search(query="quantum")):
  print(result.title)

Logging

To inspect this package's network behavior and API logic, configure a DEBUG-level logger.

>>> import logging, arxiv
>>> logging.basicConfig(level=logging.DEBUG)
>>> client = arxiv.Client()
>>> paper = next(client.results(arxiv.Search(id_list=["1605.08386v1"])))
INFO:arxiv.arxiv:Requesting 100 results at offset 0
INFO:arxiv.arxiv:Requesting page (first: False, try: 0): https://export.arxiv.org/api/query?search_query=&id_list=1605.08386v1&sortBy=relevance&sortOrder=descending&start=0&max_results=100
DEBUG:urllib3.connectionpool:Starting new HTTPS connection (1): export.arxiv.org:443
DEBUG:urllib3.connectionpool:https://export.arxiv.org:443 "GET /api/query?search_query=&id_list=1605.08386v1&sortBy=relevance&sortOrder=descending&start=0&max_results=100&user-agent=arxiv.py%2F1.4.8 HTTP/1.1" 200 979

Types

Client

A Client specifies a reusable strategy for fetching results from arXiv's API. For most use cases the default client should suffice.

Clients configurations specify pagination and retry logic. Reusing a client allows successive API calls to use the same connection pool and ensures they abide by the rate limit you set.

Search

A Search specifies a search of arXiv's database. Use Client.results to get a generator yielding Results.

Result

The Result objects yielded by Client.results include metadata about each paper and helper methods for downloading their content.

The meaning of the underlying raw data is documented in the arXiv API User Manual: Details of Atom Results Returned.

Result also exposes helper methods for downloading papers: Result.download_pdf and Result.download_source.

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

This version

2.2.0

Apr 8, 2025

2.1.3

Jun 25, 2024

2.1.2

Jun 23, 2024

2.1.1

Jun 22, 2024

2.1.0

Dec 18, 2023

2.0.0

Oct 17, 2023

1.4.8

Jul 11, 2023

1.4.7

Apr 18, 2023

1.4.6

Apr 18, 2023

1.4.5

Apr 17, 2023

1.4.4

Apr 11, 2023

1.4.3

Feb 1, 2023

1.4.2

Aug 18, 2021

1.4.1

Jul 31, 2021

1.4.0

Jul 13, 2021

1.3.0

Jul 2, 2021

1.2.0

Apr 25, 2021

1.1.0

Apr 20, 2021

1.0.2

Apr 17, 2021

1.0.1

Apr 5, 2021

1.0.0

Apr 4, 2021

0.5.4

Apr 2, 2021

0.5.3

Feb 23, 2020

0.5.2

Feb 15, 2020

0.5.1

Jun 15, 2019

0.5.0

Jun 15, 2019

0.4.0

May 19, 2019

0.3.1

Dec 21, 2018

0.3.0

Dec 17, 2018

0.2.3

Jun 20, 2018

0.2.2

Jul 28, 2017

0.2.1

Jul 27, 2017

0.1.1

Sep 18, 2016

0.1.0

Jul 24, 2016

0.0.3

Nov 26, 2015

0.0.2

Nov 26, 2015

0.0.1

Nov 25, 2015

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

arxiv-2.2.0.tar.gz (16.9 kB view details)

Uploaded Apr 8, 2025 Source

Built Distribution

arxiv-2.2.0-py3-none-any.whl (11.7 kB view details)

Uploaded Apr 8, 2025 Python 3

File details

Details for the file arxiv-2.2.0.tar.gz.

File metadata

Download URL: arxiv-2.2.0.tar.gz
Upload date: Apr 8, 2025
Size: 16.9 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.10.4

File hashes

Hashes for arxiv-2.2.0.tar.gz
Algorithm	Hash digest
SHA256	`6072a2211e95697092ef32acde0144d7de2cfa71208e2751724316c9df322cc0`
MD5	`852ed0cecfeb7fb7bf9531373d01bfdb`
BLAKE2b-256	`0b163d72446400a59d1fbda24fed2289661398994164e07d72cfa85e43ce5e36`

See more details on using hashes here.

File details

Details for the file arxiv-2.2.0-py3-none-any.whl.

File metadata

Download URL: arxiv-2.2.0-py3-none-any.whl
Upload date: Apr 8, 2025
Size: 11.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.10.4

File hashes

Hashes for arxiv-2.2.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`545b8af5ab301efff7697cd112b5189e631b80521ccbc33fbc1e1f9cff63ca4d`
MD5	`efd4c3ecdfd603c8692890940981ddb4`
BLAKE2b-256	`711ee7f0393e836b5347605fc356c24d9f9ae9b26e0f7e52573b80e3d28335eb`

See more details on using hashes here.

arxiv 2.2.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

arxiv.py

Usage

Installation

Examples

Fetching results

Downloading papers

Fetching results with a custom client

Logging

Types

Client

Search

Result

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes