Transform arXiv papers into a single LaTeX prompt for LLMs
arxiv-to-prompt
A command-line tool to transform arXiv papers into a single LaTeX source that can be used as a prompt for asking LLMs questions about the paper. It downloads the source files, automatically finds the main tex file containing \documentclass, and flattens multiple files into a single coherent source by resolving \input and \include commands. The tool also provides options to remove LaTeX comments and appendix sections from the output (which can be useful to shorten the prompt).
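The flattening step described above can be sketched roughly as follows. This is a minimal illustration, not the package's actual implementation: the names `find_main_tex`, `flatten_tex`, and `INPUT_PATTERN` are hypothetical, and the sketch assumes the paper's source has already been extracted into a local directory.

```python
import re
from pathlib import Path
from typing import Optional

# Hypothetical pattern: matches \input{name} and \include{name}.
# The brace must follow directly, so \includegraphics{...} is left alone.
INPUT_PATTERN = re.compile(r"\\(?:input|include)\{([^}]+)\}")


def find_main_tex(directory: Path) -> Optional[Path]:
    """Return the first .tex file that contains \\documentclass, if any."""
    for tex_file in sorted(directory.rglob("*.tex")):
        text = tex_file.read_text(encoding="utf-8", errors="ignore")
        if r"\documentclass" in text:
            return tex_file
    return None


def flatten_tex(tex_file: Path, root: Path) -> str:
    """Recursively inline \\input and \\include targets, resolved against root."""
    text = tex_file.read_text(encoding="utf-8", errors="ignore")

    def inline(match: re.Match) -> str:
        name = match.group(1).strip()
        target = root / name
        # LaTeX lets authors omit the .tex extension.
        if target.suffix != ".tex":
            target = target.with_name(target.name + ".tex")
        if not target.exists():
            # Leave the command untouched when the file cannot be found.
            return match.group(0)
        return flatten_tex(target, root)

    return INPUT_PATTERN.sub(inline, text)
```

The sketch ignores corner cases such as circular includes, `\input` commands that appear inside comments, and paths given without braces.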
Installation
pip install arxiv-to-prompt
Usage
Basic usage:
# Display LaTeX source with comments
arxiv-to-prompt 2303.08774
# Display LaTeX source without comments
arxiv-to-prompt 2303.08774 --no-comments
# Display LaTeX source without appendix sections
arxiv-to-prompt 2303.08774 --no-appendix
# Combine options (no comments and no appendix)
arxiv-to-prompt 2303.08774 --no-comments --no-appendix
# Copy to clipboard
arxiv-to-prompt 2303.08774 | pbcopy  # pbcopy is macOS-only; use your platform's clipboard tool elsewhere
# Combine with the `llm` library from https://github.com/simonw/llm to chat about the paper
arxiv-to-prompt 1706.03762 | llm -s "explain this paper"
The arXiv ID can be found in the paper's URL. For example, for https://arxiv.org/abs/2303.08774, the ID is 2303.08774. The tool automatically downloads the latest version of the paper, so you don't need to specify a version.
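If you are starting from a URL rather than a bare ID, a small helper can pull the ID out for you. The sketch below is not part of arxiv-to-prompt; `extract_arxiv_id` and `ARXIV_ID_PATTERN` are hypothetical names, and the pattern only covers new-style identifiers such as 2303.08774 (older ones like hep-th/9901001 are not handled). It drops any version suffix as a design choice, since the latest version is downloaded anyway.

```python
import re

# New-style arXiv IDs: four digits, a dot, four or five digits,
# optionally followed by a version suffix such as v2.
ARXIV_ID_PATTERN = re.compile(r"(\d{4}\.\d{4,5})(v\d+)?")


def extract_arxiv_id(url_or_id: str) -> str:
    """Return the bare arXiv ID (without version suffix) from a URL or ID string."""
    match = ARXIV_ID_PATTERN.search(url_or_id)
    if match is None:
        raise ValueError(f"No arXiv ID found in {url_or_id!r}")
    return match.group(1)


print(extract_arxiv_id("https://arxiv.org/abs/2303.08774"))    # 2303.08774
print(extract_arxiv_id("https://arxiv.org/pdf/1706.03762v7"))  # 1706.03762
```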
Python API
You can also use arxiv-to-prompt in your Python code:
from arxiv_to_prompt import process_latex_source
# Get LaTeX source with comments
latex_source = process_latex_source("2303.08774")
# Get LaTeX source without comments
latex_source = process_latex_source("2303.08774", keep_comments=False)
# Get LaTeX source without appendix sections
latex_source = process_latex_source("2303.08774", remove_appendix_section=True)
# Combine options (no comments and no appendix)
latex_source = process_latex_source("2303.08774", keep_comments=False, remove_appendix_section=True)
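To give a feel for what the two options do to the text, here is a rough sketch of comment and appendix removal. It is an approximation for illustration only, not the code behind `keep_comments` or `remove_appendix_section`; the helper names are hypothetical, and it assumes the appendix starts at the first `\appendix` command, which the package may detect differently.

```python
import re

# A percent sign starts a comment unless it is escaped as \%.
COMMENT_PATTERN = re.compile(r"(?<!\\)%.*")
# \b keeps commands such as \appendixname from matching.
APPENDIX_PATTERN = re.compile(r"\\appendix\b")


def strip_comments(latex: str) -> str:
    """Drop everything from an unescaped % to the end of each line."""
    kept = []
    for line in latex.splitlines():
        stripped = COMMENT_PATTERN.sub("", line)
        # Keep the line unless it consisted only of a comment.
        if stripped.strip() or not line.strip():
            kept.append(stripped.rstrip())
    return "\n".join(kept)


def strip_appendix(latex: str) -> str:
    """Cut the text from the first \\appendix command up to \\end{document}."""
    match = APPENDIX_PATTERN.search(latex)
    if match is None:
        return latex
    end = latex.find("\\end{document}", match.start())
    tail = latex[end:] if end != -1 else ""
    return latex[: match.start()] + tail
```

The sketch does not treat `verbatim` environments or a literal `\\%` sequence specially, so use the package's own options for real papers.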
Projects Using arxiv-to-prompt
Here are some projects and use cases that leverage arxiv-to-prompt:
- arxiv-latex-mcp: MCP server that uses arxiv-to-prompt to fetch and process arXiv LaTeX sources for precise interpretation of mathematical expressions in scientific papers.
- arxiv-tex-ui: Chat with an LLM about an arXiv paper using its LaTeX source.
If you're using arxiv-to-prompt in your project, please submit a pull request to add it to this list!
References
- Inspired by files-to-prompt.
- Reused some code from paper2slides.
File details
Details for the file arxiv_to_prompt-0.2.0.tar.gz.
File metadata
- Download URL: arxiv_to_prompt-0.2.0.tar.gz
- Upload date:
- Size: 10.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.9.16
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 3b92271b66104ad6b4927935502e612be76f16239377423a5a5dfc59f517c4ca |
| MD5 | 23de9bcb3e13a95691d2c572e769d144 |
| BLAKE2b-256 | 17812679dcb52f5a91cce5020c79689b9dc3ff08c670a0d4c08466b397a1385c |
File details
Details for the file arxiv_to_prompt-0.2.0-py3-none-any.whl.
File metadata
- Download URL: arxiv_to_prompt-0.2.0-py3-none-any.whl
- Upload date:
- Size: 8.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.9.16
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 7b4df65696838659b2791d636e4986e8f2bf33136f0505ac8eeddde97525f30e |
| MD5 | 16ecfce4eb524e63bb1527aac98ffaf0 |
| BLAKE2b-256 | 0c28fa57c4a0dbc3e59ccec691e418fdd6e3109715dfea4ff8a58ac395f33bee |