Export arxiv papers to pdf formats
Project description
Arxix Export
Arxiv Export is a Python library that allows you to search, download, and manage scientific articles from arXiv.org. It is useful for automating paper downloads and obtaining structured information about articles.
Installation
pip install arxiv-export
Usage Example
import asyncio
from arxiv_export_documents import export_papers
async def main():
search_query = "quantum computing"
download_path = "./arxiv_papers"
max_results = 5
async for paper in export_papers(
search=search_query,
path_download=download_path,
max_results=max_results
):
print(f"Downloaded paper: {paper.title}")
print(f"Authors: {', '.join(paper.authors)}")
print(f"Summary: {paper.summary}")
print(f"Link: {paper.link}")
print(f"Path: {paper.path}")
print(f"Documents: {len(paper.documents)}")
print(f"Exists: {paper.is_exist}")
print("-" * 80)
if __name__ == "__main__":
asyncio.run(main())
Features
- Search for articles on arXiv using keywords.
- Automatically download article PDFs.
- Access metadata such as title, authors, abstract, link, and local path.
- Manage multiple results with a single command.
Main Parameters
search: search string (e.g.,"quantum computing").path_download: path to save the PDFs.max_results: maximum number of articles to download.
Vector Database for LLMs
The documents property provides a list of Document files intended for ingestion into a vector database. These files are commonly used to supply structured data to language models (LLMs), supporting semantic search and advanced analysis.
License
This library is distributed under the MIT license.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file arxiv_export_documents-0.1.8.tar.gz.
File metadata
- Download URL: arxiv_export_documents-0.1.8.tar.gz
- Upload date:
- Size: 6.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.12.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
1c169fb60a7f41753e5c288b36987daf4f37001a3a9fc204360d9000fc5be4ae
|
|
| MD5 |
0d2ae7c47309cfaea73fc0687d05445b
|
|
| BLAKE2b-256 |
d438b8f664634bee2e09367f49e325e0e7756bbc092fe683f222d3b85d236325
|
File details
Details for the file arxiv_export_documents-0.1.8-py3-none-any.whl.
File metadata
- Download URL: arxiv_export_documents-0.1.8-py3-none-any.whl
- Upload date:
- Size: 6.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.12.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
5ea4c8733ba09b5a2d420c732209b1cffa8980398287235ca613bd991f414201
|
|
| MD5 |
97c92058cc34464d274ce8244039e36b
|
|
| BLAKE2b-256 |
15288fbb767b6b665ccd6cb0ef05190176286ec5fb7573776169b879eed1df34
|