Skip to main content

llama-index readers papers integration

Project description

Papers Loaders

pip install llama-index-readers-papers

Arxiv Papers Loader

This loader fetches the text from the most relevant scientific papers on Arxiv specified by a search query (e.g. "Artificial Intelligence"). For each paper, the abstract is extracted and put in a separate document. The search query may be any string, Arxiv paper id, or a general Arxiv query string (see the full list of capabilities here).

Usage

To use this loader, you need to pass in the search query. You may also optionally specify a local directory to temporarily store the paper PDFs (they are deleted automatically) and the maximum number of papers you want to parse for your search query (default is 10).

from llama_index.readers.papers import ArxivReader

loader = ArxivReader()
documents = loader.load_data(search_query="au:Karpathy")

Alternatively, if you would like to load papers and abstracts separately:

from llama_index.readers.papers import ArxivReader

loader = ArxivReader()
documents, abstracts = loader.load_papers_and_abstracts(
    search_query="au:Karpathy"
)

This loader is designed to be used as a way to load data into LlamaIndex.

Pubmed Papers Loader

This loader fetches the text from the most relevant scientific papers on Pubmed specified by a search query (e.g. "Alzheimers"). For each paper, the abstract is included in the Document. The search query may be any string.

Usage

To use this loader, you need to pass in the search query. You may also optionally specify the maximum number of papers you want to parse for your search query (default is 10).

from llama_index.readers.papers import PubmedReader

loader = PubmedReader()
documents = loader.load_data(search_query="amyloidosis")

This loader is designed to be used as a way to load data into LlamaIndex.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

llama_index_readers_papers-0.3.1.tar.gz (5.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

llama_index_readers_papers-0.3.1-py3-none-any.whl (6.7 kB view details)

Uploaded Python 3

File details

Details for the file llama_index_readers_papers-0.3.1.tar.gz.

File metadata

  • Download URL: llama_index_readers_papers-0.3.1.tar.gz
  • Upload date:
  • Size: 5.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.3 CPython/3.12.3 Linux/6.8.0-1021-azure

File hashes

Hashes for llama_index_readers_papers-0.3.1.tar.gz
Algorithm Hash digest
SHA256 2dd7f143a23aa75aa7a38ad07186343050da2b1b04cea26ca3504064bd4efa7b
MD5 e101c3b3a4f99e1eedcb936e996ee5e2
BLAKE2b-256 324d7c0015dd1b4282c49ceb2d373ed1e0c93262fd6b163092555c35f7983c79

See more details on using hashes here.

File details

Details for the file llama_index_readers_papers-0.3.1-py3-none-any.whl.

File metadata

File hashes

Hashes for llama_index_readers_papers-0.3.1-py3-none-any.whl
Algorithm Hash digest
SHA256 9d019c604587a453b2475c7f820a73d449a5311e94a3a4ad71a53cd9e06fc215
MD5 028c4f266016d1d6b00c47dc972e6dd9
BLAKE2b-256 b64a8a7fa8d171c0f78e83d6e8986dbc9c4c7f47eac2b9a2f7fea911f592b724

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page