Skip to main content

llama-index readers papers integration

Project description

Papers Loaders

pip install llama-index-readers-papers

Arxiv Papers Loader

This loader fetches the text from the most relevant scientific papers on Arxiv specified by a search query (e.g. "Artificial Intelligence"). For each paper, the abstract is extracted and put in a separate document. The search query may be any string, Arxiv paper id, or a general Arxiv query string (see the full list of capabilities here).

Usage

To use this loader, you need to pass in the search query. You may also optionally specify a local directory to temporarily store the paper PDFs (they are deleted automatically) and the maximum number of papers you want to parse for your search query (default is 10).

from llama_index.readers.papers import ArxivReader

loader = ArxivReader()
documents = loader.load_data(search_query="au:Karpathy")

Alternatively, if you would like to load papers and abstracts separately:

from llama_index.readers.papers import ArxivReader

loader = ArxivReader()
documents, abstracts = loader.load_papers_and_abstracts(
    search_query="au:Karpathy"
)

This loader is designed to be used as a way to load data into LlamaIndex.

Pubmed Papers Loader

This loader fetches the text from the most relevant scientific papers on Pubmed specified by a search query (e.g. "Alzheimers"). For each paper, the abstract is included in the Document. The search query may be any string.

Usage

To use this loader, you need to pass in the search query. You may also optionally specify the maximum number of papers you want to parse for your search query (default is 10).

from llama_index.readers.papers import PubmedReader

loader = PubmedReader()
documents = loader.load_data(search_query="amyloidosis")

This loader is designed to be used as a way to load data into LlamaIndex.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

llama_index_readers_papers-0.2.0.tar.gz (4.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

llama_index_readers_papers-0.2.0-py3-none-any.whl (5.9 kB view details)

Uploaded Python 3

File details

Details for the file llama_index_readers_papers-0.2.0.tar.gz.

File metadata

  • Download URL: llama_index_readers_papers-0.2.0.tar.gz
  • Upload date:
  • Size: 4.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.7.1 CPython/3.10.13 Darwin/23.6.0

File hashes

Hashes for llama_index_readers_papers-0.2.0.tar.gz
Algorithm Hash digest
SHA256 7f6ec46ba386ff35f70b3cb4892ebb05b99cbf391bb88482cfd1035fcd6abd37
MD5 f2eb5900924d55d30cfa602df8bd04ae
BLAKE2b-256 e475aad9754d0f4ac218d35867aa68000b9069f58fb791102b75a95c531f3af3

See more details on using hashes here.

File details

Details for the file llama_index_readers_papers-0.2.0-py3-none-any.whl.

File metadata

File hashes

Hashes for llama_index_readers_papers-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 45c77d8d5ebe386c5e37c9d2db3592037dbac0d69da7c920698fe4e133ea460a
MD5 2ba79909c5aea4e6a4b103ad3cac0255
BLAKE2b-256 2b4abba1e724e46728a0ca5a3d2d8a19b8c81746cfb0fd893188a7fae18b2efb

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page