Skip to main content

llama-index readers papers integration

Project description

Papers Loaders

pip install llama-index-readers-papers

Arxiv Papers Loader

This loader fetches the text from the most relevant scientific papers on Arxiv specified by a search query (e.g. "Artificial Intelligence"). For each paper, the abstract is extracted and put in a separate document. The search query may be any string, Arxiv paper id, or a general Arxiv query string (see the full list of capabilities here).

Usage

To use this loader, you need to pass in the search query. You may also optionally specify a local directory to temporarily store the paper PDFs (they are deleted automatically) and the maximum number of papers you want to parse for your search query (default is 10).

from llama_index.readers.papers import ArxivReader

loader = ArxivReader()
documents = loader.load_data(search_query="au:Karpathy")

Alternatively, if you would like to load papers and abstracts separately:

from llama_index.readers.papers import ArxivReader

loader = ArxivReader()
documents, abstracts = loader.load_papers_and_abstracts(
    search_query="au:Karpathy"
)

This loader is designed to be used as a way to load data into LlamaIndex.

Pubmed Papers Loader

This loader fetches the text from the most relevant scientific papers on Pubmed specified by a search query (e.g. "Alzheimers"). For each paper, the abstract is included in the Document. The search query may be any string.

Usage

To use this loader, you need to pass in the search query. You may also optionally specify the maximum number of papers you want to parse for your search query (default is 10).

from llama_index.readers.papers import PubmedReader

loader = PubmedReader()
documents = loader.load_data(search_query="amyloidosis")

This loader is designed to be used as a way to load data into LlamaIndex.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

llama_index_readers_papers-0.4.1.tar.gz (6.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

llama_index_readers_papers-0.4.1-py3-none-any.whl (6.8 kB view details)

Uploaded Python 3

File details

Details for the file llama_index_readers_papers-0.4.1.tar.gz.

File metadata

File hashes

Hashes for llama_index_readers_papers-0.4.1.tar.gz
Algorithm Hash digest
SHA256 1dc8d3fcd38417d287d7c7bdef0aa4b99bb74636487f914bd92e93f1f7909880
MD5 1a8245675b5a2c927b87c119c987230d
BLAKE2b-256 3f083816cba1ad003a039326874b706194f4d4305c8ed98fe3c33b4cdfe2f013

See more details on using hashes here.

File details

Details for the file llama_index_readers_papers-0.4.1-py3-none-any.whl.

File metadata

File hashes

Hashes for llama_index_readers_papers-0.4.1-py3-none-any.whl
Algorithm Hash digest
SHA256 1831abc1bbe5f9fa10677d74e6752cc1df1e4bec787c5d50ca3f7bae6dfd4b75
MD5 443ba3b8864cfe04ad9b3dc9931078b1
BLAKE2b-256 5cb6fcb8cdb75208df7d7d75461dd76f5126491fdf243f032599a0fc813af8e8

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page