llama-index readers papers integration
Project description
Papers Loaders
pip install llama-index-readers-papers
Arxiv Papers Loader
This loader fetches the text from the most relevant scientific papers on Arxiv specified by a search query (e.g. "Artificial Intelligence"). For each paper, the abstract is extracted and put in a separate document. The search query may be any string, Arxiv paper id, or a general Arxiv query string (see the full list of capabilities here).
Usage
To use this loader, you need to pass in the search query. You may also optionally specify a local directory to temporarily store the paper PDFs (they are deleted automatically) and the maximum number of papers you want to parse for your search query (default is 10).
from llama_index.readers.papers import ArxivReader
loader = ArxivReader()
documents = loader.load_data(search_query="au:Karpathy")
Alternatively, if you would like to load papers and abstracts separately:
from llama_index.readers.papers import ArxivReader
loader = ArxivReader()
documents, abstracts = loader.load_papers_and_abstracts(
search_query="au:Karpathy"
)
This loader is designed to be used as a way to load data into LlamaIndex.
Pubmed Papers Loader
This loader fetches the text from the most relevant scientific papers on Pubmed specified by a search query (e.g. "Alzheimers"). For each paper, the abstract is included in the Document
. The search query may be any string.
Usage
To use this loader, you need to pass in the search query. You may also optionally specify the maximum number of papers you want to parse for your search query (default is 10).
from llama_index.readers.papers import PubmedReader
loader = PubmedReader()
documents = loader.load_data(search_query="amyloidosis")
This loader is designed to be used as a way to load data into LlamaIndex.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for llama_index_readers_papers-0.1.6.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | 11273a76c11df19088d6cd5d44f03da665f01a40160f8e459c7771ecdcbafd4f |
|
MD5 | 4063718f9779130851c45b8318e4bd3c |
|
BLAKE2b-256 | 8cd22d3911526d44d99b4445504fe94db4d931712635d273a6aeb9d27cbdc69d |
Hashes for llama_index_readers_papers-0.1.6-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 7c35eb3f181908a6e6108268721bb3944be10419ba1d04ead3e7c9e8a3bd0589 |
|
MD5 | 7bc823ae758b940cf5ecf6cad315deb9 |
|
BLAKE2b-256 | 1b0ee4a71381d5f0481ac1a64d2736040185efc81ca1fe10a2c9fc56df8a069b |