llama-index readers remote integration
Project description
Remote Page/File Loader
This loader makes it easy to extract the text from any remote page or file using just its url. If there's a file at the url, this loader will download it temporarily and parse it using SimpleDirectoryReader
. It is an all-in-one tool for (almost) any url.
As a result, any page or type of file is supported. For instance, if a .txt
url such as a Project Gutenberg book is passed in, the text will be parsed as is. On the other hand, if a hosted .mp3 url is passed in, it will be downloaded and parsed using AudioTranscriber
.
Usage
To use this loader, you need to pass in a Path
to a local file. Optionally, you may specify a file_extractor
for the SimpleDirectoryReader
to use, other than the default one.
from llama_index import download_loader
RemoteReader = download_loader("RemoteReader")
loader = RemoteReader()
documents = loader.load_data(
url="https://en.wikipedia.org/wiki/File:Example.jpg"
)
This loader is designed to be used as a way to load data into LlamaIndex and/or subsequently used as a Tool in a LangChain Agent. See here for examples.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for flying_delta_readers_remote-0.1.0.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | 72d1e69a14ded28b5fb694285ced32bcc407171bd1e703cbda021adbc212a88a |
|
MD5 | a3973482298d4965b67b9f637497156d |
|
BLAKE2b-256 | a6bb88584e77a8b367faf51d0f0b6cbc2b5feefcc9f64b301656b17bd2321dc9 |
Hashes for flying_delta_readers_remote-0.1.0-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | efbb601f9bdbe3c0bf11096d275e7ce0e135b18437dccc690c2420c39fb13186 |
|
MD5 | 9631a3b13d29637c9ce08d0852cf6afc |
|
BLAKE2b-256 | 1bf31df1cbe048cc15b898f0180b87acd3e0bb7bd9784f2531c1a51b129461e7 |