Skip to main content

llama-index readers remote integration

Project description

Remote Page/File Loader

This loader makes it easy to extract the text from any remote page or file using just its url. If there's a file at the url, this loader will download it temporarily and parse it using SimpleDirectoryReader. It is an all-in-one tool for (almost) any url.

As a result, any page or type of file is supported. For instance, if a .txt url such as a Project Gutenberg book is passed in, the text will be parsed as is. On the other hand, if a hosted .mp3 url is passed in, it will be downloaded and parsed using AudioTranscriber.

Usage

To use this loader, you need to pass in a Path to a local file. Optionally, you may specify a file_extractor for the SimpleDirectoryReader to use, other than the default one.

from llama_index import download_loader

RemoteReader = download_loader("RemoteReader")

loader = RemoteReader()
documents = loader.load_data(
    url="https://en.wikipedia.org/wiki/File:Example.jpg"
)

This loader is designed to be used as a way to load data into LlamaIndex and/or subsequently used as a Tool in a LangChain Agent. See here for examples.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

flying_delta_readers_remote-0.1.0.tar.gz (3.3 kB view details)

Uploaded Source

Built Distribution

File details

Details for the file flying_delta_readers_remote-0.1.0.tar.gz.

File metadata

File hashes

Hashes for flying_delta_readers_remote-0.1.0.tar.gz
Algorithm Hash digest
SHA256 72d1e69a14ded28b5fb694285ced32bcc407171bd1e703cbda021adbc212a88a
MD5 a3973482298d4965b67b9f637497156d
BLAKE2b-256 a6bb88584e77a8b367faf51d0f0b6cbc2b5feefcc9f64b301656b17bd2321dc9

See more details on using hashes here.

File details

Details for the file flying_delta_readers_remote-0.1.0-py3-none-any.whl.

File metadata

File hashes

Hashes for flying_delta_readers_remote-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 efbb601f9bdbe3c0bf11096d275e7ce0e135b18437dccc690c2420c39fb13186
MD5 9631a3b13d29637c9ce08d0852cf6afc
BLAKE2b-256 1bf31df1cbe048cc15b898f0180b87acd3e0bb7bd9784f2531c1a51b129461e7

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page