Skip to main content

llama-index readers alibabacloud_aisearch integration

Project description

LlamaIndex Readers Integration: Alibabacloud_Aisearch

Installation

pip install llama-index-readers-alibabacloud-aisearch

Usage

Supported file types: ppt/pptx, doc/docx, pdf, images and so on. For further details, please visit:

You can specify the endpoint and aisearch_api_key in the constructor, or set the environment variables AISEARCH_ENDPOINT and AISEARCH_API_KEY.

Read local files

from llama_index.readers.alibabacloud_aisearch import (
    AlibabaCloudAISearchDocumentReader,
    AlibabaCloudAISearchImageReader,
)
from llama_index.core import SimpleDirectoryReader

document_reader = AlibabaCloudAISearchDocumentReader()
image_reader = AlibabaCloudAISearchImageReader()

file_extractor = {}
for suffix in (".pdf", ".docx", ".doc", ".ppt", ".pptx"):
    file_extractor[suffix] = document_reader
for suffix in (".jpg", ".jpeg", ".png", ".bmp", ".tiff"):
    file_extractor[suffix] = image_reader

documents = SimpleDirectoryReader(
    "./data", file_extractor=file_extractor
).load_data(show_progress=True)
print(documents)

Read remote files

from llama_index.readers.alibabacloud_aisearch import (
    AlibabaCloudAISearchImageReader,
)

image_reader = AlibabaCloudAISearchImageReader(
    service_id="ops-image-analyze-ocr-001"
)
image_urls = [
    "https://img.alicdn.com/imgextra/i1/O1CN01WksnF41hlhBFsXDNB_!!6000000004318-0-tps-1000-1400.jpg",
]

# The file_type is automatically determined based on the file extension.
# If it cannot be identified, manual specification of the file_type is required.
documents = image_reader.load_data(file_path=image_urls, file_type="jpg")
print(documents)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

Built Distribution

File details

Details for the file llama_index_readers_alibabacloud_aisearch-0.1.0.tar.gz.

File metadata

File hashes

Hashes for llama_index_readers_alibabacloud_aisearch-0.1.0.tar.gz
Algorithm Hash digest
SHA256 8da2ba9a8888018f0946206cd43bdcbd8f6c95504882cb06d8a92409c3c8343f
MD5 b1d448a215ad4ed7f609f80e6f3b6227
BLAKE2b-256 978719b4a4629535c8ae7cc70506779830f6ee07891ac7a1911b4e4bf3b616e8

See more details on using hashes here.

File details

Details for the file llama_index_readers_alibabacloud_aisearch-0.1.0-py3-none-any.whl.

File metadata

File hashes

Hashes for llama_index_readers_alibabacloud_aisearch-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 91edf94db55b7d1c3d1fa1d2a0d16f68ce7f3020815423144b5a8ec14bd6f444
MD5 9736f3cac5b983db4c7e92782a69086f
BLAKE2b-256 5561bff08ca197cc56149c809903d7d643656721263556141b47fb0281993751

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page