
llama-index readers alibabacloud_aisearch integration


LlamaIndex Readers Integration: Alibabacloud_Aisearch

Installation

pip install llama-index-readers-alibabacloud-aisearch

Usage

Supported file types include ppt/pptx, doc/docx, pdf, and images, among others. For further details, see the Alibaba Cloud AI Search documentation.

You can specify the endpoint and aisearch_api_key in the constructor, or set the environment variables AISEARCH_ENDPOINT and AISEARCH_API_KEY.
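The two configuration routes described above can be sketched as a small lookup. This is only an illustration: resolve_aisearch_config is a hypothetical helper, not part of the package, and the precedence it applies (explicit arguments first, environment variables as the fallback) is an assumption rather than documented reader behavior.

```python
import os
from typing import Mapping, Optional, Tuple


def resolve_aisearch_config(
    endpoint: Optional[str] = None,
    aisearch_api_key: Optional[str] = None,
    env: Optional[Mapping[str, str]] = None,
) -> Tuple[str, str]:
    """Hypothetical helper: explicit values win, environment variables fill the gaps."""
    env = os.environ if env is None else env
    endpoint = endpoint or env.get("AISEARCH_ENDPOINT")
    aisearch_api_key = aisearch_api_key or env.get("AISEARCH_API_KEY")

    # Fail early with a readable message instead of at the first API call.
    required = (
        ("AISEARCH_ENDPOINT", endpoint),
        ("AISEARCH_API_KEY", aisearch_api_key),
    )
    missing = [name for name, value in required if not value]
    if missing:
        raise ValueError("Missing AI Search settings: " + ", ".join(missing))
    return endpoint, aisearch_api_key


# Possible usage (requires the package to be installed):
# endpoint, key = resolve_aisearch_config()
# reader = AlibabaCloudAISearchDocumentReader(
#     endpoint=endpoint, aisearch_api_key=key
# )
```

Checking the settings up front like this surfaces a missing variable before any file is sent to the service.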

Read local files

from llama_index.readers.alibabacloud_aisearch import (
    AlibabaCloudAISearchDocumentReader,
    AlibabaCloudAISearchImageReader,
)
from llama_index.core import SimpleDirectoryReader

document_reader = AlibabaCloudAISearchDocumentReader()
image_reader = AlibabaCloudAISearchImageReader()

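# Map each file suffix to the reader that should parse it.
file_extractor = {}
# Office documents and PDFs go to the document reader.
for suffix in (".pdf", ".docx", ".doc", ".ppt", ".pptx"):
    file_extractor[suffix] = document_reader
# Image files go to the image reader.
for suffix in (".jpg", ".jpeg", ".png", ".bmp", ".tiff"):
    file_extractor[suffix] = image_reader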

documents = SimpleDirectoryReader(
    "./data", file_extractor=file_extractor
).load_data(show_progress=True)
print(documents)
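Before sending a directory to the service, it can help to preview which files the suffix mapping above would route to which reader. The following is a minimal sketch that uses only the standard library. group_by_reader is a hypothetical helper, and lowercasing the suffix is a choice made for this sketch, not a statement about how SimpleDirectoryReader matches suffixes.

```python
from pathlib import Path
from typing import Dict, Iterable, List

# Same suffix groups as in the example above.
DOCUMENT_SUFFIXES = (".pdf", ".docx", ".doc", ".ppt", ".pptx")
IMAGE_SUFFIXES = (".jpg", ".jpeg", ".png", ".bmp", ".tiff")


def group_by_reader(paths: Iterable[str]) -> Dict[str, List[str]]:
    """Hypothetical helper: bucket paths by the reader their suffix maps to."""
    routing = {suffix: "document" for suffix in DOCUMENT_SUFFIXES}
    routing.update({suffix: "image" for suffix in IMAGE_SUFFIXES})

    groups: Dict[str, List[str]] = {"document": [], "image": [], "unmapped": []}
    for path in paths:
        suffix = Path(path).suffix.lower()
        # Files with no entry in the mapping are reported separately.
        groups[routing.get(suffix, "unmapped")].append(path)
    return groups


if __name__ == "__main__":
    data_dir = Path("./data")
    if data_dir.is_dir():
        files = [str(p) for p in data_dir.iterdir() if p.is_file()]
        for reader_name, matched in group_by_reader(files).items():
            print(reader_name, matched)
```

Files listed under "unmapped" have no entry in file_extractor, so they would be left to whatever SimpleDirectoryReader does by default.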

Read remote files

from llama_index.readers.alibabacloud_aisearch import (
    AlibabaCloudAISearchImageReader,
)

image_reader = AlibabaCloudAISearchImageReader(
    service_id="ops-image-analyze-ocr-001"
)
image_urls = [
    "https://img.alicdn.com/imgextra/i1/O1CN01WksnF41hlhBFsXDNB_!!6000000004318-0-tps-1000-1400.jpg",
]

# The file_type is automatically determined based on the file extension.
# If it cannot be identified, manual specification of the file_type is required.
documents = image_reader.load_data(file_path=image_urls, file_type="jpg")
print(documents)
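The comment in the example above says the file type is derived from the file extension and must be given manually when it cannot be identified. As a rough illustration of that idea (not the reader's actual implementation), the sketch below guesses a type from a URL or POSIX-style path and returns None when there is no extension. guess_file_type is a hypothetical name.

```python
import posixpath
from typing import Optional
from urllib.parse import urlparse


def guess_file_type(location: str) -> Optional[str]:
    """Hypothetical helper: return the lowercase extension without the dot, or None."""
    # urlparse drops the query string and fragment, leaving only the path part.
    path = urlparse(location).path
    suffix = posixpath.splitext(path)[1]
    return suffix[1:].lower() or None


if __name__ == "__main__":
    url = (
        "https://img.alicdn.com/imgextra/i1/"
        "O1CN01WksnF41hlhBFsXDNB_!!6000000004318-0-tps-1000-1400.jpg"
    )
    print(guess_file_type(url))
    print(guess_file_type("https://example.com/download?id=42"))
```

When a helper like this returns None, that is the case where file_type would need to be passed explicitly to load_data.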


