llama-index readers dashscope integration

LlamaIndex Readers Integration: Dashscope

Usage

from llama_index.readers.dashscope.base import DashScopeParse
from llama_index.readers.dashscope.utils import ResultType

# Paths of the files to parse
file_list = ['./aiayn.pdf', './aiayn.abc']

# Create the reader, then parse every file into a list of Document objects
parse = DashScopeParse(result_type=ResultType.DASHCOPE_DOCMIND)
documents = parse.load_data(file_path=file_list)

Reader Settings:

A full list of reader settings/kwargs is below:

  • api_key: Optional[str] -- Your DashScope API key, passed either as a parameter or through an environment variable. A value passed as a parameter overrides the environment variable.
  • workspace_id: Optional[str] -- Your DashScope workspace_id, passed either as a parameter or through an environment variable. A value passed as a parameter overrides the environment variable.
  • category_id: Optional[str] -- The category_id for the parser. The default value is "default".
  • base_url: Optional[str] -- The base URL for the DashScope API. The default value is "https://dashscope.aliyuncs.com". A value passed as a parameter overrides the environment variable.
  • result_type: Optional[ResultType] -- The result type for the parser. The default value is ResultType.DASHCOPE_DOCMIND.
  • num_workers: Optional[int] -- The number of workers used to send API requests for parsing. The default value is 4; the value must be greater than 0 and less than 10.
  • check_interval: Optional[int] -- The interval, in seconds, between checks for whether parsing is done. The default value is 5.
  • max_timeout: Optional[int] -- The maximum time, in seconds, to wait for parsing to finish. The default value is 3600.
  • verbose: Optional[bool] -- Whether to print the progress of the parsing. The default value is True.
  • show_progress: Optional[bool] -- Whether to show progress when parsing multiple files. The default value is True.
  • ignore_errors: Optional[bool] -- Whether to ignore and skip errors raised during parsing. The default value is True.
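
The precedence rule (parameter over environment variable) and the num_workers bound from the list above can be sketched in plain Python. This is a minimal illustration, not the reader's actual implementation: the helper names resolve_setting and check_num_workers are hypothetical, and the environment variable name DASHSCOPE_API_KEY is an assumption, since the list does not name the variables.

```python
import os
from typing import Optional


def resolve_setting(param: Optional[str], env_name: str,
                    default: Optional[str] = None) -> Optional[str]:
    """Return the explicit parameter if given, else the environment value, else the default."""
    if param is not None:
        return param
    return os.environ.get(env_name, default)


def check_num_workers(num_workers: int = 4) -> int:
    """Enforce the documented bound: greater than 0 and less than 10."""
    if not 0 < num_workers < 10:
        raise ValueError("num_workers must be greater than 0 and less than 10")
    return num_workers


# Assumed variable name, set here for illustration only.
os.environ["DASHSCOPE_API_KEY"] = "key-from-env"

from_env = resolve_setting(None, "DASHSCOPE_API_KEY")
from_param = resolve_setting("key-from-param", "DASHSCOPE_API_KEY")
```

Here from_env falls back to the environment, while from_param keeps the explicitly passed value.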

Reader Input:

  • file_path: Union[str, List[str]] -- The file path or list of file paths to parse.
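
Because file_path accepts either a single path or a list of paths, calling code often normalizes it to a list first. The sketch below shows one way to do that; as_path_list is a hypothetical helper and is not claimed to be how the reader handles its input internally.

```python
from typing import List, Union


def as_path_list(file_path: Union[str, List[str]]) -> List[str]:
    """Wrap a single path in a list; copy a list of paths as-is."""
    if isinstance(file_path, str):
        return [file_path]
    return list(file_path)


single = as_path_list("./aiayn.pdf")
many = as_path_list(["./aiayn.pdf", "./aiayn.abc"])
```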

Reader Output:

  • List[llama_index.core.schema.Document] -- The list of documents parsed from the file.
    • text: str -- The text of the document from DASHCOPE_DOCMIND.
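
Each returned document exposes its parsed content through text. The sketch below shows one way to keep only documents with non-blank text. It uses stand-in objects with a text attribute so that it runs without an API key; non_empty_texts is a hypothetical helper, not part of the package.

```python
from types import SimpleNamespace
from typing import Iterable, List


def non_empty_texts(documents: Iterable) -> List[str]:
    """Collect the text of every document whose text is not blank."""
    return [doc.text for doc in documents if doc.text and doc.text.strip()]


# Stand-ins for llama_index.core.schema.Document; only `.text` is used here.
documents = [
    SimpleNamespace(text="parsed text of the first file"),
    SimpleNamespace(text="   "),
]

texts = non_empty_texts(documents)
```

With real output, pass the list returned by load_data in place of the stand-in documents.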
