Skip to main content

llama-index readers dashscope integration

Project description

LlamaIndex Readers Integration: Dashscope

Installation

pip install llama-index-readers-dashscope

Usage

from llama_index.readers.dashscope.base import DashScopeParse
from llama_index.readers.dashscope.utils import ResultType

file_list = [
    # your files (accept doc, docx, pdf)
]

parse = DashScopeParse(result_type=ResultType.DASHCOPE_DOCMIND)
documents = parse.load_data(file_path=file_list)

Reader Setting:

A full list of retriever settings/kwargs is below:

  • api_key: Optional[str] -- Your dashscope API key, which can be passed in through environment variables or parameters. The parameter settings will override the results from the environment variables
  • workspace_id: Optional[str] -- Your dashscope workspace_id, which can be passed in through environment variables or parameters. The parameter settings will override the results from the environment variables
  • base_url: Optional[str] -- The base url for the Dashscope API. The default value is "https://dashscope.aliyuncs.com". The parameter settings will override the results from the environment variables.
  • result_type: Optional[ResultType] -- The result type for the parser. The default value is ResultType.DASHCOPE_DOCMIND.
  • num_workers: Optional[int] -- The number of workers to use sending API requests for parsing. The default value is 4, greater than 0, less than 10.
  • check_interval: Optional[int] -- The interval in seconds to check if the parsing is done. The default value is 5.
  • max_timeout: Optional[int] -- The maximum timeout in seconds to wait for the parsing to finish. The default value is 3600.
  • verbose: Optional[bool] -- Whether to print the progress of the parsing. The default value is True.
  • show_progress: Optional[bool] -- Show progress when parsing multiple files. The default value is True.
  • ignore_errors: Optional[bool] -- Whether or not to ignore and skip errors raised during parsing. The default value is True.

Reader Input:

  • file_path: Union[str, List[str]] -- The file path or list of file paths to parse.

Reader Output:

  • List[llama_index.core.schema.Document] -- The list of documents parsed from the file.
    • text: str -- The text of the document from DASHCOPE_DOCMIND.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

llama_index_readers_dashscope-0.3.0.tar.gz (9.7 kB view details)

Uploaded Source

Built Distribution

File details

Details for the file llama_index_readers_dashscope-0.3.0.tar.gz.

File metadata

File hashes

Hashes for llama_index_readers_dashscope-0.3.0.tar.gz
Algorithm Hash digest
SHA256 a432ba635f8ff317bf90f86a41cf5792b06ce090a18ec96dd02263da5400d10b
MD5 b257bedf24104ea38bc0e5e3f45c4df3
BLAKE2b-256 cfe610396dae2df41219eff0e6386e6ee0552b3240f01ddf68ea2e4ffbc4a829

See more details on using hashes here.

File details

Details for the file llama_index_readers_dashscope-0.3.0-py3-none-any.whl.

File metadata

File hashes

Hashes for llama_index_readers_dashscope-0.3.0-py3-none-any.whl
Algorithm Hash digest
SHA256 8b0c5901fcd69f7167d87c11592525bd0e082b9cb204ec1e56d095fe1ff15fb5
MD5 505616b1d672e2bd2ad1cdc164688e36
BLAKE2b-256 8abac99b0c1e522010a919e1194c38715f78a4f19d2a83d4537ae1c19a19149d

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page