Python client library for convenient usage of SAP Business Document Processing services

Python Client Library for the SAP AI Business Services: Document Classification and Document Information Extraction

This repository contains the source code of a Python client library to facilitate the use of the SAP AI Business Services: Document Classification and Document Information Extraction. The client library provides two API Client classes that contain convenient methods to access these services and issue calls to the Document Classification REST API and Document Information Extraction REST API respectively. To use the library you need to have access to SAP Business Technology Platform.

Check out the usage examples; they are a good way to get started with the services.

See the API documentation for details on how to use the library.

Notes for users of the sap-document-classification-client library

This library includes all the capabilities of the sap-document-classification-client, which will not be developed further. However, the code is still available here. If you want to switch to this library, be aware of the following changes:

The DCApiClient can now be imported directly from the top module via: from sap_business_document_processing import DCApiClient
The functions classify_documents, upload_documents_to_dataset, upload_documents_directory_to_dataset now return an iterator instead of a list. You can either analyze individual results using result = next(iterator) within a try-except block (e.g. to handle each failed document) or use results = list(iterator) to turn it into a list. The latter will raise an error if at least one document failed.
The function get_datasets_info now returns a list which is the "dataset" part of the API response json. (To keep existing code working, just drop the ["dataset"] lookup on the response.)
The function get_classification_documents_info now returns a list which is the "results" part of the API response json.
The function get_training_models_info now returns a list which is the "models" part of the API response json.
The function get_deployed_models_info now returns a list which is the "deployments" part of the API response json.
The library now raises the following custom exceptions:
- BDPApiException: Base exception for all exceptions of this library. Raised when no other exception is applicable.
- BDPClientException: Raised when an HTTP response with status code between 400 and 500 is returned. Usually means incorrect user input. (Replaces some HTTPErrors)
- BDPServerException: Raised when an HTTP response with status code between 500 and 600 is returned. Usually means that the server had some internal error. (Replaces some HTTPErrors)
- BDPUnauthorizedException: Raised when an HTTP response with status code 401 is returned. Usually means that wrong OAuth credentials were provided.
- BDPFailedAsynchronousOperationException: Raised when an asynchronous job failed during processing. (Replaces FailedCallException)
- BDPPollingTimeoutException: Raised when an asynchronous job exceeds the set polling_max_attempts. (Replaces PollingTimeoutException)
The function _poll_for_url no longer expects 'url' and 'payload' parameters, but 'path' and 'json' parameters instead.
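
The iterator behaviour described above can be sketched as follows. This is a minimal, offline illustration and not the library's implementation: StubResultIterator and collect_results are hypothetical names introduced here, and the BDPApiException class below is a local stand-in for the library's exception of the same name, so that the sketch runs without a service connection. It assumes, as the note above suggests, that the iterator keeps going after a failed document.

```python
class BDPApiException(Exception):
    """Local stand-in for the library's base exception (assumption for this sketch)."""


class StubResultIterator:
    """Hypothetical iterator: yields a result, or raises for a failed document."""

    def __init__(self, outcomes):
        self._outcomes = iter(outcomes)

    def __iter__(self):
        return self

    def __next__(self):
        # StopIteration from the inner iterator ends the iteration as usual.
        outcome = next(self._outcomes)
        if isinstance(outcome, Exception):
            raise outcome
        return outcome


def collect_results(iterator):
    """Consume the iterator one item at a time, keeping failures separate."""
    succeeded, failed = [], []
    while True:
        try:
            succeeded.append(next(iterator))
        except StopIteration:
            break
        except BDPApiException as exc:
            # One failed document does not stop the remaining ones.
            failed.append(exc)
    return succeeded, failed


# Made-up outcomes: two successful documents and one failure in between.
outcomes = [{"id": "a"}, BDPApiException("b failed"), {"id": "c"}]
succeeded, failed = collect_results(StubResultIterator(outcomes))
```

With the real client, the iterator returned by one of the functions listed above would take the place of StubResultIterator. Calling list(...) on the same outcomes instead would raise at the first failed document.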

Requirements

This library requires a properly set up Python 3.6 (or higher) environment.
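
One way to confirm that the running interpreter meets this requirement is sketched below. The helper name python_is_supported and the constant MINIMUM_PYTHON are illustrative and not part of the library.

```python
import sys

# Minimum version stated in the Requirements section.
MINIMUM_PYTHON = (3, 6)


def python_is_supported(version_info=sys.version_info, minimum=MINIMUM_PYTHON):
    """Return True if the (major, minor) version is at least the minimum."""
    return tuple(version_info[:2]) >= minimum


if not python_is_supported():
    print("Python 3.6 or higher is required for this library.")
```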

Download and Installation

Install this library in the standard way by running

pip install sap-business-document-processing

or by adding the library as a dependency in your requirements.txt file.

Demo usage

Prerequisites:

Document Classification

To try out the Document Classification service using the client library, you can run the two demos below:

Try out classification using default model demo
Try out training and classification using custom model demo (requires an enterprise account, trial account is not sufficient)

Document Information Extraction

Try out the Document Information Extraction service with this showcase

Exercises

Known Issues

Please see the issues section.

How to obtain support

If you would like to contribute to this project, ask a question or get support, please open an issue on GitHub describing your question or planned contribution, and we will get in touch.

Licensing

Please see our LICENSE for copyright and license information. Detailed information including third-party components and their licensing/copyright information is available via the REUSE tool.

This version

0.3.4

Oct 13, 2023

Download files

Source Distribution

shegox-oidc-test-0.3.4.tar.gz (28.8 kB)

Uploaded Oct 13, 2023 Source

Built Distribution

shegox_oidc_test-0.3.4-py3-none-any.whl (31.4 kB)

Uploaded Oct 13, 2023 Python 3

File details

Details for the file shegox-oidc-test-0.3.4.tar.gz.

File metadata

Download URL: shegox-oidc-test-0.3.4.tar.gz
Upload date: Oct 13, 2023
Size: 28.8 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/4.0.2 CPython/3.11.6

File hashes

Hashes for shegox-oidc-test-0.3.4.tar.gz
Algorithm	Hash digest
SHA256	`af861bed32ee6b1f6014262093ea05278d6970b780dff90f68d485f354be90ae`
MD5	`114adb4e4163fa9375e2acd6f9c088c1`
BLAKE2b-256	`d97a1d4219d49948b6ac97e2fc9681ec3a01422d021f3b6ebee9388ce98deb42`

File details

Details for the file shegox_oidc_test-0.3.4-py3-none-any.whl.

File metadata

Download URL: shegox_oidc_test-0.3.4-py3-none-any.whl
Upload date: Oct 13, 2023
Size: 31.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/4.0.2 CPython/3.11.6

File hashes

Hashes for shegox_oidc_test-0.3.4-py3-none-any.whl
Algorithm	Hash digest
SHA256	`edce17ed2eb96d1fe107a913ede3c9be2be87145a6d3392ad1b756eb6bd9af68`
MD5	`ba760b52ee1629f86c6af82a8725b1ec`
BLAKE2b-256	`7e334f9f1f6dda5160f5d292c936d82f5c7e5d33e0bbba2092cad0386c477af6`
