Skip to main content

Aspose OCR Cloud 5.0 API

Project description

Aspose.OCR Cloud for Python SDK 24.11.0

PyPI PyPI - Format PyPI - Downloads GitHub license

Aspose.OCR Cloud is an optical character recognition as a service. With it, you can easily add OCR functionality to almost any device or platform, including netbooks, mini PCs, or even entry-level smartphones.

Our engine can read text from images, photos, screenshots and scanned PDFs in a wide variety of European, Cyrillic and Oriental fonts, returning results in the most popular document formats. Powerful built-in image processing filters based on neural networks automatically correct skewed and distorted images, automatically remove dirt, smudges, scratches, glare and other image defects that can affect recognition accuracy. To further improve the results, Aspose.OCR Cloud has a built-in spell checker that automatically replaces misspelled words and saves you the trouble of manually correcting the recognition results.

Even the complex recognition tasks can be done with a couple of API calls. To make interacting with Aspose.OCR Cloud services from Python applications even easier, we provide the software development kit (SDK) for Python. It handles all the routine operations such as establishing connections, sending API requests, and parsing responses, wrapping all these tasks into a few simple classes.

Aspose.OCR Cloud SDK for Python is open source under the MIT license. You can freely use it for any projects, including commercial and proprietary applications, as well as modify any part of its code.

Try Online

Image to Text Image to Searchable PDF PDF OCR Receipt Scanner
Scan Image Image to Searchable PDF PDF OCR Receipt Scanner

What was changed in version 24.11.0

A summary of recent changes, enhancements and bug fixes in Aspose.OCR Cloud SDK for Java 24.11.0 release:

Key Summary Category
OCR-3977 Added recognition of the Uyghur language, including mixed Uyghur/English texts. New feature
OCR-3978 Added recognition of the Telugu language, including mixed Telugu/English texts. New feature
OCR-3979 Added recognition of the Kannada language, including mixed Kannada/English texts. New feature
OCR-3980 Added recognition of the Tamil language, including mixed Tamil/English texts. New feature
OCR-3985 Added recognition of the Devanagari-based scripts, including mixed Devanagari/English texts. New feature
OCR-3986 Added support for mixed Arabic/English texts. Enhancement
OCR-3984 Added support for mixed Japanese/English texts. Enhancement
OCR-3983 Added support for mixed Korean/English texts. Enhancement
OCR-3982 Added support for mixed Chinese/English texts. Enhancement
OCR-3987 Added support for mixed Persian/English texts. Enhancement

REST API changes: https://releases.aspose.cloud/ocr/release-notes/2024/aspose-ocr-cloud-24-11-0-release-notes/

New recognition languages

The following recognition languages have been added:

Script REST API
Uyghur Language.Uyghur
Telugu Language.Telugu
Kannada Language.Kannada
Tamil Language.Tamil
Devanagari-based languages Language.Devanagari

All of the OCR languages mentioned above also support the recognition of mixed texts, including those with Latin characters.

Mixed language support

Aspose.OCR Cloud now supports the recognition of texts which include both native and Latin characters for the following languages:

  • Arabic (Language.Arabic)
  • Chinese (Language.Chinese)
  • Japanese (Language.Japanese)
  • Korean (Language.Korean)
  • Persian (Language.Persian)

Updated public APIs:

No changes

Removed public APIs:

No changes.

Quickstart

Make your solution using SDK, follow these steps:

1. Get API keys if you haven't

Make a personal account on Aspose Cloud Dashboard and click Get Keys. These keys are useful for all Aspose Cloud products. If you have any trouble, look at this detailed manual.

2. Run Demo

  • Checkout the SDK or get from pip (pip install aspose-ocr-cloud)
  • Set Your AppSid & AppKey
  • Run Python console Demo


OCR in Python

# Get your ClientId and ClientSecret from https://dashboard.aspose.cloud (free registration required).

	
    with aspose_ocr_cloud.ApiClient(config) as api_client:
        # Create an instance of the API class
        api_instance = recognize_image_api.RecognizeImageApi(api_client)

        # Read image file and conver it into base64 string
        image_file_path = os.path.join(os.path.dirname(os.path.abspath(__file__)), '../samples/latin.png')

        # Create request body
        body = OCRRecognizeImageBody(
            image=utils.file_to_base64(image_file_path),
            settings=OCRSettingsRecognizeImage(
                language=Language("English"),
                makeSkewCorrect=False,
                makeBinarization=False,
                makeSpellCheck=False,
                makeContrastCorrection=False,
                makeUpsampling=False,
                dsrMode=DsrMode("NoDsrNoFilter"),
                dsrConfidence=DsrConfidence("Default"),
                resultType=ResultType("Text")
            ),
        )
        try:
            # Step 1: perform post request
            task_id = api_instance.post_recognize_image(
                body,
            )
            print(f'Your task ID is {task_id}')

            # Step 2: perform get result request
            task_response : OCRResponse = api_instance.get_recognize_image(
                id=task_id
            )
            assert task_response.response_status_code == 'Ok'
            assert task_response.task_status == 'Completed'
            recognized_text_raw = task_response.results[0].data
            recognized_text = bytearray(base64.b64decode(str(recognized_text_raw))).decode('utf-8')
            print (f'Recognized text:\n{recognized_text}')
            print (f'Task completed.Press Enter to continue')
            input()

        except aspose_ocr_cloud.ApiException as e:
            print("Exception when calling RecognizeImageApi: %s\n" % e)

Structure

This project includes:

  • Python console demo application - "./example"
  • Module "asposeocrcloud" - this is SDK located in "./asposeocrcloud". You can integrate it in your application.
  • Module "test" - "./test" UnitTest. You can take a look at them to see various code examples.
  • Folder "docs" - "./docs" Full documentation for Aspose.OCR SDK in HTML format.

Dependencies


Aspose.OCR Cloud SDKs

.NET & Core Java Python Node.js Android

Product Page | Documentation | API Reference | Code Samples | Blog | Free Support | Free Trial

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

aspose_ocr_cloud-24.11.0.tar.gz (56.1 kB view details)

Uploaded Source

Built Distribution

aspose_ocr_cloud-24.11.0-py3-none-any.whl (143.2 kB view details)

Uploaded Python 3

File details

Details for the file aspose_ocr_cloud-24.11.0.tar.gz.

File metadata

  • Download URL: aspose_ocr_cloud-24.11.0.tar.gz
  • Upload date:
  • Size: 56.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.9.0

File hashes

Hashes for aspose_ocr_cloud-24.11.0.tar.gz
Algorithm Hash digest
SHA256 64ac1a08d06fa0046c5ddbe5c8ec128823981fe15d9402aea5b7f331776eafb1
MD5 d42dacd7ea3be81b67a76edab8c85353
BLAKE2b-256 1e08b7ecb773718d85a0470748bec0d533da556d015ba6c40c92f6401d17407e

See more details on using hashes here.

File details

Details for the file aspose_ocr_cloud-24.11.0-py3-none-any.whl.

File metadata

File hashes

Hashes for aspose_ocr_cloud-24.11.0-py3-none-any.whl
Algorithm Hash digest
SHA256 c156107f48b6987c542c7657744a25295405de2acd60df7538a313d2705d8697
MD5 7d4b138117c86f60b577a580600c3927
BLAKE2b-256 d641c88a41a4b643f47d5dfc91cad0a4e30d5c32a7f105b1092c1a04f8366441

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page