Aspose OCR Cloud 5.0 API

These details have not been verified by PyPI

Project description

AsposeOCRCloudSDK


Python Cloud REST API for OCR

Aspose.OCR Cloud Python SDK is a simple OCR library that you can use in your application to convert images to text with only a few lines of code. It is part of a set of SDKs for optical character recognition and document scanning in the Aspose Cloud. It reads and recognizes text in the most commonly used raster image formats. Pass an image to the Aspose.OCR Cloud API, and it returns a response with the recognized text.

It is easy to get started with Aspose.OCR Cloud, and there is nothing to install. Create an account at Aspose Cloud and get your application information, and you are ready to use the SDKs.

Try Online

Image to Text	Image to Searchable PDF	PDF OCR	Receipt Scanner

Release 22.12

What was changed

This is a major release of Aspose.OCR Cloud that delivers significant new features, enhancements to existing features, performance improvements, and fixes. The most important features supported in this release are listed below:

Image-to-speech conversion
- Convert almost any picture or photo with readable characters into a natural human voice that can be played in the background or downloaded.
Advanced image binarization
- A specialized neural network converts images to black and white for better text recognition.
Automatic skew correction
- Detect the image skew angle and automatically correct the tilt.
Dewarping
- Detect perspective distortions and automatically straighten the image.
Upscaling
- Intelligently enhance image resolution without losing content or quality.

Features

Automated skew correction
Automated and manual document layout detection
Recognize documents with complex layouts in fully automatic mode or with manual corrections.
Extract and recognize text from images via OCR
Supports multiple international languages
High-speed recognition that requires no local hardware resources
Receipt recognition
Table image recognition
Supports PDF Recognition
Text correction using spell checking algorithms
Various output formats: Text, Searchable PDF, hOCR, Excel for tables.

Recognize text of different languages

Aspose.OCR Cloud supports 38 languages: English, German, French, Italian, Spanish, Portuguese, Polish, Slovene, Slovak, Dutch, Lithuanian, Latvian, Danish, Norwegian, Finnish, Serbian, Croatian, Czech, Swedish, Estonian, Romanian, Chinese, Arabic, Hindi, Russian, Ukrainian, Bengali, Tibetan, Thai, Urdu, Turkish, Korean, Indonesian, Hebrew, Javanese, Greek, Japanese, and Persian.

Save OCR As

TXT, PDF, HOCR

Read OCR Formats

BMP, JPG, GIF, PNG, TIFF
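
Before uploading a file, it can help to check locally that it is one of the input formats listed above. The following is a minimal sketch using only the Python standard library; the function name and the exact extension spellings (for example ".jpeg" and ".tif") are assumptions for illustration and are not part of the SDK.

```python
from pathlib import Path

# Input formats listed in this README. The extension spellings are an
# assumption; the service itself decides what it accepts.
SUPPORTED_INPUT_EXTENSIONS = {".bmp", ".jpg", ".jpeg", ".gif", ".png", ".tif", ".tiff"}


def is_supported_image(path: str) -> bool:
    """Return True if the file extension matches one of the listed input formats."""
    return Path(path).suffix.lower() in SUPPORTED_INPUT_EXTENSIONS
```

This only inspects the file name, so it is a quick pre-check rather than a validation of the file contents.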

How to use the SDK?

Our API is completely independent of your operating system, database system, or development language. You can use any language and platform that supports HTTP to interact with our API. However, manually writing client code can be difficult, error-prone, and time-consuming. Therefore, we have provided and support SDKs in many development languages to make it easier to integrate with us.
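
To illustrate what "any platform that supports HTTP" might look like without the SDK, here is a hedged sketch that builds (but does not send) a token request with the Python standard library. The token URL is the one used in the SDK configuration in this README; the OAuth2 client-credentials form fields and the helper name are assumptions, so check the Aspose Cloud documentation before relying on them.

```python
import urllib.parse
import urllib.request

# Token URL as it appears in the SDK configuration in this README.
TOKEN_URL = "https://api.aspose.cloud/connect/token"


def build_token_request(client_id: str, client_secret: str) -> urllib.request.Request:
    """Build a POST request for an access token.

    Assumption: the endpoint accepts a standard OAuth2 client-credentials
    form (grant_type, client_id, client_secret).
    """
    form = urllib.parse.urlencode({
        "grant_type": "client_credentials",
        "client_id": client_id,
        "client_secret": client_secret,
    }).encode("ascii")
    return urllib.request.Request(
        TOKEN_URL,
        data=form,
        headers={"Content-Type": "application/x-www-form-urlencoded"},
        method="POST",
    )
```

Sending the request with `urllib.request.urlopen` and attaching the returned token to later calls is exactly the kind of boilerplate the SDK is meant to handle for you.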

OCR in Python

	
    import base64
    import json

    from AsposeOCRCloudSDK import api_client, configuration
    from AsposeOCRCloudSDK.model import dsr_mode
    from AsposeOCRCloudSDK.model.language import Language
    from AsposeOCRCloudSDK.model.ocr_recognize_image_body import OCRRecognizeImageBody
    from AsposeOCRCloudSDK.model.ocr_settings_recognize_image import OCRSettingsRecognizeImage
    from AsposeOCRCloudSDK.paths.v5_recognize_image import post  # noqa: E501
    from AsposeOCRCloudSDK.paths.v5_recognize_image.get import GetRecognizeImage

    # Fill in app_sid and api_key with the values from your Aspose Cloud account.
    config = configuration.Configuration(host="https://api.aspose.cloud",
                                         token_url='https://api.aspose.cloud/connect/token',
                                         app_sid="",
                                         api_key="")

    used_api_client = api_client.ApiClient(configuration=config)
    api = post.ApiForpost(api_client=used_api_client)

    # Read the image and encode it as a base64 string.
    img_path = 'path/to/image'
    with open(img_path, 'rb') as f:
        img = f.read()
    str_img = base64.encodebytes(img).decode('utf-8')

    # Submit the recognition task.
    settings = OCRSettingsRecognizeImage(language=Language.ENGLISH, dsrMode=dsr_mode.DsrMode.NO_DSR_NO_FILTER,
                                         makeBinarization=False, makeSkewCorrect=False)
    body = OCRRecognizeImageBody(image=str_img, settings=settings)
    response = api.post(body, timeout=30)
    # Assumption: the POST response body is the ID of the created task.
    task_id = response.response.data.decode()

    # Fetch the task result by its ID and decode the JSON payload.
    d = GetRecognizeImage(used_api_client)
    response_data = d.get_recognize_image({'id': task_id}).response.data
    response_data = json.loads(response_data.decode())
    assert response_data['responseStatusCode'] == 'Ok' and response_data['taskStatus'] == 'Completed'

    # Each result carries the recognized text as base64.
    for data in response_data['results']:
        result = data['data']
        result_dec = base64.b64decode(result).decode()
        print(result_dec)
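
The example above checks the task status once and asserts that it is already 'Completed'. If a task takes longer, a small polling loop may be more robust. The sketch below is independent of the SDK: `fetch_status` is a hypothetical callable that you would implement yourself (for instance, by wrapping the GET call from the example and returning the parsed JSON), and the attempt count and delay are arbitrary assumptions. Only the `taskStatus`, `results`, and `data` field names are taken from the example.

```python
import base64
import time
from typing import Any, Callable, Dict, List


def wait_for_task(fetch_status: Callable[[], Dict[str, Any]],
                  attempts: int = 10,
                  delay_seconds: float = 1.0) -> Dict[str, Any]:
    """Call fetch_status until taskStatus is 'Completed' or attempts run out."""
    for attempt in range(attempts):
        status = fetch_status()
        if status.get("taskStatus") == "Completed":
            return status
        # Sleep between attempts, but not after the last one.
        if attempt < attempts - 1:
            time.sleep(delay_seconds)
    raise TimeoutError(f"task not completed after {attempts} attempts")


def decode_results(status: Dict[str, Any]) -> List[str]:
    """Decode the base64 'data' field of every entry in 'results'."""
    return [base64.b64decode(item["data"]).decode() for item in status.get("results", [])]
```

Passing the fetch step in as a callable keeps the loop easy to test with canned responses and free of any dependency on the SDK classes.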

Quickstart

To build your solution with the SDK, follow these steps:

1. Get API keys if you haven't

Create a personal account on the Aspose Cloud Dashboard and click Get Keys. The same keys work for all Aspose Cloud products. If you have any trouble, see the detailed manual.

2. Run Demo

Check out the SDK from the repository or install it with pip (pip install aspose-ocr-cloud)
Set your AppSid and AppKey
Run the Python console demo or the unit tests
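
One way to set the AppSid and AppKey without hard-coding them is to read them from environment variables. This is a minimal sketch; the variable names ASPOSE_APP_SID and ASPOSE_API_KEY and the helper name are assumptions chosen for illustration, not names defined by the SDK.

```python
import os
from typing import Mapping, Tuple


def load_credentials(env: Mapping[str, str] = os.environ) -> Tuple[str, str]:
    """Return (app_sid, api_key) read from the given environment mapping.

    The variable names are hypothetical; pick whatever fits your deployment.
    """
    app_sid = env.get("ASPOSE_APP_SID", "")
    api_key = env.get("ASPOSE_API_KEY", "")
    if not app_sid or not api_key:
        raise RuntimeError("Set ASPOSE_APP_SID and ASPOSE_API_KEY before running the demo")
    return app_sid, api_key
```

The returned values can then be passed as app_sid and api_key when creating the SDK configuration object.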

Structure

This project includes:

Python console demo application - "./demo"
Module "asposeocrcloud" - the SDK itself, located in "./asposeocrcloud". You can integrate it into your application. It contains both the OCR and Aspose.Storage APIs.
Module "test" - "./test" Unit tests. Take a look at them to see various code examples.
Module "demo" - "./demo" Sample console demo project.
Folder "docs" - "./docs" Full documentation for Aspose.OCR SDK in HTML format.

Dependencies

See requirements.txt

Aspose.OCR Cloud SDKs


.NET & Core	Java	Python	Node.js	Android

Project details


Release history

24.11.0 (Nov 21, 2024)
24.8.0 (Aug 9, 2024)
23.12.0 (Jan 9, 2024)
23.11.0 (Dec 1, 2023)
23.6.0.2 (Jul 1, 2023)
22.12.0.1 (Dec 19, 2022), this version
22.12.0 (Dec 19, 2022)
21.9.0 (Sep 20, 2021)
20.8.1 (Aug 9, 2020)

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release. See tutorial on generating distribution archives.

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.


aspose_ocr_cloud-22.12.0.1-py3-none-any.whl (152.1 kB view details)

Uploaded Dec 19, 2022 Python 3

File details

Details for the file aspose_ocr_cloud-22.12.0.1-py3-none-any.whl.

File metadata

Download URL: aspose_ocr_cloud-22.12.0.1-py3-none-any.whl
Upload date: Dec 19, 2022
Size: 152.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.2 CPython/3.7.12

File hashes

Hashes for aspose_ocr_cloud-22.12.0.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`cb08f83175334b6b33cad2faeaf6ed1db285b81e700675c2896508bf2ebc7455`
MD5	`34846fc6a42b2a9fc90efe2ae6312444`
BLAKE2b-256	`2042aa08124d4a18a76461043cdd67be5df0d37972f6a0753a0b92b102c277f4`

See more details on using hashes here.
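
As an illustration of how the published digests can be used, the sketch below computes the SHA256 of a downloaded file with the standard library and compares it to the value listed above. The function names and the chunk size are assumptions for illustration; the expected digest is copied from the table.

```python
import hashlib

# SHA256 digest published above for aspose_ocr_cloud-22.12.0.1-py3-none-any.whl
EXPECTED_SHA256 = "cb08f83175334b6b33cad2faeaf6ed1db285b81e700675c2896508bf2ebc7455"


def sha256_of_file(path: str, chunk_size: int = 65536) -> str:
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path: str, expected: str = EXPECTED_SHA256) -> bool:
    """Return True if the file's SHA256 equals the expected hex digest."""
    return sha256_of_file(path) == expected.lower()
```

A call such as `matches_expected("aspose_ocr_cloud-22.12.0.1-py3-none-any.whl")` should return True for an intact download of the wheel listed here.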
