Aspose OCR Cloud 5.0 API
Project description
Aspose.OCR Cloud for Python SDK 24.11.0
Aspose.OCR Cloud is an optical character recognition as a service. With it, you can easily add OCR functionality to almost any device or platform, including netbooks, mini PCs, or even entry-level smartphones.
Our engine can read text from images, photos, screenshots and scanned PDFs in a wide variety of European, Cyrillic and Oriental fonts, returning results in the most popular document formats. Powerful built-in image processing filters based on neural networks automatically correct skewed and distorted images, automatically remove dirt, smudges, scratches, glare and other image defects that can affect recognition accuracy. To further improve the results, Aspose.OCR Cloud has a built-in spell checker that automatically replaces misspelled words and saves you the trouble of manually correcting the recognition results.
Even the complex recognition tasks can be done with a couple of API calls. To make interacting with Aspose.OCR Cloud services from Python applications even easier, we provide the software development kit (SDK) for Python. It handles all the routine operations such as establishing connections, sending API requests, and parsing responses, wrapping all these tasks into a few simple classes.
Aspose.OCR Cloud SDK for Python is open source under the MIT license. You can freely use it for any projects, including commercial and proprietary applications, as well as modify any part of its code.
Try Online
Image to Text | Image to Searchable PDF | PDF OCR | Receipt Scanner |
---|---|---|---|
What was changed in version 24.11.0
A summary of recent changes, enhancements and bug fixes in Aspose.OCR Cloud SDK for Java 24.11.0 release:
Key | Summary | Category |
---|---|---|
OCR-3977 | Added recognition of the Uyghur language, including mixed Uyghur/English texts. | New feature |
OCR-3978 | Added recognition of the Telugu language, including mixed Telugu/English texts. | New feature |
OCR-3979 | Added recognition of the Kannada language, including mixed Kannada/English texts. | New feature |
OCR-3980 | Added recognition of the Tamil language, including mixed Tamil/English texts. | New feature |
OCR-3985 | Added recognition of the Devanagari-based scripts, including mixed Devanagari/English texts. | New feature |
OCR-3986 | Added support for mixed Arabic/English texts. | Enhancement |
OCR-3984 | Added support for mixed Japanese/English texts. | Enhancement |
OCR-3983 | Added support for mixed Korean/English texts. | Enhancement |
OCR-3982 | Added support for mixed Chinese/English texts. | Enhancement |
OCR-3987 | Added support for mixed Persian/English texts. | Enhancement |
REST API changes: https://releases.aspose.cloud/ocr/release-notes/2024/aspose-ocr-cloud-24-11-0-release-notes/
New recognition languages
The following recognition languages have been added:
Script | REST API |
---|---|
Uyghur | Language.Uyghur |
Telugu | Language.Telugu |
Kannada | Language.Kannada |
Tamil | Language.Tamil |
Devanagari-based languages | Language.Devanagari |
All of the OCR languages mentioned above also support the recognition of mixed texts, including those with Latin characters.
Mixed language support
Aspose.OCR Cloud now supports the recognition of texts which include both native and Latin characters for the following languages:
- Arabic (
Language.Arabic
) - Chinese (
Language.Chinese
) - Japanese (
Language.Japanese
) - Korean (
Language.Korean
) - Persian (
Language.Persian
)
Updated public APIs:
No changes
Removed public APIs:
No changes.
Quickstart
Make your solution using SDK, follow these steps:
1. Get API keys if you haven't
Make a personal account on Aspose Cloud Dashboard and click Get Keys. These keys are useful for all Aspose Cloud products. If you have any trouble, look at this detailed manual.
2. Run Demo
- Checkout the SDK or get from pip (pip install aspose-ocr-cloud)
- Set Your AppSid & AppKey
- Run Python console Demo
OCR in Python
# Get your ClientId and ClientSecret from https://dashboard.aspose.cloud (free registration required).
with aspose_ocr_cloud.ApiClient(config) as api_client:
# Create an instance of the API class
api_instance = recognize_image_api.RecognizeImageApi(api_client)
# Read image file and conver it into base64 string
image_file_path = os.path.join(os.path.dirname(os.path.abspath(__file__)), '../samples/latin.png')
# Create request body
body = OCRRecognizeImageBody(
image=utils.file_to_base64(image_file_path),
settings=OCRSettingsRecognizeImage(
language=Language("English"),
makeSkewCorrect=False,
makeBinarization=False,
makeSpellCheck=False,
makeContrastCorrection=False,
makeUpsampling=False,
dsrMode=DsrMode("NoDsrNoFilter"),
dsrConfidence=DsrConfidence("Default"),
resultType=ResultType("Text")
),
)
try:
# Step 1: perform post request
task_id = api_instance.post_recognize_image(
body,
)
print(f'Your task ID is {task_id}')
# Step 2: perform get result request
task_response : OCRResponse = api_instance.get_recognize_image(
id=task_id
)
assert task_response.response_status_code == 'Ok'
assert task_response.task_status == 'Completed'
recognized_text_raw = task_response.results[0].data
recognized_text = bytearray(base64.b64decode(str(recognized_text_raw))).decode('utf-8')
print (f'Recognized text:\n{recognized_text}')
print (f'Task completed.Press Enter to continue')
input()
except aspose_ocr_cloud.ApiException as e:
print("Exception when calling RecognizeImageApi: %s\n" % e)
Structure
This project includes:
- Python console demo application - "./example"
- Module "asposeocrcloud" - this is SDK located in "./asposeocrcloud". You can integrate it in your application.
- Module "test" - "./test" UnitTest. You can take a look at them to see various code examples.
- Folder "docs" - "./docs" Full documentation for Aspose.OCR SDK in HTML format.
Dependencies
- Python 3.9
- See requirements.txt
Aspose.OCR Cloud SDKs
.NET & Core | Java | Python | Node.js | Android |
Product Page | Documentation | API Reference | Code Samples | Blog | Free Support | Free Trial
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file aspose_ocr_cloud-24.11.0.tar.gz
.
File metadata
- Download URL: aspose_ocr_cloud-24.11.0.tar.gz
- Upload date:
- Size: 56.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.9.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 64ac1a08d06fa0046c5ddbe5c8ec128823981fe15d9402aea5b7f331776eafb1 |
|
MD5 | d42dacd7ea3be81b67a76edab8c85353 |
|
BLAKE2b-256 | 1e08b7ecb773718d85a0470748bec0d533da556d015ba6c40c92f6401d17407e |
File details
Details for the file aspose_ocr_cloud-24.11.0-py3-none-any.whl
.
File metadata
- Download URL: aspose_ocr_cloud-24.11.0-py3-none-any.whl
- Upload date:
- Size: 143.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.9.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | c156107f48b6987c542c7657744a25295405de2acd60df7538a313d2705d8697 |
|
MD5 | 7d4b138117c86f60b577a580600c3927 |
|
BLAKE2b-256 | d641c88a41a4b643f47d5dfc91cad0a4e30d5c32a7f105b1092c1a04f8366441 |