Skip to main content

Aspose.OCR for Python is a powerful yet easy-to-use and cost-effective API for extracting text from scanned images, photos, screenshots, PDF documents, and other files.

Project description

Product Page | Documentation | Demos | Blog | API Reference | Search | Free Support | Temporary License

Try our Free Online Apps demonstrating some of the most popular Aspose.OCR functionality.

Seamlessly integrate advanced optical character recognition into your Python projects with ease. Designed for all-in-one functionality and developer-friendly simplicity, this solution enables you to convert images, scans, PDFs, illustrations, and screenshots into accurate, editable, and searchable text with just a few lines of code. Convert scanned documents into searchable, indexable PDFs, extract and compare text from images, and streamline workflows with unmatched precision and performance. Aspose OCR API is suitable for prototypes, cross-platform applications, and cloud services.

What is optical character recognition (OCR)?

Optical Character Recognition (OCR) is a technology that converts images of printed or handwritten text into machine-readable, editable, and searchable digital text. By analyzing shapes and patterns within an image, OCR identifies characters—letters, numbers, and symbols—and translates them into text that computers can process. This powerful technology enables the extraction of information from physical documents, images, and forms, transforming them into actionable digital data. OCR also plays a vital role in accessibility, providing visually impaired individuals access to text through screen readers and other assistive technologies.

Recognition languages

Aspose OCR is the perfect solution for global applications and multi-lingual documents. With support for nearly all languages, it ensures accurate text extraction across a wide range of scripts and alphabets. Our engine excels at handling mixed-language documents, such as those featuring both Chinese and English, and can automatically detect the language of the input text during recognition.

  • Extended Latin (English, French, German, Spanish, Portuguese, and other European languages)
  • Cyrillic (Russian, Ukrainian, Bulgarian, and more)
  • Arabic and Persian
  • Chinese
  • Hindi and other Devanagari-based scripts
  • Korean
  • Japanese

Aspose OCR can also extract text from handwritten notes and street photos.

Key features

  • Universal – Effortlessly extract text from any image, whether it's a high-quality scan or a casual street photo, captured via scanner or camera.
  • Fast – Achieve rapid text recognition, processing a page in just seconds with minimal resource consumption. Adjust recognition speed, quality, and resource usage according to your specific needs.
  • Developer-friendly – With just a few lines of code, easily convert images to text, create searchable PDFs, save recognition results to documents, and more, integrating OCR into your workflows seamlessly.
  • Modular – Maintain a lean and efficient application by selectively incorporating advanced features from our comprehensive resource repository.
  • Reliable – Experience exceptional recognition accuracy, even with challenging images that are blurry, rotated, distorted, or noisy.

Supported file formats

Aspose OCR can handle nearly any file format from scanners or cameras, including:

  • JPEG, PNG, BMP, and GIF images
  • Single-page and multi-page TIFF files
  • PDF documents
  • DjVu files

All above-mentioned files can be read directly from web links. You can also use Aspose OCR to perform bulk recognition on images within folders and archives, streamlining large-scale document processing.

The recognition results are returned in popular file and data exchange formats, enabling easy storage, database import, or real-time analysis:

  • Plain Text
  • Searchable PDF
  • Microsoft Word or Excel documents
  • HTML
  • RTF
  • ePUB
  • JSON and XML
  • Markdown

Platforms

  • Windows
  • Linux
  • macOS

Developer-friendly

Aspose OCR is designed by developers for developers, offering a straightforward and intuitive interface. You only need 6 lines of code to convert image to text:

import aspose.ocr as ocr
# Initialize OCR engine
api = AsposeOcr()
# Add images to recognition batch
input = OcrInput(InputType.SINGLE_IMAGE)
input.add("1.png")
# Recognize images
results = api.recognize(input)
# Print result
print(results[0].recognition_text)

Licensing

You can begin using the Aspose OCR library immediately after installation. In evaluation mode (without providing a license), you can recognize text in any supported language and save recognition results in all supported formats, with a few limitations:

  • If the number of characters in a recognized image exceeds 300, only the first 300 characters will be recognized.
  • If the number of characters is fewer than 300, only the first 60% of the characters will be recognized.

To unlock all features, request a temporary license, which removes all trial limitations for 30 days. This gives you the opportunity to build a fully functional OCR application and evaluate Aspose.OCR before making your final purchase decision.

Product Page | Documentation | Demos | Blog | API Reference | Search | Free Support | Temporary License

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

aspose_ocr_python_net-26.5.0-py3-none-win_amd64.whl (127.6 MB view details)

Uploaded Python 3Windows x86-64

aspose_ocr_python_net-26.5.0-py3-none-win32.whl (122.9 MB view details)

Uploaded Python 3Windows x86

aspose_ocr_python_net-26.5.0-py3-none-macosx_11_0_arm64.whl (132.3 MB view details)

Uploaded Python 3macOS 11.0+ ARM64

aspose_ocr_python_net-26.5.0-py3-none-macosx_10_14_x86_64.whl (137.2 MB view details)

Uploaded Python 3macOS 10.14+ x86-64

File details

Details for the file aspose_ocr_python_net-26.5.0-py3-none-win_amd64.whl.

File metadata

File hashes

Hashes for aspose_ocr_python_net-26.5.0-py3-none-win_amd64.whl
Algorithm Hash digest
SHA256 2af8e1aa24d53369493a0888c03e07d2badc41007b5f90dd4131fb71a59c7390
MD5 8358e430c4d3a505c81ca39bd1169dff
BLAKE2b-256 a4793a6a0355030aa8b346fb4b9303e0b0a2cf6c06f0b7370fc5e38e013bce1c

See more details on using hashes here.

File details

Details for the file aspose_ocr_python_net-26.5.0-py3-none-win32.whl.

File metadata

File hashes

Hashes for aspose_ocr_python_net-26.5.0-py3-none-win32.whl
Algorithm Hash digest
SHA256 7c60209e2364c3400411e6ebcf65fd84466db946bd41d4d1786fa8d145689079
MD5 d7e80ead613690d92feeac83fbfbea6b
BLAKE2b-256 2803a0b8ededb6ba2618bd7bae2d0ddbe8d9b34f29d59641162f1e2b21fa9d82

See more details on using hashes here.

File details

Details for the file aspose_ocr_python_net-26.5.0-py3-none-manylinux1_x86_64.whl.

File metadata

File hashes

Hashes for aspose_ocr_python_net-26.5.0-py3-none-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 0b8a876a8eafaf0b4da21c7d38df02e5e70739cd05ec7e0e0535dee58fb8d117
MD5 934f580687764b860ccd383a1c95c78b
BLAKE2b-256 5b4ca803a8c2ea2f79df01d566c85b8ef362c1a22a3d8850ee56c4d6efce01e2

See more details on using hashes here.

File details

Details for the file aspose_ocr_python_net-26.5.0-py3-none-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for aspose_ocr_python_net-26.5.0-py3-none-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 7c892e818a61791e328f0a54b964a5454ad9d9efdba5eee063c5a3a6d54597f8
MD5 9849783e2fd37ab113a99a648444a2d5
BLAKE2b-256 7ef68bcbb5191dc3e5fdf34c4225ee91fceedecf089e3619ef3cdb376357fb56

See more details on using hashes here.

File details

Details for the file aspose_ocr_python_net-26.5.0-py3-none-macosx_10_14_x86_64.whl.

File metadata

File hashes

Hashes for aspose_ocr_python_net-26.5.0-py3-none-macosx_10_14_x86_64.whl
Algorithm Hash digest
SHA256 9d935a122efcf99694f66660a03ac12b63aa518abab4abd0f71389a78a27b9d9
MD5 2163964e0968a13cc8197df2c3d8eb46
BLAKE2b-256 02c5cb6017d528893eeb3314e004dce09d6b42d878b0811e945332ce45a90392

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page