Spire.OCR for Python is a professional OCR component that reads text from images in formats such as JPG, PNG, GIF, BMP, and TIFF.

Project description

Spire.OCR for Python: High-Accuracy OCR API for Efficient Text Extraction from Images

Product Page | Documentation | Examples | Forum | Temporary License | Customized Demo

Spire.OCR for Python is a professional Optical Character Recognition (OCR) library that lets developers extract text from images in common formats, including JPG, PNG, GIF, BMP, and TIFF. Its API is designed to make adding OCR capabilities to a Python application straightforward.

Key Features

  • Text Extraction: Extract text from images with support for commonly used fonts such as Arial, Times New Roman, Courier New, Verdana, Tahoma, and Calibri, in regular, bold, and italic styles.
  • Multilingual Support: Recognize text in multiple languages, including English, Chinese, French, German, Japanese, and Korean, making it suitable for global applications.
  • Image Format Compatibility: Extracts text from JPG, PNG, BMP, GIF, and TIFF image files.
  • Cross-Platform: Runs on Windows, Linux, and macOS.
  • Easy Integration: Seamlessly integrate Spire.OCR for Python into your projects with a simple API that requires minimal coding effort.

Supported Languages:

  • English
  • Chinese
  • Japanese
  • Korean
  • German
  • French

Supported Fonts:

Commonly used fonts are supported, such as:

  • Arial
  • Times New Roman
  • Courier New
  • Verdana
  • Tahoma
  • Calibri

Supported Font Styles:

  • Regular
  • Bold
  • Italic

Supported Image File Formats:

  • JPG
  • PNG
  • BMP
  • GIF
  • TIFF
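
Before handing a file to the scanner, it can help to confirm that its extension matches one of the formats listed above. The following is a minimal sketch using only the standard library. The helper name `is_supported_image` and the extension set (including the alternate `.jpeg` and `.tif` spellings) are assumptions for illustration, not part of the Spire.OCR API.

```python
from pathlib import Path

# Extensions for the formats listed above (JPG, PNG, BMP, GIF, TIFF).
# The alternate spellings .jpeg and .tif are an assumption.
SUPPORTED_EXTENSIONS = {".jpg", ".jpeg", ".png", ".bmp", ".gif", ".tif", ".tiff"}


def is_supported_image(path: str) -> bool:
    """Return True if the file extension matches a listed image format."""
    return Path(path).suffix.lower() in SUPPORTED_EXTENSIONS


if __name__ == "__main__":
    for name in ("Sample.png", "scan.TIFF", "notes.pdf"):
        print(name, is_supported_image(name))
```

The check looks only at the file name, so it does not prove that the file content is a valid image; it is just a cheap early filter.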

Installation

Install Spire.OCR for Python with pip, the Python package installer, by running the following command:

pip install Spire.OCR

Alternatively, download Spire.OCR for Python manually and import it into your project.
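
To confirm that the pip install succeeded, one option is to query the installed distribution's version with the standard library. This is a minimal sketch: the helper name `installed_version` is hypothetical, and it assumes the distribution is registered under the name used in the pip command above.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(distribution: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if absent."""
    try:
        return version(distribution)
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    found = installed_version("Spire.OCR")
    print(found if found else "Spire.OCR is not installed")
```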

Examples

Extract Text from an Image File

from spire.ocr import *

# Create the scanner
scanner = OcrScanner()

# Point the scanner at the OCR model folder and set the recognition language
configureOptions = ConfigureOptions()
configureOptions.ModelPath = r"D:\OCR\win-x64"
configureOptions.Language = "English"
scanner.ConfigureDependencies(configureOptions)

# Run OCR on the image file
scanner.Scan(r"Data\Sample.png")

# Collect the full text, then each text block with its bounding box
text = scanner.Text.ToString() + "\n"
for block in scanner.Text.Blocks:
    rectangle = block.Box
    position = f"{block.Text} -> x : {rectangle.X} , y : {rectangle.Y} , w : {rectangle.Width} , h : {rectangle.Height}"
    text += position + "\n"

# Append the result to output.txt
with open('output.txt', 'a', encoding='utf-8') as file:
    file.write(text + "\n")
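
The per-block formatting in the loop above can be factored into a small function, which also makes it checkable without running the scanner. This is a sketch: `format_block` is a hypothetical helper, and it assumes only the `Text` and `Box` attributes (with `X`, `Y`, `Width`, `Height`) used in the example above. The stand-in object is built with `SimpleNamespace` purely for illustration.

```python
from types import SimpleNamespace


def format_block(block) -> str:
    """Format one text block with its bounding box, as in the example above."""
    box = block.Box
    return (
        f"{block.Text} -> x : {box.X} , y : {box.Y} , "
        f"w : {box.Width} , h : {box.Height}"
    )


if __name__ == "__main__":
    # Stand-in for a scanner block; real blocks come from scanner.Text.Blocks.
    fake = SimpleNamespace(
        Text="Hello",
        Box=SimpleNamespace(X=10, Y=20, Width=100, Height=30),
    )
    print(format_block(fake))
```

With the scanner from the example, the loop body would then reduce to `text += format_block(block) + "\n"`.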

Extract Text from an Image Stream

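from spire.ocr import *

# Create the scanner
scanner = OcrScanner()

# Point the scanner at the OCR model folder and set the recognition language
configureOptions = ConfigureOptions()
configureOptions.ModelPath = r"D:\OCR\win-x64"
configureOptions.Language = "Japan"
scanner.ConfigureDependencies(configureOptions)

# Open the image as a stream and state its format explicitly
image_stream = Stream(r"Data\JapaneseSample.png")
image_format = OCRImageFormat.Png

# Run OCR on the stream and get the recognized text
scanner.Scan(image_stream, image_format)
text = scanner.Text.ToString()

# Append the result to output.txt
with open('output.txt', 'a', encoding='utf-8') as file:
    file.write(text)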

Download files

Source Distributions

No source distribution files are available for this release.

Built Distributions

  • Spire.Ocr-1.9.2-py3-none-win_amd64.whl (10.5 MB): Python 3, Windows x86-64
  • Spire.Ocr-1.9.2-py3-none-manylinux1_x86_64.whl (16.3 MB): Python 3
  • Spire.Ocr-1.9.2-py3-none-macosx_10_7_universal.whl (20.6 MB): Python 3, macOS 10.7+ universal (x86-64, i386, PPC64, PPC)

File details

Details for the file Spire.Ocr-1.9.2-py3-none-win_amd64.whl.

File metadata

  • Download URL: Spire.Ocr-1.9.2-py3-none-win_amd64.whl
  • Upload date:
  • Size: 10.5 MB
  • Tags: Python 3, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.10.11

File hashes

Hashes for Spire.Ocr-1.9.2-py3-none-win_amd64.whl
Algorithm Hash digest
SHA256 8247b51fc9157cdd6b120fa7e9b7df74bf8572c55f7bf6789854e31f12f9479f
MD5 41faaa666372f26ee1c55bcc9d9e525c
BLAKE2b-256 ad4900b4dd374f5aea50cc7f78813c4b5971d077dc4827b68a6cd874ef22bc54

See more details on using hashes here.
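
A downloaded wheel can be checked against the SHA256 digest listed above before it is installed. The following is a minimal sketch with `hashlib`. The function names and the local file path are hypothetical, and the expected digest is the one listed for the Windows wheel.

```python
import hashlib
import hmac
import os

# SHA256 digest listed above for Spire.Ocr-1.9.2-py3-none-win_amd64.whl
EXPECTED_SHA256 = "8247b51fc9157cdd6b120fa7e9b7df74bf8572c55f7bf6789854e31f12f9479f"


def sha256_of_file(path: str, chunk_size: int = 1 << 20) -> str:
    """Hash a file in chunks so a large wheel is not loaded into memory at once."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_digest(path: str, expected_hex: str) -> bool:
    """Compare the file's SHA256 with an expected hex digest (case-insensitive)."""
    return hmac.compare_digest(sha256_of_file(path), expected_hex.lower())


if __name__ == "__main__":
    # Hypothetical local path to the downloaded wheel.
    wheel = "Spire.Ocr-1.9.2-py3-none-win_amd64.whl"
    if os.path.exists(wheel):
        print("digest matches:", matches_digest(wheel, EXPECTED_SHA256))
    else:
        print("wheel not found:", wheel)
```

The same functions work for the Linux and macOS wheels if the expected digest is swapped for the SHA256 value listed for that file.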

File details

Details for the file Spire.Ocr-1.9.2-py3-none-manylinux1_x86_64.whl.

File metadata

File hashes

Hashes for Spire.Ocr-1.9.2-py3-none-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 436601a9f6181d3970dd1d9d3acc697d6b1857786291d029bbd398dcea4d072f
MD5 e36f5b1b0209b476be0a95efd08fabd5
BLAKE2b-256 47c665c170c13ab284972829ead5c8b406621e52900775caf4e6be327d9ad8ff

File details

Details for the file Spire.Ocr-1.9.2-py3-none-macosx_10_7_universal.whl.

File metadata

  • Download URL: Spire.Ocr-1.9.2-py3-none-macosx_10_7_universal.whl
  • Upload date:
  • Size: 20.6 MB
  • Tags: Python 3, macOS 10.7+ universal (x86-64, i386, PPC64, PPC)
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.10.11

File hashes

Hashes for Spire.Ocr-1.9.2-py3-none-macosx_10_7_universal.whl
Algorithm Hash digest
SHA256 7b4faca715ad9d5cf5e0fe34adfca55a44e20e1def7c24e8de08382287019576
MD5 ea22cc2c5f1ce5ba062189faa505e3c7
BLAKE2b-256 969cc88b8260f58a61f06676b9f9fcbb8a36147f4005654c6d42791d8aeea285
