Spire.OCR for Python is a professional OCR component used to read text from images in formats such as JPG, PNG, GIF, BMP, and TIFF.
Project description
Spire.OCR for Python: High-Accuracy OCR API for Efficient Text Extraction from Images
Product Page 丨 Documentation 丨 Examples 丨 Forum 丨 Temporary License 丨 Customized Demo
Spire.OCR for Python is a professional Optical Character Recognition (OCR) library that lets developers extract text from images in formats including JPG, PNG, GIF, BMP, and TIFF. It provides a simple API for integrating OCR capabilities into Python applications.
Key Features
- Text Extraction: Extract text from images with support for commonly used fonts such as Arial, Times New Roman, Courier New, Verdana, Tahoma, and Calibri, in regular, bold, and italic styles.
- Multilingual Support: Recognize text in multiple languages, including English, Chinese, French, German, Japanese, and Korean, making it suitable for global applications.
- Image Format Compatibility: Supports extraction of text from a wide range of image file formats like JPG, PNG, BMP, GIF, and TIFF.
- Cross-Platform: The OCR feature runs on Windows, Linux, and macOS.
- Easy Integration: Seamlessly integrate Spire.OCR for Python into your projects with a simple API that requires minimal coding effort.
Supported Languages:
- English
- Chinese
- Japanese
- Korean
- German
- French
Supported Fonts:
Commonly used fonts are supported, such as:
- Arial
- Times New Roman
- Courier New
- Verdana
- Tahoma
- Calibri
Supported Font Styles:
- Regular
- Bold
- Italic
Supported Image File Formats:
- JPG
- PNG
- BMP
- GIF
- TIFF
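Before handing files to the scanner, it can help to screen them by extension. The following is a minimal sketch based only on the format list above; the helper names are hypothetical, and treating `.jpeg` and `.tif` as aliases of JPG and TIFF is an assumption rather than something the package states.

```python
from pathlib import Path

# Extensions derived from the formats listed above. The ".jpeg" and ".tif"
# aliases are an assumption, not something the package description states.
SUPPORTED_EXTENSIONS = {".jpg", ".jpeg", ".png", ".bmp", ".gif", ".tif", ".tiff"}


def is_supported_image(path: str) -> bool:
    """Return True if the file extension matches one of the listed formats."""
    return Path(path).suffix.lower() in SUPPORTED_EXTENSIONS


def filter_supported(paths):
    """Keep only the paths whose extension is in SUPPORTED_EXTENSIONS."""
    return [p for p in paths if is_supported_image(p)]
```

The check looks only at the file name, not the file contents, so a mislabeled file would still pass.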
Installation
To install Spire.OCR for Python, you can use pip, the Python package manager. Simply run the following command:
pip install Spire.OCR
Or manually download Spire.OCR for Python and import it into your project.
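To confirm that the pip install succeeded, one option is to query the installed distribution metadata with the standard library. This is a sketch that assumes the distribution is registered under the name `Spire.OCR`, as in the pip command above; the function name `installed_version` is hypothetical.

```python
from importlib import metadata


def installed_version(distribution: str = "Spire.OCR"):
    """Return the installed version string, or None if the package is absent."""
    try:
        return metadata.version(distribution)
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    version = installed_version()
    print(version if version else "Spire.OCR is not installed")
```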
Examples
Extract Text from an Image File
from spire.ocr import *

# Create the scanner and point it at the OCR model files
scanner = OcrScanner()
configureOptions = ConfigureOptions()
configureOptions.ModelPath = r"D:\OCR\win-x64"
configureOptions.Language = "English"
scanner.ConfigureDependencies(configureOptions)

# Run OCR on the image file
scanner.Scan(r"Data\Sample.png")

# Collect the full text, then each block with its bounding box
text = scanner.Text.ToString() + "\n"
for block in scanner.Text.Blocks:
    rectangle = block.Box
    position = f"{block.Text} -> x : {rectangle.X} , y : {rectangle.Y} , w : {rectangle.Width} , h : {rectangle.Height}"
    text += position + "\n"

# Append the result to output.txt
with open('output.txt', 'a', encoding='utf-8') as file:
    file.write(text + "\n")
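The block-formatting loop in the example above can be pulled into a small helper so it is testable without the OCR engine. This is only a sketch: `describe_block` and `describe_blocks` are hypothetical names, and the stand-in objects merely mimic the attributes the example reads (`Text`, `Box.X`, `Box.Y`, `Box.Width`, `Box.Height`); they are not Spire.OCR types.

```python
from types import SimpleNamespace


def describe_block(block) -> str:
    """Format one text block as 'text -> x, y, w, h' using its bounding box."""
    box = block.Box
    return (f"{block.Text} -> x: {box.X}, y: {box.Y}, "
            f"w: {box.Width}, h: {box.Height}")


def describe_blocks(blocks) -> str:
    """Join the descriptions of all blocks, one per line."""
    return "\n".join(describe_block(b) for b in blocks)


# Stand-in object that mimics the attributes used in the example above;
# it is not a Spire.OCR type.
fake_block = SimpleNamespace(
    Text="Hello",
    Box=SimpleNamespace(X=10, Y=20, Width=100, Height=30),
)
print(describe_block(fake_block))  # Hello -> x: 10, y: 20, w: 100, h: 30
```

With a real scanner, `describe_blocks(scanner.Text.Blocks)` would presumably produce the same lines the loop builds, assuming the blocks expose the attributes shown in the example.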
Extract Text from an Image Stream
from spire.ocr import *

# Create the scanner and point it at the OCR model files
scanner = OcrScanner()
configureOptions = ConfigureOptions()
configureOptions.ModelPath = r"D:\OCR\win-x64"
configureOptions.Language = "Japan"
scanner.ConfigureDependencies(configureOptions)

# Open the image as a stream and state its format explicitly
image_stream = Stream(r"Data\JapaneseSample.png")
image_format = OCRImageFormat.Png
scanner.Scan(image_stream, image_format)

# Append the recognized text to output.txt
text = scanner.Text.ToString()
with open('output.txt', 'a', encoding='utf-8') as file:
    file.write(text)
Download files
Built Distributions
File details
Details for the file Spire.Ocr-1.9.0-py3-none-win_amd64.whl.
File metadata
- Download URL: Spire.Ocr-1.9.0-py3-none-win_amd64.whl
- Upload date:
- Size: 60.9 MB
- Tags: Python 3, Windows x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.10.11
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 10725a57fc4d73f4246be10a260f861f923a32a95ee372efaeceec79a198d0de |
| MD5 | 34600df68afe4ae4e89cfac43c964f09 |
| BLAKE2b-256 | 6d149039fb46d0b4139b2ab4681c9af6ea69c6748a174a89dd78dd7e94887566 |
File details
Details for the file Spire.Ocr-1.9.0-py3-none-manylinux1_x86_64.whl.
File metadata
- Download URL: Spire.Ocr-1.9.0-py3-none-manylinux1_x86_64.whl
- Upload date:
- Size: 66.8 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.10.11
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 91f0f27ffa7c404340d9c73f7823ef1a53ff7523c24207befceeee91cf868700 |
| MD5 | 5f2eefc59e7c1e09aebcc57012029b79 |
| BLAKE2b-256 | 2fcd33d47e3553e0c3f809cc670c434e4a19e890a379326bac151fb5d88f6cc5 |
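
A manually downloaded wheel can be checked against the SHA256 digests listed above. The sketch below uses only the standard library; the function names are hypothetical, and the file is read in chunks because the wheels are tens of megabytes in size.

```python
import hashlib


def sha256_of_file(path: str, chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA256 digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path: str, expected_hex: str) -> bool:
    """Compare a file's SHA256 digest with a published value, ignoring case."""
    return sha256_of_file(path) == expected_hex.strip().lower()
```

For example, assuming the Windows wheel was saved in the current directory under its original file name, `matches_published_hash("Spire.Ocr-1.9.0-py3-none-win_amd64.whl", "10725a57fc4d73f4246be10a260f861f923a32a95ee372efaeceec79a198d0de")` should return `True` for an intact download.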