Skip to main content

An OCR (Optical Character Recognition) utility for text extraction from images.

Project description

Easy-to-Use Apple Vision wrapper for text extraction and clustering

Python License Maintainer

apple_ocr is a utility for Optical Character Recognition (OCR) that facilitates the extraction of text from images. This Python-based tool is designed to help developers, researchers, and enthusiasts in the field of text extraction and clustering. It leverages a combination of various technologies to achieve this, including the Vision framework provided by Apple.

Plot

Features

  • Text Recognition: apple_ocr uses the Vision framework to recognize text within an image. It extracts recognized text and provides information about its confidence levels.

  • Clustering: The tool can perform K-Means clustering on the extracted data. It groups similar text elements together based on their coordinates.

  • Interactive 3D Visualization: apple_ocr offers an interactive 3D scatter plot using Plotly, displaying the clustered text elements. This visualization helps users gain insights into the distribution of text and text density.

Dependencies

The script relies on the following Python libraries:

  • Torch
  • NumPy
  • Pandas
  • Pillow
  • Scikit-learn
  • Plotly
  • Pyobjc

Usage

Here's how you can use apple_ocr:

  1. Installation: Install the required libraries, including Torch, NumPy, Pandas, Pillow, scikit-learn, and Plotly.

  2. Initialization: Create an instance of the OCR class, providing an image to be processed.

from apple_ocr.ocr import OCR
from PIL import Image

image = Image.open("your_image.png")
ocr_instance = OCR(image=image)
  1. Text Recognition: Use the recognize method to perform text recognition. It will return a structured DataFrame containing recognized text, bounding box dimensions, text density, and centroid coordinates.
dataframe = ocr_instance.recognize()
  1. Clustering: Use the cluster method to perform K-Means clustering on the recognized text data. This method assigns cluster labels to each data point based on their coordinates.
cluster_labels = ocr_instance.cluster(dataframe, num_clusters=3)
  1. Visualization: Finally, use the scatter method to create an interactive 3D scatter plot. This plot visualizes the clustered text elements, including centroids, text density, and more.
ocr_instance.scatter()

Example

Here's an example of the entire process:

from apple_ocr.ocr import OCR
from PIL import Image

image = Image open("your_image.png")
ocr_instance = OCR(image=image)
dataframe = ocr_instance.recognize()
cluster_labels = ocr_instance.cluster(dataframe, num_clusters=3)
ocr_instance.scatter()

Citing this project

If you use this code in your research, please use the following BibTeX entry.

@misc{louisbrulenaudet2023,
	author = {Louis Brulé Naudet},
	title = {Easy-to-Use Apple Vision wrapper for text extraction and clustering},
	howpublished = {\url{https://github.com/louisbrulenaudet/apple-ocr}},
	year = {2023}
}

Feedback

If you have any feedback, please reach out at louisbrulenaudet@icloud.com.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

apple-ocr-1.0.8.tar.gz (9.2 kB view details)

Uploaded Source

Built Distribution

apple_ocr-1.0.8-py3-none-any.whl (9.4 kB view details)

Uploaded Python 3

File details

Details for the file apple-ocr-1.0.8.tar.gz.

File metadata

  • Download URL: apple-ocr-1.0.8.tar.gz
  • Upload date:
  • Size: 9.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.7

File hashes

Hashes for apple-ocr-1.0.8.tar.gz
Algorithm Hash digest
SHA256 9ede4888ac808d33a6078df019a7fddcff44f7018fe8a481abb24e6aadeb9649
MD5 406acb265735743b722a09eda8d45e06
BLAKE2b-256 f243bb901cd3b46019297058a49f3c087486d76ba8cc129f891299463dd4e70d

See more details on using hashes here.

File details

Details for the file apple_ocr-1.0.8-py3-none-any.whl.

File metadata

  • Download URL: apple_ocr-1.0.8-py3-none-any.whl
  • Upload date:
  • Size: 9.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.7

File hashes

Hashes for apple_ocr-1.0.8-py3-none-any.whl
Algorithm Hash digest
SHA256 5026030e4e3f2bc8e997431c2cb7ef00eeca560d4c67a7a69ac94b7938ac1e74
MD5 1a4e54a08f3fe45429c7fc96eb88228f
BLAKE2b-256 b3ba7872de2c0fe42ff79e7cc685cca80dd9905c751fd6f757fa2b4cf473be1e

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page