A graphical user interface for processing images with the Mistral OCR API.

These details have not been verified by PyPI

Project links

Homepage

Environment
- Win32 (MS Windows)
Operating System
- OS Independent
Programming Language
- Python :: 3
Topic
- Scientific/Engineering :: Image Recognition

Project description

Mistral OCR GUI Processor

A user-friendly desktop application, built with Python and Tkinter, to perform Optical Character Recognition (OCR) on images using the powerful Mistral AI API. The tool provides a simple interface to process individual images, entire folders, or nested subfolders, and can combine the extracted text into organized markdown files.

Features

Intuitive Graphical User Interface: A clean, tabbed interface for different processing modes.
Multiple Processing Modes:
- Individual Images: Process one or more specific image files.
- Folder Processing: Process all supported images within a single folder.
- Subfolder Processing: Process all supported images within the immediate subdirectories of a parent folder.
Drag-and-Drop Support: Easily add files or folders by dragging them onto the application window.
Markdown Combination: Automatically combine OCR results from multiple images into a single, well-structured markdown file.
Flexible Sorting Options: When combining files, sort them by natural filename order or by modification date.
Real-time Progress Visualization: A grid of squares visually represents the status of each image (Processing, Completed, Error).
Cancellable Operations: Stop a long-running processing job at any time.
Concurrent Processing: Uses a thread pool to process multiple images in parallel, speeding up large jobs.

Requirements

Python 3.8 or higher.
A Mistral AI account and an API key. You can get one from the Mistral AI Platform.

Installation

The recommended way to install the application is using pip:

pip install mistral-ocr-gui

This command will download the application and automatically install all required dependencies.

Installation from Source (for Developers)

If you want to contribute to the project or install the latest development version, you can install it from the source code:

Clone the repository:

git clone https://github.com/Danielnara24/Mistral-OCR.git
cd Mistral-OCR

Install in editable mode: This command will install the application and its dependencies. The -e flag allows you to make changes to the source code and have them take effect immediately.
```
pip install -e .
```

Configuration: Setting the API Key

The application requires your Mistral API key to be set as an environment variable named MISTRAL_API_KEY.

Windows

Open Command Prompt or PowerShell and run the following command to set the variable permanently. You will need to restart your terminal for the change to take effect.

setx MISTRAL_API_KEY "your_api_key_here"

macOS / Linux

Add the following line to your shell's configuration file (e.g., ~/.bashrc, ~/.zshrc, or ~/.profile):

export MISTRAL_API_KEY="your_api_key_here"

Then, either restart your terminal or run source ~/.bashrc (or the relevant file) to apply the changes.

Verify the Setup

You can verify that the environment variable is set correctly by running:

(macOS/Linux): echo $MISTRAL_API_KEY
(Windows CMD): echo %MISTRAL_API_KEY%
(Windows PowerShell): echo $env:MISTRAL_API_KEY

Usage

Once installed, the application can be launched directly from your terminal. Simply run the following command:

mistral-ocr

The graphical user interface will appear, and you can start processing your images.

How It Works

Select a Tab:
- Individual Images: Use this for processing a specific set of image files from different locations. Each image will generate a corresponding _OCR.md file in its original directory.
- Folder: Use this to process all images inside a single folder. You have the option to combine all the results into one markdown file named Combined_OCR_[FolderName].md.
- Subfolders: Use this to process images located in the immediate subfolders of a parent directory. You can combine results for each subfolder individually and even create a final compilation of all subfolder results.
Add Files/Folders:
- Click the "Select..." button to open a file/folder dialog.
- Or, drag and drop your files/folders directly onto the application window.
Configure Settings (if applicable):
- For Folder and Subfolder modes, check the box to enable combining markdown files.
- Choose a sorting method for the combined document.
Process:
- Click the "Process" button to start the OCR job.
- The progress grid will update in real-time, showing the status of each file.
- A timer will show the elapsed time.
Output:
- The generated markdown files (.md) will be saved in the same directory as the source images or in the relevant parent/subfolder directory for combined files.

License

This project is licensed under the MIT License.

Project details

These details have not been verified by PyPI

Project links

Homepage

Environment
- Win32 (MS Windows)
Operating System
- OS Independent
Programming Language
- Python :: 3
Topic
- Scientific/Engineering :: Image Recognition

Release history Release notifications | RSS feed

0.4.0

Nov 12, 2025

0.3.0

Nov 11, 2025

This version

0.2.2

Oct 29, 2025

0.2.1

Oct 29, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mistral_ocr_gui-0.2.2.tar.gz (15.8 kB view details)

Uploaded Oct 29, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

mistral_ocr_gui-0.2.2-py3-none-any.whl (14.8 kB view details)

Uploaded Oct 29, 2025 Python 3

File details

Details for the file mistral_ocr_gui-0.2.2.tar.gz.

File metadata

Download URL: mistral_ocr_gui-0.2.2.tar.gz
Upload date: Oct 29, 2025
Size: 15.8 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.4

File hashes

Hashes for mistral_ocr_gui-0.2.2.tar.gz
Algorithm	Hash digest
SHA256	`567357d149f54f9e488536bdffc4e5a2729d5a98e58c2f861414fb8f11f4c819`
MD5	`f3906948c2314bff4819c69116bcdab5`
BLAKE2b-256	`9fd230f72fcfe1eea1059435bf7f8b1b8e8c69fca3e2405a4a6d7da0bf8d6a53`

See more details on using hashes here.

File details

Details for the file mistral_ocr_gui-0.2.2-py3-none-any.whl.

File metadata

Download URL: mistral_ocr_gui-0.2.2-py3-none-any.whl
Upload date: Oct 29, 2025
Size: 14.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.4

File hashes

Hashes for mistral_ocr_gui-0.2.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`0dddf1210d52c6f0c91c7f7728781eaa50468beea389df3707b6f9d6458410c9`
MD5	`2a1aa1b33039cde93f708ff45127b29a`
BLAKE2b-256	`656d22a7e1aac03647c4fae3eebd380d97ebefc64dcd6a3abf17507030603b8c`

See more details on using hashes here.

mistral-ocr-gui 0.2.2

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Mistral OCR GUI Processor

Features

Requirements

Installation

Installation from Source (for Developers)

Configuration: Setting the API Key

Windows

macOS / Linux

Verify the Setup

Usage

How It Works

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes