Django app for OCR and translation

These details have not been verified by PyPI

Project links

Source

Project description

OCR_translate

This is a Django app for creating back-end server aimed at performing OCR and translation of images received via a POST request.

The OCR and translation is performed using freely available machine learning models and packages (see below for what is currently implemented).

The server is designed to be used together with this browser extension, acting as a front-end providing the images and controlling the languages, models and plugins being used.

For more information, please consult the Full Documentation

Running the server

See the documentation for more information.

TLDR: If you are on windows you will need to:

install python (3.10 <= SUPPORTED_VERSIONS <= 3.13) with the check on Add python.exe to PATH
download>unzip the release file
run the run-user.[bat/sh] file (bat for windows, sh for linux)

Why do I need to install python

Before version v0.6.0 the server was distributed by including all possible plugins and their dependencies.
This made the distribution file (both the github release and docker image) quite large and the release process cumbersome.
Furthermore, not every user might be interested in every plugin and might end up downloading GB of files that they will never use.

For this reason a plugin_manager has been added to the project that will download/install the plugins and their dependencies on demand.
Installing python packages requires pip to be available (which is included with python on the windows installer).
Unfortunately I have not found a way to include pip reliably in the frozen install produced for the release file.
The alternative would've been to add a 2nd installer just to get pip before running the server, but why reinvent the wheel.

The check on Add python.exe to PATH is needed so that pip can be run without having to make any assumption on the installation path.

Also since I am now asking people to install python, I decided to go all the way and use an approach similar to what automatic1111's webui.bat does for stable diffusion.
This batch script will create/reuse a virtual environment in a folder venv in the same directory as the script and install the required packages in it.

Upgrading from a previous version

Download the desired release files.
Stop the server if it is running
Run the new server files. Make sure to use the same environment variables in run-user.[bat/sh] as before if you had set any
Since v0.6.0 you could also replace the existing server files with the new ones, or point the new ones to reuse the same virtual environment (if you really want to save 100~200MB)

Since v0.6.1 you can also use the OCT_VERSION and OCT_AUTOUPDATE environment variables to have the server update itself automatically to the latest or a specific version. It is still recommended to download the new release files as improvements/bug-fixes/new-features can also be added to the launch and settings scripts.

What happens to my installation and database in an upgrade

The database will be automatically migrated to the new version if needed. Any existing data will be preserved.
The plugins and their dependencies will be left unchanged or upgraded as needed.
Models will be reused

NOTE: Attempting to reuse the plugins if switching between different python versions will likely cause problems. If you plan to use different python versions, it is recommended to point to a different OCT_BASE_DIR or move/delete your current plugin installations.

Can I downgrade to a previous version

It depends... downgrading to a previous version is in general not supported. In particular if there have been changes to the database schema, downgrading them is not automated in this project. In that case, you would need to either start from a new database or use a backed up from the target version (or a previous one as upgrading is supported). Check the labeled PRs for which releases contains database migrations (label might be missing before v0.4.0).

Contributing

Suggestions/Ideas are always welcome and can be posted as discussions.
You can also just propose a new model to be tested/added to the ones available by default.
Bugs can be reported as issues
Code contributions as pull requests. Check the documentation for more information.

Plugins

The server is designed to only offer the basic functionalities, while the models that can be used and how they are used are defined by plugins.

See the documentation for a list of available plugins

Notes

When switching the server between CPU/CUDA mode for the first time, run the installation of the plugins again to make sure the scope-specific dependencies are installed.
Different plugins will make different types of models available:
- BOX Model: EasyOcr, PaddleOCR
- OCR Model: PaddleOCR, Tesseract, HuggingFace
- Translation Model: HuggingFace, GoogleTranslate, Ollama
Also some plugins might requires additional tools to be installed on the server and possibly some environment variable configured. Refer to the plugin documentation and the information the the tooltip shown by hovering the question mark next to the plugin name.

Troubleshooting

Related to https://github.com/Crivella/ocr_extension/issues/5 If a plugins fails to install, either via an error message you see in the server window or by not having the models available, try the following:
- Uninstall the plugin in question by deselecting it in the popup menu and clicking submit.
- Nuke the entire plugin installation by:
  - Stopping the server
  - Delete the plugins directory and the plugins.json file under $OCT_BASE_DIR (default to $HOME/.ocr_translate on Linux and %userprofile%\.ocr_translate on Windows)
  - Restart the server

If all else fails, please open an issue on the backend server possibly attaching the DEBUG log of the server (run the server by setting the environment variable DJANGO_LOG_LEVEL=DEBUG in your run-user.[sh/bat] script).

Possible problems

Issue 25 Using uBlock origin (or possibly other extension capable of blocking content) could stop the extension from sending requests to the server. This can be recognized if the popup for setting the language and models works fine but then the translations fails without producing any new log in the server windows. (WIP long term fix in the extension)
Issue 27 Having non latin characters in the model's path can cause HuggingFace transformers to fail loading them

Project details

These details have not been verified by PyPI

Project links

Source

Release history Release notifications | RSS feed

This version

0.7.3

Sep 29, 2025

0.7.2

Sep 20, 2025

0.7.1

Sep 20, 2025

0.7.0

Sep 20, 2025

0.6.3

Jul 16, 2025

0.6.2

Mar 11, 2025

0.6.1

Feb 11, 2025

0.6.0

Aug 19, 2024

0.5.1

Dec 17, 2023

0.4.0

Oct 28, 2023

0.3.2

Oct 9, 2023

0.3.1

Oct 9, 2023

0.3.0

Oct 9, 2023

0.2.4

Oct 4, 2023

0.2.1

Sep 20, 2023

0.2.0

Sep 17, 2023

0.1.4

Jul 30, 2023

0.1.3

Jul 28, 2023

0.1.2

Jul 19, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

django_ocr_translate-0.7.3.tar.gz (1.2 MB view details)

Uploaded Sep 29, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

django_ocr_translate-0.7.3-py3-none-any.whl (1.2 MB view details)

Uploaded Sep 29, 2025 Python 3

File details

Details for the file django_ocr_translate-0.7.3.tar.gz.

File metadata

Download URL: django_ocr_translate-0.7.3.tar.gz
Upload date: Sep 29, 2025
Size: 1.2 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: python-requests/2.32.5

File hashes

Hashes for django_ocr_translate-0.7.3.tar.gz
Algorithm	Hash digest
SHA256	`ca03ccf56be795821ce174d27b1a5ea35c71f45f3d6950425d28df69386a036e`
MD5	`cfbed6b19f1993d78163890199547409`
BLAKE2b-256	`8cdd50aa3b10dce59804ed7986b0131936d320126ea12e696348c9d53449964d`

See more details on using hashes here.

File details

Details for the file django_ocr_translate-0.7.3-py3-none-any.whl.

File metadata

Download URL: django_ocr_translate-0.7.3-py3-none-any.whl
Upload date: Sep 29, 2025
Size: 1.2 MB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: python-requests/2.32.5

File hashes

Hashes for django_ocr_translate-0.7.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`b5f0b46b647dff0b9d4f1f7e543a5bf6ddd11471592557f5095ad878df5ede0f`
MD5	`fb1da79f126129108ed361a2827804d9`
BLAKE2b-256	`8bf1ba9f87116ae0b0620e9a21ca232c24ff5fd34caab3db75fbab7fb8e9617f`

See more details on using hashes here.

django-ocr_translate 0.7.3

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

OCR_translate

Running the server

Why do I need to install python

Upgrading from a previous version

What happens to my installation and database in an upgrade

Can I downgrade to a previous version

Contributing

Plugins

Notes

Troubleshooting

Possible problems

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes