Skip to main content

Run character recognition on uploaded files

Project description

License: AGPL-3

OCR for documents

This module was written to make uploaded documents, for example scans, searchable by running OCR on them.

It supports all image formats Pillow supports for reading and PDFs.

Installation

To install this module, you need to:

  1. install tesseract and the language(s) your documents use

  2. if you want to support OCR on PDFs, install imagemagick

  3. install the module itself

On an Debian or Ubuntu system you would typically run:

$ sudo apt-get install tesseract-ocr imagemagick

Configuration

To configure this module, go to:

  1. Settings/Technical/Parameters/System parameters and review the parameters with names document_ocr.*

Usage

By default, character recognition is done asynchronously by a cronjob at night. This is because the recognition process takes a while and you don’t want to make your users wait for the indexation to finish. The interval to run the cronjob can be adjusted to your needs in the Scheduled Actions menu, under ` Settings`. In case you want to force the OCR to be done immediately, set configuration parameter document_ocr.synchronous to value True.

Try me on Runbot

Bug Tracker

Bugs are tracked on GitHub Issues. In case of trouble, please check there if your issue has already been reported. If you spotted it first, help us smashing it by providing a detailed and welcomed feedback.

Credits

The actual work

Images

  • Odoo Community Association: Icon.

Contributors

Do not contact contributors directly about help with questions or problems concerning this addon, but use the community mailing list or the appropriate specialized mailinglist for help, and the bug tracker linked in Bug Tracker above for technical issues.

Maintainer

Odoo Community Association

This module is maintained by the OCA.

OCA, or the Odoo Community Association, is a nonprofit organization whose mission is to support the collaborative development of Odoo features and promote its widespread use.

To contribute to this module, please visit https://odoo-community.org.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

File details

Details for the file odoo8_addon_document_ocr-8.0.1.0.0-py2-none-any.whl.

File metadata

File hashes

Hashes for odoo8_addon_document_ocr-8.0.1.0.0-py2-none-any.whl
Algorithm Hash digest
SHA256 7c03df8e1e9c5f9a1e63289ae2434e892eeb887319ba7ef2ba6f13206f5172b3
MD5 a36705e8746dc31762708d9bcf97800a
BLAKE2b-256 ab2bf6ecce603c8d9d0774e737c9db43e567e28c4128bf18f1bec88294890d58

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page