Skip to main content

Teklia DAN

Project description

DAN: a Segmentation-free Document Attention Network for Handwritten Document Recognition

Python >= 3.10

For more details about this package, make sure to see the documentation available at https://atr.pages.teklia.com/dan/.

This is an open-source project, licensed using the CeCILL-C license.

Inference

To apply DAN to an image, one needs to first add a few imports and to load an image. Note that the image should be in RGB.

import cv2
from dan.ocr.predict.inference import DAN

image = cv2.cvtColor(cv2.imread(IMAGE_PATH), cv2.COLOR_BGR2RGB)

Then one can initialize and load the trained model with the parameters used during training. The directory passed as parameter should have:

  • a model.pt file,
  • a charset.pkl file,
  • a parameters.yml file corresponding to the inference_parameters.yml file generated during training.
from pathlib import Path

model_path = Path("models")

model = DAN("cpu")
model.load(model_path, mode="eval")

To run the inference on a GPU, one can replace cpu by the name of the GPU. In the end, one can run the prediction:

from pathlib import Path
from dan.utils import parse_charset_pattern

# Load image
image_path = "images/page.jpg"
_, image = dan_model.preprocess(str(image_path))

input_tensor = image.unsqueeze(0)
input_tensor = input_tensor.to("cpu")
input_sizes = [image.shape[1:]]

# Predict
text, confidence_scores = model.predict(
    input_tensor,
    input_sizes,
    char_separators=parse_charset_pattern(dan_model.charset),
    confidences=True,
)

Training

This package provides three subcommands. To get more information about any subcommand, use the --help option.

Get started

See the dedicated page on the official DAN documentation.

Data extraction from Arkindex

See the dedicated page on the official DAN documentation.

Model training

See the dedicated page on the official DAN documentation.

Model prediction

See the dedicated page on the official DAN documentation.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

atr_dan-0.2.0rc11.tar.gz (101.9 kB view details)

Uploaded Source

Built Distribution

atr_dan-0.2.0rc11-py3-none-any.whl (125.5 kB view details)

Uploaded Python 3

File details

Details for the file atr_dan-0.2.0rc11.tar.gz.

File metadata

  • Download URL: atr_dan-0.2.0rc11.tar.gz
  • Upload date:
  • Size: 101.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.10.15

File hashes

Hashes for atr_dan-0.2.0rc11.tar.gz
Algorithm Hash digest
SHA256 87f23be6d8187a490fac48dc6905c17034740fade27da8f2b16a2bd52fec9663
MD5 7dbfc01dc6cc24f8a394daf7547ebf5e
BLAKE2b-256 e444edbabbc6ef7d844c82989f96e9ff9969e8222bd7d70ccd885030a1a9bb14

See more details on using hashes here.

File details

Details for the file atr_dan-0.2.0rc11-py3-none-any.whl.

File metadata

  • Download URL: atr_dan-0.2.0rc11-py3-none-any.whl
  • Upload date:
  • Size: 125.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.10.15

File hashes

Hashes for atr_dan-0.2.0rc11-py3-none-any.whl
Algorithm Hash digest
SHA256 7793aa098e5ffb2f945a05f818fe8a0c49ddfe56c0c74bc2f05990e8482e9ec5
MD5 183139e579e8ea4ab3f4c7d528d316b1
BLAKE2b-256 10245af62b9ecddac5ec3c9a302c4547f35fbb8d4a8b5dee3d954946b1978482

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page