Skip to main content

Teklia DAN

Project description

DAN: a Segmentation-free Document Attention Network for Handwritten Document Recognition

Python >= 3.10

For more details about this package, make sure to see the documentation available at https://atr.pages.teklia.com/dan/.

This is an open-source project, licensed using the CeCILL-C license.

Inference

To apply DAN to an image, one needs to first add a few imports and to load an image. Note that the image should be in RGB.

import cv2
from dan.ocr.predict.inference import DAN

image = cv2.cvtColor(cv2.imread(IMAGE_PATH), cv2.COLOR_BGR2RGB)

Then one can initialize and load the trained model with the parameters used during training. The directory passed as parameter should have:

  • a model.pt file,
  • a charset.pkl file,
  • a parameters.yml file corresponding to the inference_parameters.yml file generated during training.
from pathlib import Path

model_path = Path("models")

model = DAN("cpu")
model.load(model_path, mode="eval")

To run the inference on a GPU, one can replace cpu by the name of the GPU. In the end, one can run the prediction:

from pathlib import Path
from dan.utils import parse_charset_pattern

# Load image
image_path = "images/page.jpg"
_, image = dan_model.preprocess(str(image_path))

input_tensor = image.unsqueeze(0)
input_tensor = input_tensor.to("cpu")
input_sizes = [image.shape[1:]]

# Predict
text, confidence_scores = model.predict(
    input_tensor,
    input_sizes,
    char_separators=parse_charset_pattern(dan_model.charset),
    confidences=True,
)

Training

This package provides three subcommands. To get more information about any subcommand, use the --help option.

Get started

See the dedicated page on the official DAN documentation.

Data extraction from Arkindex

See the dedicated page on the official DAN documentation.

Model training

See the dedicated page on the official DAN documentation.

Model prediction

See the dedicated page on the official DAN documentation.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

atr_dan-0.2.0rc8.tar.gz (95.7 kB view details)

Uploaded Source

Built Distribution

atr_dan-0.2.0rc8-py3-none-any.whl (116.3 kB view details)

Uploaded Python 3

File details

Details for the file atr_dan-0.2.0rc8.tar.gz.

File metadata

  • Download URL: atr_dan-0.2.0rc8.tar.gz
  • Upload date:
  • Size: 95.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.0 CPython/3.10.14

File hashes

Hashes for atr_dan-0.2.0rc8.tar.gz
Algorithm Hash digest
SHA256 803f4ea547bc67a0666a94d00afdcd9cbc5185572c0f15e4fd1c8a74ca00fdbe
MD5 6ef4b98390a3b91d22ef1821fab4e6d2
BLAKE2b-256 b2e99ed8c2bb2385048bbe0669ed437cf7b59026f187e8713d61f12b9d86949f

See more details on using hashes here.

File details

Details for the file atr_dan-0.2.0rc8-py3-none-any.whl.

File metadata

  • Download URL: atr_dan-0.2.0rc8-py3-none-any.whl
  • Upload date:
  • Size: 116.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.0 CPython/3.10.14

File hashes

Hashes for atr_dan-0.2.0rc8-py3-none-any.whl
Algorithm Hash digest
SHA256 965bbc9a0208706e2df89dc61d485b9b86719c9d36882dab8ac9237acb36fc86
MD5 49d4f9139f7ec91a8addba4f8f8d3043
BLAKE2b-256 4e64e7a4f3bea83ee38e42a8ea4ebcbed6687cb148215cd0022bdde7a4f6eca5

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page