Skip to main content

Integrated document processing with machine learning.

Project description

doc2data

PyPI - Version PyPI - Python Version Code style: black Hatch project


About doc2data

doc2data is a Python library that provides functionality to train deep learning models for various document processing tasks.

Currently, models can be trained for four tasks:

  1. Page rotation
  2. Page cropping
  3. Document (multi-page) classification
  4. Token classification

Please note that doc2data is currently in a prototype stage.

Installation

pip install doc2data

Documentation

The documentation can be found here.

License

doc2data is distributed under the terms of the Apache-2.0 license.

Credits

alt text alt text

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

doc2data-0.2.0.tar.gz (18.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

doc2data-0.2.0-py3-none-any.whl (18.9 kB view details)

Uploaded Python 3

File details

Details for the file doc2data-0.2.0.tar.gz.

File metadata

  • Download URL: doc2data-0.2.0.tar.gz
  • Upload date:
  • Size: 18.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: python-httpx/0.23.0

File hashes

Hashes for doc2data-0.2.0.tar.gz
Algorithm Hash digest
SHA256 362e6f064ed8f0d16ffe5408b250c2e5d6311771abed222286b22cc95070ea61
MD5 82ce1227903e5b85bf61ea410926c206
BLAKE2b-256 c3a3a2e725d87415b830865b72888bf4e431679b834733db4802b54a0e1b10b3

See more details on using hashes here.

File details

Details for the file doc2data-0.2.0-py3-none-any.whl.

File metadata

  • Download URL: doc2data-0.2.0-py3-none-any.whl
  • Upload date:
  • Size: 18.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: python-httpx/0.23.0

File hashes

Hashes for doc2data-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 ad8ca54060e851bcffe6fc8bd12a9a1450fe35a5d4e0750582bfdf0839d66fe6
MD5 4022a009ce15e220ef360c20ea91d8e9
BLAKE2b-256 e8f5346825bfb27a668448c6bf4c4f1a617e1afad7ab1fdabefd68b3a119ebc7

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page