Integrated document processing with machine learning.
Project description
doc2data
About doc2data
doc2data is a Python library that provides functionality to train deep learning models for various document processing tasks.
Currently, models can be trained for four tasks:
- Page rotation
- Page cropping
- Document (multi-page) classification
- Token classification
Please note that doc2data is currently in a prototype stage.
Installation
pip install doc2data
Documentation
The documentation can be found here.
License
doc2data
is distributed under the terms of the Apache-2.0 license.
Credits
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
doc2data-0.2.0.tar.gz
(18.1 kB
view hashes)
Built Distribution
doc2data-0.2.0-py3-none-any.whl
(18.9 kB
view hashes)