Extract tabular data from images and scanned PDFs
Project description
Overview
ExtractTable - API to extract tabular data from images and scanned PDFs
The motivation is to make it easy for developers to extract tabular data from images or scanned PDF files without worrying about the table area, column coordinates, rotation et al.
Prerequisite
Before we talk/boast about the service, a developer MUST need an API key to use the ExtractTable service. FREE credits here.
We beat this market not just in accuracy also in cost, and expiration. You are most welcomed to BUY credits here or email me at saradhi@extracttable.com for assistance.
Installation
pip install -U ExtractTable
Basic Usage
Ok, enough selling. Let the ease in coding do the talk, and the output encourages you to buy credits - put that timer on and count the LOC.
from ExtractTable import *
et_sess = ExtractTable(api_key=YOUR_API_KEY) # Replace your VALID API Key here
print(et_sess.check_usage()) # Checks the API Key validity as well as shows associated plan usage
table_data = et_sess.process_file(filepath=Location_of_Image_with_Tables, output_format="df")
Woahh, as simple as that ?!
Certainly. Do you know the current ExtractTable users use it on
- Bank Statement
- Medical Records
- Invoice Details
- Tax forms
Its up to you now to explore the ways.
Explore
Whatelse is in the store.
ExtractTable._OUTPUT
- check the list of available output formatset_sess.ServerResponse.json()
- check the latest Actual ServerResponse attached to the session
Pull Requests & Rewards
Pull requests are most welcome and greatly appreciated with API credits.
License
This project is licensed under the Apache License 2.0, see the LICENSE file for details.
Social Media
Follow us on Social media for library updates and free credits.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for ExtractTable-1.2.0-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 544d97f470561e319c5f8393eb29fc933d651be30635bb5972358aa2ed7ddc1d |
|
MD5 | ff4779c3dec62ed3188e78f5e3a0ee59 |
|
BLAKE2b-256 | 345fc18072acc37e1b95fa2142236e3f06a084169cabe2f6992bddf5c2befe85 |