Skip to main content

Extract tabular data from images and scanned PDFs

Project description

image

image image image

Overview

ExtractTable - API to extract tabular data from images and scanned PDFs

The motivation is to make it easy for developers to extract tabular data from images or scanned PDF files without worrying about the table area, column coordinates, rotation et al.

Prerequisite

Before we talk/boast about the service, a developer MUST need an API key to use the ExtractTable service. FREE credits here.

We beat this market not just in accuracy also in cost, and expiration. You are most welcomed to BUY credits here or email me at saradhi@extracttable.com for assistance.

Installation

pip install -U ExtractTable

Basic Usage

Ok, enough selling. Let the ease in coding do the talk, and the output encourages you to buy credits - put that timer on and count the LOC.

from ExtractTable import *
et_sess = ExtractTable(api_key=YOUR_API_KEY)        # Replace your VALID API Key here
print(et_sess.check_usage())        # Checks the API Key validity as well as shows associated plan usage 
table_data = et_sess.process_file(filepath=Location_of_Image_with_Tables, output_format="df")

Detail Code Here

Woahh, as simple as that ?!

Certainly. Do you know the current ExtractTable users use it on

  • Bank Statement
  • Medical Records
  • Invoice Details
  • Tax forms

Its up to you now to explore the ways.

Explore

Whatelse is in the store.

  • ExtractTable._OUTPUT - check the list of available output formats
  • et_sess.ServerResponse.json() - check the latest Actual ServerResponse attached to the session

Pull Requests & Rewards

Pull requests are most welcome and greatly appreciated with API credits.

License

This project is licensed under the Apache License 2.0, see the LICENSE file for details.

Social Media

Follow us on Social media for library updates and free credits.

Image      Image

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ExtractTable-1.2.0.tar.gz (8.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

ExtractTable-1.2.0-py3-none-any.whl (14.3 kB view details)

Uploaded Python 3

File details

Details for the file ExtractTable-1.2.0.tar.gz.

File metadata

  • Download URL: ExtractTable-1.2.0.tar.gz
  • Upload date:
  • Size: 8.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/2.0.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.2.0 requests-toolbelt/0.9.1 tqdm/4.33.0 CPython/3.7.1

File hashes

Hashes for ExtractTable-1.2.0.tar.gz
Algorithm Hash digest
SHA256 15798971c5fc4c831327143dbc51dc20bf42e8692f75fa33a9a24a47abea878d
MD5 dd29a6db03110a315379fe550c344f98
BLAKE2b-256 a5f2526d54ef5416bb70565fbe5db4e0578d0acb2b184589f5bc814c98d8eb10

See more details on using hashes here.

File details

Details for the file ExtractTable-1.2.0-py3-none-any.whl.

File metadata

  • Download URL: ExtractTable-1.2.0-py3-none-any.whl
  • Upload date:
  • Size: 14.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/2.0.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.2.0 requests-toolbelt/0.9.1 tqdm/4.33.0 CPython/3.7.1

File hashes

Hashes for ExtractTable-1.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 544d97f470561e319c5f8393eb29fc933d651be30635bb5972358aa2ed7ddc1d
MD5 ff4779c3dec62ed3188e78f5e3a0ee59
BLAKE2b-256 345fc18072acc37e1b95fa2142236e3f06a084169cabe2f6992bddf5c2befe85

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page