Skip to main content

It extract the table data from the pdf

Project description

TableParser

tableParser is a python program using the concept of deep learning for extract the data from tables present in the PDF. It basically takes the file name of the pdf and extrcts the tables from the PDF. This is usefull when we need tables of the reserch papers or some thing like long table. This program uses the pytorch framework (made by facebook) and tesseract for parsing the table data.

This is an Intital stage of the project. It have some bugs hopefully later release will fix this issue. 😊

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tableParser-1.0.1.tar.gz (2.9 kB view hashes)

Uploaded Source

Built Distribution

tableParser-1.0.1-py3-none-any.whl (4.3 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page