pdftable: extract tables from PDF files
Project description
pdftable is a Python module and command-line utility that analyzes the XML output of the pdftohtml program to extract tables from PDF files and write them out as CSV data.
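To illustrate the general idea, here is a minimal sketch of turning pdftohtml-style XML into CSV. It is not pdftable's actual algorithm or API: the function name xml_to_csv, the row_tolerance parameter, and the sample data are all hypothetical. It assumes XML of the kind produced by running pdftohtml with the -xml option, where each page element contains text elements carrying top and left position attributes.

```python
import csv
import io
import xml.etree.ElementTree as ET

# Hypothetical sample shaped like pdftohtml -xml output (not real program output).
SAMPLE_XML = """<pdf2xml>
<page number="1" top="0" left="0" height="1188" width="918">
<text top="100" left="50" width="40" height="12" font="0">Name</text>
<text top="100" left="200" width="30" height="12" font="0">Qty</text>
<text top="120" left="50" width="40" height="12" font="0">Apple</text>
<text top="121" left="200" width="10" height="12" font="0"><b>3</b></text>
</page>
</pdf2xml>"""


def xml_to_csv(xml_text, row_tolerance=2):
    """Group <text> elements into rows by 'top' and order cells by 'left'.

    row_tolerance is an assumed threshold (in the XML's position units) for
    treating slightly misaligned text elements as part of the same row.
    """
    root = ET.fromstring(xml_text)
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    for page in root.iter("page"):
        # Collect (top, left, text) for every text element, sorted top to bottom.
        cells = sorted(
            (int(t.get("top")), int(t.get("left")), "".join(t.itertext()).strip())
            for t in page.iter("text")
        )
        rows = []  # each entry: (top of the row's first cell, [(left, text), ...])
        for top, left, text in cells:
            if rows and abs(top - rows[-1][0]) <= row_tolerance:
                rows[-1][1].append((left, text))
            else:
                rows.append((top, [(left, text)]))
        for _, row in rows:
            # Order the cells in each row from left to right.
            writer.writerow([text for _, text in sorted(row)])
    return out.getvalue()


if __name__ == "__main__":
    print(xml_to_csv(SAMPLE_XML), end="")
```

This sketch treats every line of text on a page as a table row and does not try to detect table boundaries or align cells into consistent columns, which a real extractor would need to handle.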