Simple script for extracting business data from PDFs.
Project description
bizextract
Simple script for extracting business data from PDFs.
Requirements
Installation (with pip)
pip install bizextract
Usage
From a terminal or command prompt:
> bizextract -h
usage: bizextract - a tool to extract business data from PDF reports. [-h] [--pattern PATTERN] output
positional arguments:
output The output path (or file) to write the results. Content will be a CSV file.
options:
-h, --help show this help message and exit
--pattern PATTERN A file name pattern to limit the files parsed by the tool.
Please note the pattern option can be used to select files for parsing. The default is to search the current folder for the common file name pattern given when downloading the reports. When all else fails just add only the reports to a folder and change directories in the terminal to that folder, then run the tool like below in the example.
Example
> bizextract --pattern *.pdf output.csv
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
bizextract-0.2.0.tar.gz
(4.4 kB
view hashes)
Built Distribution
Close
Hashes for bizextract-0.2.0-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 7f89d2d5ff83b0f94f3f2881dff50b701ebbda9d21bbbcf585b9ac8eb153204c |
|
MD5 | 4f232d0c97cd6d540a661754009f1b66 |
|
BLAKE2b-256 | 2bb6c6c90f455b94253dc399f44619c2f3b54238f6434f28497a75fda42d81b6 |