bizextract

Simple script for extracting business data from PDFs.

Requirements

Installation (with pip)

  1. pip install bizextract

Usage

From a terminal or command prompt:

> bizextract -h
usage: bizextract - a tool to extract business data from PDF reports. [-h] [--pattern PATTERN] output

positional arguments:
  output             The output path (or file) to write the results. Content will be a CSV file.

options:
  -h, --help         show this help message and exit
  --pattern PATTERN  A file name pattern to limit the files parsed by the tool.
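A command-line interface with the shape shown in the help text could be declared with Python's argparse module roughly as follows. This is only a sketch and not bizextract's actual source: the function name `build_parser` is hypothetical, and `default=None` is a placeholder because the tool's real default pattern is not documented here.

```python
import argparse


def build_parser():
    """Build a parser with one positional argument and one optional pattern."""
    parser = argparse.ArgumentParser(
        prog="bizextract",
        description="A tool to extract business data from PDF reports.",
    )
    parser.add_argument(
        "output",
        help="The output path (or file) to write the results. "
             "Content will be a CSV file.",
    )
    parser.add_argument(
        "--pattern",
        default=None,  # placeholder: the tool's real default is not documented
        help="A file name pattern to limit the files parsed by the tool.",
    )
    return parser


# Parse the same arguments used in the README example.
args = build_parser().parse_args(["--pattern", "*.pdf", "output.csv"])
print(args.pattern, args.output)
```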

The --pattern option selects which files are parsed. By default, the tool searches the current folder for files matching the common file name pattern that the reports are given when they are downloaded. If that does not find your files, place only the reports in a folder, change to that folder in the terminal, and run the tool as shown in the example below.

Example

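> bizextract --pattern "*.pdf" output.csv

The quotes keep shells that expand wildcards from replacing *.pdf with a list of file names before the tool sees it.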
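To illustrate how a file name pattern can limit which files are picked up, here is a minimal sketch that assumes glob-style matching against file names in a single folder. The helper `select_reports` is hypothetical and is not part of bizextract; the tool's actual matching rules may differ.

```python
from fnmatch import fnmatch
from pathlib import Path


def select_reports(folder, pattern="*.pdf"):
    """Return the files in `folder` whose names match `pattern`, sorted by path."""
    return sorted(
        path
        for path in Path(folder).iterdir()
        if path.is_file() and fnmatch(path.name, pattern)
    )
```

Calling `select_reports(".", "*.pdf")` lists every PDF in the current folder, which mirrors the advice above: keep only the reports in one folder and run the tool from there.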

