Skip to main content

Simple script for extracting business data from PDFs.

Project description

bizextract

Simple script for extracting business data from PDFs.

Requirements

Installation (with pip)

  1. pip install bizextract

Usage

From a terminal or command prompt:

> bizextract -h
usage: bizextract - a tool to extract business data from PDF reports. [-h] [--pattern PATTERN] output

positional arguments:
  output             The output path (or file) to write the results. Content will be a CSV file.

options:
  -h, --help         show this help message and exit
  --pattern PATTERN  A file name pattern to limit the files parsed by the tool.

Please note the pattern option can be used to select files for parsing. The default is to search the current folder for the common file name pattern given when downloading the reports. When all else fails just add only the reports to a folder and change directories in the terminal to that folder, then run the tool like below in the example.

Example

> bizextract --pattern *.pdf output.csv

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

bizextract-0.2.0.tar.gz (4.4 kB view details)

Uploaded Source

Built Distribution

bizextract-0.2.0-py3-none-any.whl (5.0 kB view details)

Uploaded Python 3

File details

Details for the file bizextract-0.2.0.tar.gz.

File metadata

  • Download URL: bizextract-0.2.0.tar.gz
  • Upload date:
  • Size: 4.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.11.3

File hashes

Hashes for bizextract-0.2.0.tar.gz
Algorithm Hash digest
SHA256 f438b6e401cb0ce5bd7143b28255bd1dd51d99d538526a1f39a37ce6b04f24da
MD5 fb8007c03410e01aafb4e3946e4609b1
BLAKE2b-256 71dd33ead7ddb257dfa8751093e726c3746567400713370625ea0f3763631899

See more details on using hashes here.

File details

Details for the file bizextract-0.2.0-py3-none-any.whl.

File metadata

  • Download URL: bizextract-0.2.0-py3-none-any.whl
  • Upload date:
  • Size: 5.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.11.3

File hashes

Hashes for bizextract-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 7f89d2d5ff83b0f94f3f2881dff50b701ebbda9d21bbbcf585b9ac8eb153204c
MD5 4f232d0c97cd6d540a661754009f1b66
BLAKE2b-256 2bb6c6c90f455b94253dc399f44619c2f3b54238f6434f28497a75fda42d81b6

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page