Skip to main content

Extract excel form content into structured data

Project description

Excel extractor

Extract excel form content into structured data.

Usage

  1. SECCF extraction: supplier export control classification Form/declaration
from py_excel_form_extractor import extractor, go

company_names = extractor.CompanyNameList()  # the company name which can be mentioned in the file
for company_name in ["Amazon", "Amazon Ltd"]:
    company_names.append(company_name)

extr = extractor.make_seccf_extractor("Example.xlsx", company_names)
extraction = extr.extract()

# convert to JSON string
extr_json = extr.to_json()

BUILD

  1. Building the go binary
go build -o gobinary ./cmd/excelExtractor
  1. Running the program without building the binary
go build -o ./bin/excel-extrator ./cmd/excelFormExtractor/main.go
  1. Run the binary:
./bin/excel-extrator

Local Python bindings generation and installation

pip3 install pybindgen wheel
gopy build --output=py_excel_form_extractor -vm=python3 ./pkg/*
RELEASE_VERSION=YOUR_UPDATED_PACKAGE_VERSION python3 setup.py bdist_wheel --force

# install wheel file
wheel_file=$(ls dist/*.whl | head -n1); pip3 install $wheel_file

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

py_excel_form_extractor-0.3.27-cp312-cp312-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl (6.3 MB view details)

Uploaded CPython 3.12 manylinux: glibc 2.17+ x86-64 manylinux: glibc 2.5+ x86-64

py_excel_form_extractor-0.3.27-cp311-cp311-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl (6.3 MB view details)

Uploaded CPython 3.11 manylinux: glibc 2.17+ x86-64 manylinux: glibc 2.5+ x86-64

File details

Details for the file py_excel_form_extractor-0.3.27-cp312-cp312-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for py_excel_form_extractor-0.3.27-cp312-cp312-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 795f52bebc1b013771ea531ac9947060f2ddc07b14e1ed1867c2e64ed03651f8
MD5 d947c6f5bc9762eb566835630020f420
BLAKE2b-256 09677b4d5e7d53d6f6f2b4434167f36212959487d7a8f1f8cb2ce68d13561f34

See more details on using hashes here.

File details

Details for the file py_excel_form_extractor-0.3.27-cp311-cp311-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for py_excel_form_extractor-0.3.27-cp311-cp311-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 2c52367ac13885e2f5eb7611a8cc633ec67530a57d3110270f5aa76afe8ead96
MD5 aaa9b9c943c3034697ee2a1b41128cd8
BLAKE2b-256 637edbffb78b8d7e78efcdc973b5d09a82b19c24b630e2a5dd0189b6e280721c

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page