Extract Excel form content into structured data

Project description

Excel extractor

Extract Excel form content into structured data.

Usage

  1. SECCF extraction: Supplier Export Control Classification Form/declaration
from py_excel_form_extractor import extractor, go

company_names = extractor.CompanyNameList()  # company names that may be mentioned in the file
for company_name in ["Amazon", "Amazon Ltd"]:
    company_names.append(company_name)

extr = extractor.make_seccf_extractor("Example.xlsx", company_names)
extraction = extr.extract()

# convert to JSON string
extr_json = extr.to_json()
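
The JSON string returned by to_json() can be processed with the Python standard library. The following is a minimal sketch and not part of the package: save_extraction is a hypothetical helper, and sample_json is an invented stand-in, since the fields the extractor actually emits are not documented here.

```python
import json
import tempfile
from pathlib import Path
from typing import Any


def save_extraction(extr_json: str, out_path: Path) -> Any:
    """Parse a JSON string and write it back pretty-printed (hypothetical helper)."""
    # json.loads raises json.JSONDecodeError if the string is malformed.
    data = json.loads(extr_json)
    out_path.write_text(
        json.dumps(data, indent=2, ensure_ascii=False), encoding="utf-8"
    )
    return data


# Invented stand-in for the value of extr.to_json(); the real schema is an assumption.
sample_json = '{"supplier": "Amazon Ltd", "items": []}'

with tempfile.TemporaryDirectory() as tmp_dir:
    out_file = Path(tmp_dir) / "extraction.json"
    parsed = save_extraction(sample_json, out_file)
    # Read the file back to confirm that the written JSON is still valid.
    round_trip = json.loads(out_file.read_text(encoding="utf-8"))

print(parsed == round_trip)
```

In real use, pass extr_json from the snippet above in place of sample_json and choose a permanent output path.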

BUILD

  1. Build the Go binary:
go build -o gobinary ./cmd/excelExtractor
  2. Build the binary into ./bin from the main.go entry point:
go build -o ./bin/excel-extrator ./cmd/excelFormExtractor/main.go
  3. Run the binary:
./bin/excel-extrator

Local Python bindings generation and installation

pip3 install pybindgen wheel
gopy build --output=py_excel_form_extractor -vm=python3 ./pkg/*
RELEASE_VERSION=YOUR_UPDATED_PACKAGE_VERSION python3 setup.py bdist_wheel --force

# install wheel file
wheel_file=$(ls dist/*.whl | head -n1); pip3 install "$wheel_file"

Download files

Source Distributions

No source distribution files are available for this release.

Built Distributions

py_excel_form_extractor-0.3.32-cp312-cp312-manylinux_2_17_aarch64.manylinux2014_aarch64.whl (5.9 MB)

Uploaded: CPython 3.12, manylinux: glibc 2.17+ ARM64

py_excel_form_extractor-0.3.32-cp312-cp312-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl (6.3 MB)

Uploaded: CPython 3.12, manylinux: glibc 2.17+ x86-64, manylinux: glibc 2.5+ x86-64

py_excel_form_extractor-0.3.32-cp311-cp311-manylinux_2_17_aarch64.manylinux2014_aarch64.whl (5.9 MB)

Uploaded: CPython 3.11, manylinux: glibc 2.17+ ARM64

py_excel_form_extractor-0.3.32-cp311-cp311-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl (6.3 MB)

Uploaded: CPython 3.11, manylinux: glibc 2.17+ x86-64, manylinux: glibc 2.5+ x86-64
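
The wheel file names above follow the standard layout distribution-version-python tag-abi tag-platform tag, with an optional build tag after the version and "." separating alternatives inside a tag. The sketch below splits a file name into those parts to help pick the right file for a platform. WheelName and parse_wheel_filename are illustrative names written for this page, not part of this package; for production use, the packaging library's packaging.utils.parse_wheel_filename is the more robust choice.

```python
from typing import List, NamedTuple


class WheelName(NamedTuple):
    distribution: str
    version: str
    python_tags: List[str]
    abi_tags: List[str]
    platform_tags: List[str]


def parse_wheel_filename(filename: str) -> WheelName:
    """Split a wheel file name into its components (illustrative helper)."""
    if not filename.endswith(".whl"):
        raise ValueError(f"not a wheel file name: {filename!r}")
    parts = filename[: -len(".whl")].split("-")
    if len(parts) == 5:
        dist, version, py_tag, abi_tag, plat_tag = parts
    elif len(parts) == 6:
        # The optional build tag sits between the version and the Python tag.
        dist, version, _build, py_tag, abi_tag, plat_tag = parts
    else:
        raise ValueError(f"unexpected wheel file name layout: {filename!r}")
    # Each tag may hold several "."-separated alternatives.
    return WheelName(
        dist, version, py_tag.split("."), abi_tag.split("."), plat_tag.split(".")
    )


wheel = parse_wheel_filename(
    "py_excel_form_extractor-0.3.32-cp312-cp312-"
    "manylinux_2_17_aarch64.manylinux2014_aarch64.whl"
)
print(wheel.version, wheel.python_tags, wheel.platform_tags)
```

Here cp312 identifies CPython 3.12 for both the Python and ABI tags, and the two platform tags are the newer and the legacy spelling of the same glibc 2.17 ARM64 target.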

File details

Details for the file py_excel_form_extractor-0.3.32-cp312-cp312-manylinux_2_17_aarch64.manylinux2014_aarch64.whl.

File hashes

Hashes for py_excel_form_extractor-0.3.32-cp312-cp312-manylinux_2_17_aarch64.manylinux2014_aarch64.whl
Algorithm    Hash digest
SHA256       7f90c8373be8254709c7f5b3076e1f233b3e70528c4b2fe551c65bb1c7fa2b41
MD5          f9422d5a5ae0fc9a8a923a0a34844d5a
BLAKE2b-256  6efa8700cfb93e145787622dff12f2bb96c1f75df00f46bda512c0163738d79c

File details

Details for the file py_excel_form_extractor-0.3.32-cp312-cp312-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File hashes

Hashes for py_excel_form_extractor-0.3.32-cp312-cp312-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm    Hash digest
SHA256       c5658fd9a2abd9b80b4e8718bb875da6803595bc57124592ae592584cfb54b1f
MD5          2369d6df89d75ac76c1d63a5fc64d832
BLAKE2b-256  dfc3ee824f337b2235a7888cf33afb0a6377989ab6eb8af1a2972cf6ef6e61ba

File details

Details for the file py_excel_form_extractor-0.3.32-cp311-cp311-manylinux_2_17_aarch64.manylinux2014_aarch64.whl.

File hashes

Hashes for py_excel_form_extractor-0.3.32-cp311-cp311-manylinux_2_17_aarch64.manylinux2014_aarch64.whl
Algorithm    Hash digest
SHA256       c4e9548a87db64e29d348df5d3e848cad32a793cd5fedc24a8eebab8b82feccd
MD5          d779eed17d097e5aa91edfe702748257
BLAKE2b-256  a06ace933de779948b67853961e0e4c0a58e8c9bb8ce6c031d8bb550345a5309

File details

Details for the file py_excel_form_extractor-0.3.32-cp311-cp311-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File hashes

Hashes for py_excel_form_extractor-0.3.32-cp311-cp311-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm    Hash digest
SHA256       fc9d9cbe65a330f485e81803c2098832274c6837db2384f00cbe6e792d5c8c6f
MD5          3ca47bc1621c5feb603b3d8fe2c6c51a
BLAKE2b-256  d32d0a05da795c4fefe6bdfc0704bc7f3d55b38530f2a72ab59566f799f16f66
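
A downloaded wheel can be checked against the SHA256 digests listed above before installing it. The following is a minimal sketch using only the standard library; the digests are copied from this page, while PUBLISHED_SHA256, sha256_of and verify_wheel are illustrative names and not part of the package.

```python
import hashlib
import sys
from pathlib import Path

# SHA256 digests as listed in the "File hashes" entries on this page.
PUBLISHED_SHA256 = {
    "py_excel_form_extractor-0.3.32-cp312-cp312-manylinux_2_17_aarch64.manylinux2014_aarch64.whl":
        "7f90c8373be8254709c7f5b3076e1f233b3e70528c4b2fe551c65bb1c7fa2b41",
    "py_excel_form_extractor-0.3.32-cp312-cp312-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl":
        "c5658fd9a2abd9b80b4e8718bb875da6803595bc57124592ae592584cfb54b1f",
    "py_excel_form_extractor-0.3.32-cp311-cp311-manylinux_2_17_aarch64.manylinux2014_aarch64.whl":
        "c4e9548a87db64e29d348df5d3e848cad32a793cd5fedc24a8eebab8b82feccd",
    "py_excel_form_extractor-0.3.32-cp311-cp311-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl":
        "fc9d9cbe65a330f485e81803c2098832274c6837db2384f00cbe6e792d5c8c6f",
}


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_wheel(path: Path) -> bool:
    """Compare a wheel's SHA256 digest with the published one for its file name."""
    expected = PUBLISHED_SHA256.get(path.name)
    if expected is None:
        raise KeyError(f"no published digest for {path.name!r}")
    return sha256_of(path) == expected


if __name__ == "__main__" and len(sys.argv) > 1:
    wheel_path = Path(sys.argv[1])
    print("OK" if verify_wheel(wheel_path) else "MISMATCH", wheel_path.name)
```

Saved under a file name of your choice, the script takes the path to the downloaded wheel as its only argument. pip can enforce the same check at install time through the --hash option in a requirements file.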
