Skip to main content

Extract excel form content into structured data

Project description

Excel extractor

Extract excel form content into structured data.

Usage

  1. SECCF extraction: supplier export control classification Form/declaration
from py_excel_form_extractor import extractor, go

company_names = extractor.CompanyNameList()  # the company name which can be mentioned in the file
for company_name in ["Amazon", "Amazon Ltd"]:
    company_names.append(company_name)

extr = extractor.make_seccf_extractor("Example.xlsx", company_names)
extraction = extr.extract()

# convert to JSON string
extr_json = extr.to_json()

BUILD

  1. Building the go binary
go build -o gobinary ./cmd/excelExtractor
  1. Running the program without building the binary
go build -o ./bin/excel-extrator ./cmd/excelFormExtractor/main.go
  1. Run the binary:
./bin/excel-extrator

Local Python bindings generation and installation

pip3 install pybindgen wheel
gopy build --output=py_excel_form_extractor -vm=python3 ./pkg/*
RELEASE_VERSION=YOUR_UPDATED_PACKAGE_VERSION python3 setup.py bdist_wheel --force

# install wheel file
wheel_file=$(ls dist/*.whl | head -n1); pip3 install $wheel_file

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

py_excel_form_extractor-0.3.41-cp313-cp313-manylinux_2_17_aarch64.manylinux2014_aarch64.whl (5.9 MB view details)

Uploaded CPython 3.13manylinux: glibc 2.17+ ARM64

py_excel_form_extractor-0.3.41-cp313-cp313-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl (6.3 MB view details)

Uploaded CPython 3.13manylinux: glibc 2.17+ x86-64manylinux: glibc 2.5+ x86-64

py_excel_form_extractor-0.3.41-cp312-cp312-manylinux_2_17_aarch64.manylinux2014_aarch64.whl (5.9 MB view details)

Uploaded CPython 3.12manylinux: glibc 2.17+ ARM64

py_excel_form_extractor-0.3.41-cp312-cp312-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl (6.3 MB view details)

Uploaded CPython 3.12manylinux: glibc 2.17+ x86-64manylinux: glibc 2.5+ x86-64

py_excel_form_extractor-0.3.41-cp312-cp312-macosx_11_0_arm64.whl (3.6 MB view details)

Uploaded CPython 3.12macOS 11.0+ ARM64

py_excel_form_extractor-0.3.41-cp311-cp311-manylinux_2_17_aarch64.manylinux2014_aarch64.whl (5.9 MB view details)

Uploaded CPython 3.11manylinux: glibc 2.17+ ARM64

py_excel_form_extractor-0.3.41-cp311-cp311-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl (6.3 MB view details)

Uploaded CPython 3.11manylinux: glibc 2.17+ x86-64manylinux: glibc 2.5+ x86-64

py_excel_form_extractor-0.3.41-cp311-cp311-macosx_11_0_arm64.whl (3.6 MB view details)

Uploaded CPython 3.11macOS 11.0+ ARM64

File details

Details for the file py_excel_form_extractor-0.3.41-cp313-cp313-manylinux_2_17_aarch64.manylinux2014_aarch64.whl.

File metadata

File hashes

Hashes for py_excel_form_extractor-0.3.41-cp313-cp313-manylinux_2_17_aarch64.manylinux2014_aarch64.whl
Algorithm Hash digest
SHA256 fc2df3e24c3070abe711c8286a0f19f0a821ecc5d786555b36da742da5884f0b
MD5 0280ee67fbd7fe4d17432a7f047b4180
BLAKE2b-256 60ed216ed3eca0e2e1e59e0980453bf8fc5f139424a9e72c5e3f4a50b933f02c

See more details on using hashes here.

File details

Details for the file py_excel_form_extractor-0.3.41-cp313-cp313-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for py_excel_form_extractor-0.3.41-cp313-cp313-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 2aebe71655031604a910d1f0148573cdf3f2e628a421291ed9854e9334a9d3d1
MD5 e1cac1866c5862f56e755d88d2dbf550
BLAKE2b-256 6810397ab6a1fe9fb33188754d9447baa66e96bc3d6d7369554a203f7f53bf3d

See more details on using hashes here.

File details

Details for the file py_excel_form_extractor-0.3.41-cp312-cp312-manylinux_2_17_aarch64.manylinux2014_aarch64.whl.

File metadata

File hashes

Hashes for py_excel_form_extractor-0.3.41-cp312-cp312-manylinux_2_17_aarch64.manylinux2014_aarch64.whl
Algorithm Hash digest
SHA256 b630b3479c574fea4aa8a8d92f296dfca3598477e557d4679849bcea1fa4ae10
MD5 66ddb701ce3a0a5345dbeb8b47f54ad7
BLAKE2b-256 f9711928e26df1c2a4242720afed72b22378bb081bc1a00a3cb63c751832559c

See more details on using hashes here.

File details

Details for the file py_excel_form_extractor-0.3.41-cp312-cp312-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for py_excel_form_extractor-0.3.41-cp312-cp312-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 6268bc8fe4b6605ff3170107bf1d46c4bce78c0a32ea98f96bd93c1b0d84b6bf
MD5 2ebbad40f6316e8173608488bcef7393
BLAKE2b-256 e186d25f68cf241b274cff1a772af60d750e032f7d4c323c99c979ddbc52e097

See more details on using hashes here.

File details

Details for the file py_excel_form_extractor-0.3.41-cp312-cp312-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for py_excel_form_extractor-0.3.41-cp312-cp312-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 fb1810ae2898e7618e021ac065b7bb5ba5208787eaf31b8219896a7c7934703b
MD5 306b90871660d8b0668c0e9cb36f2f74
BLAKE2b-256 ec03a964643654460335d20a223d6dfd1bdeea6267e5029d50b0954bcbc32610

See more details on using hashes here.

File details

Details for the file py_excel_form_extractor-0.3.41-cp311-cp311-manylinux_2_17_aarch64.manylinux2014_aarch64.whl.

File metadata

File hashes

Hashes for py_excel_form_extractor-0.3.41-cp311-cp311-manylinux_2_17_aarch64.manylinux2014_aarch64.whl
Algorithm Hash digest
SHA256 81023c07d1869f50bef2cb0d673d25067c08fae1a81d74325aaa949863d512e3
MD5 7ed373e1639ec1a9106c38c5cbe8d248
BLAKE2b-256 885e4794500b578d8f7245ff3c090ebf6afc587e62540fb3a7f57833f8746197

See more details on using hashes here.

File details

Details for the file py_excel_form_extractor-0.3.41-cp311-cp311-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for py_excel_form_extractor-0.3.41-cp311-cp311-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 ccaba8ae23bb95f5b7b1b88c2e0b6bb72d331905fcade7fe0f09ebc98d8cd89f
MD5 d2213efe470d13aee6863b77df1c0421
BLAKE2b-256 f9a0b70f266ae10ca58f4a5d5d815951e3850858174c9b51c65b4ba0ae0605ba

See more details on using hashes here.

File details

Details for the file py_excel_form_extractor-0.3.41-cp311-cp311-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for py_excel_form_extractor-0.3.41-cp311-cp311-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 ebc5d645729abfb451d54f27d80ffd33e26547320c6db8b24eb3ef240b88e3f2
MD5 92ea85735f0cd751b9f2ee2f19898d14
BLAKE2b-256 83da71cba22e2d85a9d29cb2b67084ef0e638fb97012dd5fe45188512815a9d7

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page