Skip to main content

Extract excel form content into structured data

Project description

Excel extractor

Extract excel form content into structured data.

Usage

  1. SECCF extraction: supplier export control classification Form/declaration
from py_excel_form_extractor import extractor, go

company_names = extractor.CompanyNameList()  # the company name which can be mentioned in the file
for company_name in ["Amazon", "Amazon Ltd"]:
    company_names.append(company_name)

extr = extractor.make_seccf_extractor("Example.xlsx", company_names)
extraction = extr.extract()

# convert to JSON string
extr_json = extr.to_json()

BUILD

  1. Building the go binary
go build -o gobinary ./cmd/excelExtractor
  1. Running the program without building the binary
go build -o ./bin/excel-extrator ./cmd/excelFormExtractor/main.go
  1. Run the binary:
./bin/excel-extrator

Local Python bindings generation and installation

pip3 install pybindgen wheel
gopy build --output=py_excel_form_extractor -vm=python3 ./pkg/*
RELEASE_VERSION=YOUR_UPDATED_PACKAGE_VERSION python3 setup.py bdist_wheel --force

# install wheel file
wheel_file=$(ls dist/*.whl | head -n1); pip3 install $wheel_file

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

py_excel_form_extractor-0.3.39-cp313-cp313-manylinux_2_17_aarch64.manylinux2014_aarch64.whl (5.9 MB view details)

Uploaded CPython 3.13manylinux: glibc 2.17+ ARM64

py_excel_form_extractor-0.3.39-cp313-cp313-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl (6.3 MB view details)

Uploaded CPython 3.13manylinux: glibc 2.17+ x86-64manylinux: glibc 2.5+ x86-64

py_excel_form_extractor-0.3.39-cp312-cp312-manylinux_2_17_aarch64.manylinux2014_aarch64.whl (5.9 MB view details)

Uploaded CPython 3.12manylinux: glibc 2.17+ ARM64

py_excel_form_extractor-0.3.39-cp312-cp312-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl (6.3 MB view details)

Uploaded CPython 3.12manylinux: glibc 2.17+ x86-64manylinux: glibc 2.5+ x86-64

py_excel_form_extractor-0.3.39-cp312-cp312-macosx_11_0_arm64.whl (3.6 MB view details)

Uploaded CPython 3.12macOS 11.0+ ARM64

py_excel_form_extractor-0.3.39-cp311-cp311-manylinux_2_17_aarch64.manylinux2014_aarch64.whl (5.9 MB view details)

Uploaded CPython 3.11manylinux: glibc 2.17+ ARM64

py_excel_form_extractor-0.3.39-cp311-cp311-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl (6.3 MB view details)

Uploaded CPython 3.11manylinux: glibc 2.17+ x86-64manylinux: glibc 2.5+ x86-64

py_excel_form_extractor-0.3.39-cp311-cp311-macosx_11_0_arm64.whl (3.6 MB view details)

Uploaded CPython 3.11macOS 11.0+ ARM64

File details

Details for the file py_excel_form_extractor-0.3.39-cp313-cp313-manylinux_2_17_aarch64.manylinux2014_aarch64.whl.

File metadata

File hashes

Hashes for py_excel_form_extractor-0.3.39-cp313-cp313-manylinux_2_17_aarch64.manylinux2014_aarch64.whl
Algorithm Hash digest
SHA256 851ba6f199fe7ff5cb2d767479f79135e820603fb18670902424c3215df275f2
MD5 e451db57cbdaed9852db6943bea4084d
BLAKE2b-256 b41c78a7950d53783c04d339831928911184090e09b2dcae68b88702f8135114

See more details on using hashes here.

File details

Details for the file py_excel_form_extractor-0.3.39-cp313-cp313-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for py_excel_form_extractor-0.3.39-cp313-cp313-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 fd537d582b48479aa7496de28d07784b3389395550674351e6a8a9c19f932c6f
MD5 776f347e422ffca8afbe9c6528c78e68
BLAKE2b-256 e8a9952b34ebdf79708ea4ba6928f4457dbaa6a1d4833d9490d96d78da04a1dc

See more details on using hashes here.

File details

Details for the file py_excel_form_extractor-0.3.39-cp312-cp312-manylinux_2_17_aarch64.manylinux2014_aarch64.whl.

File metadata

File hashes

Hashes for py_excel_form_extractor-0.3.39-cp312-cp312-manylinux_2_17_aarch64.manylinux2014_aarch64.whl
Algorithm Hash digest
SHA256 61631b1a3c373db5c59447a6913ce0a423e075ee35b1030f397294b037070aff
MD5 8d902dd8808daaaad687113ebcb135c9
BLAKE2b-256 f57ed1dca1f41a9831933bb8f43026026d45aa8bb39e7fa6844898e66e81a2f2

See more details on using hashes here.

File details

Details for the file py_excel_form_extractor-0.3.39-cp312-cp312-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for py_excel_form_extractor-0.3.39-cp312-cp312-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 3fef819149a5e885f1d010a9d508b3b4da5b63ba445c42ed957d0e4decb7417c
MD5 2bfd8b6aa49467f9d5423a2ec7a78975
BLAKE2b-256 519eac8e795f4acef0dcb943060525c65c508b4647fa3c779d5cba1a17d44e61

See more details on using hashes here.

File details

Details for the file py_excel_form_extractor-0.3.39-cp312-cp312-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for py_excel_form_extractor-0.3.39-cp312-cp312-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 d855c80ada0f562c3ac735a93d15a99ba74f0b693b4d5f825c2d0a0ddf6cee7a
MD5 5bd920a1686fdf6e649934453a7f911e
BLAKE2b-256 d98c37c700172494cca3eadf4e63199aa766785e5131e089a525230d1cc818e5

See more details on using hashes here.

File details

Details for the file py_excel_form_extractor-0.3.39-cp311-cp311-manylinux_2_17_aarch64.manylinux2014_aarch64.whl.

File metadata

File hashes

Hashes for py_excel_form_extractor-0.3.39-cp311-cp311-manylinux_2_17_aarch64.manylinux2014_aarch64.whl
Algorithm Hash digest
SHA256 03f04fd2f18bf5ecd71232ae0c284846eb6030163044eb8f44e61d5add1a6f58
MD5 46f3ef983223cd0b075f7463c2af4ae1
BLAKE2b-256 91ac4b97bfbb593c597423ec4b77b375bc234f8d8b601cf7eef18ab7287d1a24

See more details on using hashes here.

File details

Details for the file py_excel_form_extractor-0.3.39-cp311-cp311-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for py_excel_form_extractor-0.3.39-cp311-cp311-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 7f627d86f68cc78dc8c7f74f0875f653312016051ea7d12a876668bc01ed92c4
MD5 d4029587bd87d65e7834cee953458ca1
BLAKE2b-256 9d9ef758a5b034a76af29b9227c28475e6bd531e59f2c7141d6bf1e13dad8e7b

See more details on using hashes here.

File details

Details for the file py_excel_form_extractor-0.3.39-cp311-cp311-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for py_excel_form_extractor-0.3.39-cp311-cp311-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 f3148d02fa1363a8c56be4524174109ad51c475dac3f05851962b2c6a891762b
MD5 e3cd1b1a5b090c9284d8fd2d6872f107
BLAKE2b-256 43824e52f9d2074d6e21a67ec7f2b7ae591af5b341a9e3ccfc2f52347895e753

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page