Extract excel form content into structured data
Project description
Excel extractor
Extract excel form content into structured data.
Usage
- SECCF extraction: supplier export control classification Form/declaration
from py_excel_form_extractor import extractor, go
company_names = extractor.CompanyNameList() # the company name which can be mentioned in the file
for company_name in ["Amazon", "Amazon Ltd"]:
company_names.append(company_name)
extr = extractor.make_seccf_extractor("Example.xlsx", company_names)
extraction = extr.extract()
# convert to JSON string
extr_json = extr.to_json()
BUILD
- Building the go binary
go build -o gobinary ./cmd/excelExtractor
- Running the program without building the binary
go build -o ./bin/excel-extrator ./cmd/excelFormExtractor/main.go
- Run the binary:
./bin/excel-extrator
Local Python bindings generation and installation
pip3 install pybindgen wheel
gopy build --output=py_excel_form_extractor -vm=python3 ./pkg/*
RELEASE_VERSION=YOUR_UPDATED_PACKAGE_VERSION python3 setup.py bdist_wheel --force
# install wheel file
wheel_file=$(ls dist/*.whl | head -n1); pip3 install $wheel_file
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
Built Distributions
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file py_excel_form_extractor-0.3.39-cp313-cp313-manylinux_2_17_aarch64.manylinux2014_aarch64.whl.
File metadata
- Download URL: py_excel_form_extractor-0.3.39-cp313-cp313-manylinux_2_17_aarch64.manylinux2014_aarch64.whl
- Upload date:
- Size: 5.9 MB
- Tags: CPython 3.13, manylinux: glibc 2.17+ ARM64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
851ba6f199fe7ff5cb2d767479f79135e820603fb18670902424c3215df275f2
|
|
| MD5 |
e451db57cbdaed9852db6943bea4084d
|
|
| BLAKE2b-256 |
b41c78a7950d53783c04d339831928911184090e09b2dcae68b88702f8135114
|
File details
Details for the file py_excel_form_extractor-0.3.39-cp313-cp313-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl.
File metadata
- Download URL: py_excel_form_extractor-0.3.39-cp313-cp313-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl
- Upload date:
- Size: 6.3 MB
- Tags: CPython 3.13, manylinux: glibc 2.17+ x86-64, manylinux: glibc 2.5+ x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
fd537d582b48479aa7496de28d07784b3389395550674351e6a8a9c19f932c6f
|
|
| MD5 |
776f347e422ffca8afbe9c6528c78e68
|
|
| BLAKE2b-256 |
e8a9952b34ebdf79708ea4ba6928f4457dbaa6a1d4833d9490d96d78da04a1dc
|
File details
Details for the file py_excel_form_extractor-0.3.39-cp312-cp312-manylinux_2_17_aarch64.manylinux2014_aarch64.whl.
File metadata
- Download URL: py_excel_form_extractor-0.3.39-cp312-cp312-manylinux_2_17_aarch64.manylinux2014_aarch64.whl
- Upload date:
- Size: 5.9 MB
- Tags: CPython 3.12, manylinux: glibc 2.17+ ARM64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
61631b1a3c373db5c59447a6913ce0a423e075ee35b1030f397294b037070aff
|
|
| MD5 |
8d902dd8808daaaad687113ebcb135c9
|
|
| BLAKE2b-256 |
f57ed1dca1f41a9831933bb8f43026026d45aa8bb39e7fa6844898e66e81a2f2
|
File details
Details for the file py_excel_form_extractor-0.3.39-cp312-cp312-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl.
File metadata
- Download URL: py_excel_form_extractor-0.3.39-cp312-cp312-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl
- Upload date:
- Size: 6.3 MB
- Tags: CPython 3.12, manylinux: glibc 2.17+ x86-64, manylinux: glibc 2.5+ x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
3fef819149a5e885f1d010a9d508b3b4da5b63ba445c42ed957d0e4decb7417c
|
|
| MD5 |
2bfd8b6aa49467f9d5423a2ec7a78975
|
|
| BLAKE2b-256 |
519eac8e795f4acef0dcb943060525c65c508b4647fa3c779d5cba1a17d44e61
|
File details
Details for the file py_excel_form_extractor-0.3.39-cp312-cp312-macosx_11_0_arm64.whl.
File metadata
- Download URL: py_excel_form_extractor-0.3.39-cp312-cp312-macosx_11_0_arm64.whl
- Upload date:
- Size: 3.6 MB
- Tags: CPython 3.12, macOS 11.0+ ARM64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
d855c80ada0f562c3ac735a93d15a99ba74f0b693b4d5f825c2d0a0ddf6cee7a
|
|
| MD5 |
5bd920a1686fdf6e649934453a7f911e
|
|
| BLAKE2b-256 |
d98c37c700172494cca3eadf4e63199aa766785e5131e089a525230d1cc818e5
|
File details
Details for the file py_excel_form_extractor-0.3.39-cp311-cp311-manylinux_2_17_aarch64.manylinux2014_aarch64.whl.
File metadata
- Download URL: py_excel_form_extractor-0.3.39-cp311-cp311-manylinux_2_17_aarch64.manylinux2014_aarch64.whl
- Upload date:
- Size: 5.9 MB
- Tags: CPython 3.11, manylinux: glibc 2.17+ ARM64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
03f04fd2f18bf5ecd71232ae0c284846eb6030163044eb8f44e61d5add1a6f58
|
|
| MD5 |
46f3ef983223cd0b075f7463c2af4ae1
|
|
| BLAKE2b-256 |
91ac4b97bfbb593c597423ec4b77b375bc234f8d8b601cf7eef18ab7287d1a24
|
File details
Details for the file py_excel_form_extractor-0.3.39-cp311-cp311-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl.
File metadata
- Download URL: py_excel_form_extractor-0.3.39-cp311-cp311-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl
- Upload date:
- Size: 6.3 MB
- Tags: CPython 3.11, manylinux: glibc 2.17+ x86-64, manylinux: glibc 2.5+ x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
7f627d86f68cc78dc8c7f74f0875f653312016051ea7d12a876668bc01ed92c4
|
|
| MD5 |
d4029587bd87d65e7834cee953458ca1
|
|
| BLAKE2b-256 |
9d9ef758a5b034a76af29b9227c28475e6bd531e59f2c7141d6bf1e13dad8e7b
|
File details
Details for the file py_excel_form_extractor-0.3.39-cp311-cp311-macosx_11_0_arm64.whl.
File metadata
- Download URL: py_excel_form_extractor-0.3.39-cp311-cp311-macosx_11_0_arm64.whl
- Upload date:
- Size: 3.6 MB
- Tags: CPython 3.11, macOS 11.0+ ARM64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
f3148d02fa1363a8c56be4524174109ad51c475dac3f05851962b2c6a891762b
|
|
| MD5 |
e3cd1b1a5b090c9284d8fd2d6872f107
|
|
| BLAKE2b-256 |
43824e52f9d2074d6e21a67ec7f2b7ae591af5b341a9e3ccfc2f52347895e753
|