A library for orchestrating the processing of document. Typically in a Gen AI applications (but not limited to just Gen AI).
Project description
GLLM Docproc
Description
A library for orchestrating the processing of document. Typically in a Gen AI applications (but not limited to just Gen AI).
Installation
Prerequisites
Mandatory:
- Python 3.11+ — Install here
- pip — Install here
- uv — Install here
- gcloud CLI (for authentication) — Install here, then log in using:
gcloud auth login
Install from Artifact Registry
This requires authentication via the gcloud CLI.
- Export token
export GCLOUD_ACCESS_TOKEN="$(gcloud auth print-access-token)"
- Configure the index in your
pyproject.tom;
[[tool.uv.index]]
name = "gen-ai-internal"
url = "https://oauth2accesstoken:${GCLOUD_ACCESS_TOKEN}@glsdk.gdplabs.id/gen-ai-internal/simple/"
- Add the dependency
uv add gllm-docproc
Local Development Setup
Prerequisites
-
Python 3.11+ — Install here
-
pip — Install here
-
uv — Install here
-
gcloud CLI — Install here, then log in using:
gcloud auth login
-
Git — Install here
-
Access to the GDP Labs SDK GitHub repository
1. Clone Repository
git clone git@github.com:GDP-ADMIN/gl-sdk.git
cd gl-sdk/libs/gllm-docproc
2. Setup Authentication
Set the following environment variables to authenticate with internal package indexes:
export UV_INDEX_GEN_AI_INTERNAL_USERNAME=oauth2accesstoken
export UV_INDEX_GEN_AI_INTERNAL_PASSWORD="$(gcloud auth print-access-token)"
export UV_INDEX_GEN_AI_USERNAME=oauth2accesstoken
export UV_INDEX_GEN_AI_PASSWORD="$(gcloud auth print-access-token)"
3. Quick Setup
Run:
make setup
4. Activate Virtual Environment
source .venv/bin/activate
Local Development Utilities
The following Makefile commands are available for quick operations:
Install uv
make install-uv
Install Pre-Commit
make install-pre-commit
Install Dependencies
make install
Update Dependencies
make update
Run Tests
make test
Contributing
Please refer to the Python Style Guide for information about code style, documentation standards, and SCA requirements.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
Built Distributions
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file gllm_docproc_binary-0.8.6-cp312-cp312-win_amd64.whl.
File metadata
- Download URL: gllm_docproc_binary-0.8.6-cp312-cp312-win_amd64.whl
- Upload date:
- Size: 1.8 MB
- Tags: CPython 3.12, Windows x86-64
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
3cf08b242ce4217675f5be4bee203f8943597bdd693187aa0c51904848625966
|
|
| MD5 |
33bfc9f63aa1430e79a7eb995b3c0412
|
|
| BLAKE2b-256 |
65c171e6e1d845e6af387b9ecca4b60e43ff7c46106b28daa154f450463f7260
|
Provenance
The following attestation bundles were made for gllm_docproc_binary-0.8.6-cp312-cp312-win_amd64.whl:
Publisher:
build-binary.yml on GDP-ADMIN/gl-sdk
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
gllm_docproc_binary-0.8.6-cp312-cp312-win_amd64.whl -
Subject digest:
3cf08b242ce4217675f5be4bee203f8943597bdd693187aa0c51904848625966 - Sigstore transparency entry: 929953677
- Sigstore integration time:
-
Permalink:
GDP-ADMIN/gl-sdk@09ac1b95310a42a54e0cd5f43ca9a5ca136ce9c9 -
Branch / Tag:
refs/tags/gllm_docproc-v0.8.6 - Owner: https://github.com/GDP-ADMIN
-
Access:
private
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
build-binary.yml@09ac1b95310a42a54e0cd5f43ca9a5ca136ce9c9 -
Trigger Event:
push
-
Statement type:
File details
Details for the file gllm_docproc_binary-0.8.6-cp312-cp312-macosx_13_0_arm64.whl.
File metadata
- Download URL: gllm_docproc_binary-0.8.6-cp312-cp312-macosx_13_0_arm64.whl
- Upload date:
- Size: 2.0 MB
- Tags: CPython 3.12, macOS 13.0+ ARM64
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
ee4ce9a9602547f0963b52db6d952a72e3f87b763457aa727616bf170f0da63f
|
|
| MD5 |
d0953ba01cdc4320e0cdf385d1d8e46a
|
|
| BLAKE2b-256 |
11bfe77d15752423646ea1fbf33e9ce2b2ea095faacec69565cfbaec5bbf7c96
|
Provenance
The following attestation bundles were made for gllm_docproc_binary-0.8.6-cp312-cp312-macosx_13_0_arm64.whl:
Publisher:
build-binary.yml on GDP-ADMIN/gl-sdk
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
gllm_docproc_binary-0.8.6-cp312-cp312-macosx_13_0_arm64.whl -
Subject digest:
ee4ce9a9602547f0963b52db6d952a72e3f87b763457aa727616bf170f0da63f - Sigstore transparency entry: 929953676
- Sigstore integration time:
-
Permalink:
GDP-ADMIN/gl-sdk@09ac1b95310a42a54e0cd5f43ca9a5ca136ce9c9 -
Branch / Tag:
refs/tags/gllm_docproc-v0.8.6 - Owner: https://github.com/GDP-ADMIN
-
Access:
private
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
build-binary.yml@09ac1b95310a42a54e0cd5f43ca9a5ca136ce9c9 -
Trigger Event:
push
-
Statement type:
File details
Details for the file gllm_docproc_binary-0.8.6-cp311-cp311-win_amd64.whl.
File metadata
- Download URL: gllm_docproc_binary-0.8.6-cp311-cp311-win_amd64.whl
- Upload date:
- Size: 1.9 MB
- Tags: CPython 3.11, Windows x86-64
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
a2b65558ec08ba8d3f48e767b57be313bb100e78341d7bb26d01f9ad62051af6
|
|
| MD5 |
b8b06ab7726ea2446711964543825112
|
|
| BLAKE2b-256 |
8e72481053e03154468aee832088fffa9ad64d652d1727938576056a64cd3240
|
Provenance
The following attestation bundles were made for gllm_docproc_binary-0.8.6-cp311-cp311-win_amd64.whl:
Publisher:
build-binary.yml on GDP-ADMIN/gl-sdk
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
gllm_docproc_binary-0.8.6-cp311-cp311-win_amd64.whl -
Subject digest:
a2b65558ec08ba8d3f48e767b57be313bb100e78341d7bb26d01f9ad62051af6 - Sigstore transparency entry: 929953688
- Sigstore integration time:
-
Permalink:
GDP-ADMIN/gl-sdk@09ac1b95310a42a54e0cd5f43ca9a5ca136ce9c9 -
Branch / Tag:
refs/tags/gllm_docproc-v0.8.6 - Owner: https://github.com/GDP-ADMIN
-
Access:
private
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
build-binary.yml@09ac1b95310a42a54e0cd5f43ca9a5ca136ce9c9 -
Trigger Event:
push
-
Statement type:
File details
Details for the file gllm_docproc_binary-0.8.6-cp311-cp311-manylinux_2_31_x86_64.whl.
File metadata
- Download URL: gllm_docproc_binary-0.8.6-cp311-cp311-manylinux_2_31_x86_64.whl
- Upload date:
- Size: 2.4 MB
- Tags: CPython 3.11, manylinux: glibc 2.31+ x86-64
- Uploaded using Trusted Publishing? Yes
- Uploaded via: uv/0.8.24
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
68fd37ef8ceaf621a50af85d99df81d03bffb9b291188e84abc9103e81b2a65b
|
|
| MD5 |
84bb97e31d68e8e3f4e93a0c89b3b1a6
|
|
| BLAKE2b-256 |
fcf340ef6a6f0ac116caa0e6682763a9e96a9c9c2f1bd5ccbe3033c3afe2aea8
|
File details
Details for the file gllm_docproc_binary-0.8.6-cp311-cp311-macosx_13_0_arm64.whl.
File metadata
- Download URL: gllm_docproc_binary-0.8.6-cp311-cp311-macosx_13_0_arm64.whl
- Upload date:
- Size: 2.0 MB
- Tags: CPython 3.11, macOS 13.0+ ARM64
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
93da520d691006ff948e66815af2892f5959a36717b23a07e2b31f0d9939f235
|
|
| MD5 |
2fe7ca0344bfca002a98ebf89d2df0df
|
|
| BLAKE2b-256 |
4ef756b8b3f757089e81f72d1e35a20cd3e9fc7b4ffddb8a436346f1b0b93b84
|
Provenance
The following attestation bundles were made for gllm_docproc_binary-0.8.6-cp311-cp311-macosx_13_0_arm64.whl:
Publisher:
build-binary.yml on GDP-ADMIN/gl-sdk
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
gllm_docproc_binary-0.8.6-cp311-cp311-macosx_13_0_arm64.whl -
Subject digest:
93da520d691006ff948e66815af2892f5959a36717b23a07e2b31f0d9939f235 - Sigstore transparency entry: 929953691
- Sigstore integration time:
-
Permalink:
GDP-ADMIN/gl-sdk@09ac1b95310a42a54e0cd5f43ca9a5ca136ce9c9 -
Branch / Tag:
refs/tags/gllm_docproc-v0.8.6 - Owner: https://github.com/GDP-ADMIN
-
Access:
private
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
build-binary.yml@09ac1b95310a42a54e0cd5f43ca9a5ca136ce9c9 -
Trigger Event:
push
-
Statement type: