JUst Give Me Tables
Project description
jugmt is a minimalistic spex (SPecification EXtractor) implementation whose codebase is fewer than 200 lines of Python. The tool extracts figure information and tables from .docx files, generates HTML and JSON, and validates the JSON against a JSON schema.
When run on a collection of NVMe specification documents (Base, Boot, MI, NVM, ZNS, KV, PCI, RDMA, and TCP), the tool takes a total of 5 seconds of wall-clock time and about 500 MB of memory for all documents combined, using a single thread on an i7-1360P.
For more information on the source code, extracted table formats, and validation, please refer to the online documentation.
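To illustrate the kind of table extraction described above, here is a minimal sketch that uses only the Python standard library. It is not jugmt's actual code or API: the function names `extract_tables` and `tables_to_json` and the `{"tables": ...}` JSON layout are assumptions made for this example. It relies only on the fact that a .docx file is a ZIP archive whose main body lives in `word/document.xml`, with tables stored as `w:tbl` elements.

```python
import json
import zipfile
import xml.etree.ElementTree as ET

# WordprocessingML namespace used by word/document.xml inside a .docx archive.
W = "{http://schemas.openxmlformats.org/wordprocessingml/2006/main}"


def extract_tables(docx_file):
    """Return every table as a list of rows, each row a list of cell strings.

    docx_file may be a path or a binary file-like object.
    Hypothetical helper for illustration; not part of jugmt.
    """
    with zipfile.ZipFile(docx_file) as archive:
        root = ET.fromstring(archive.read("word/document.xml"))

    tables = []
    # iter() walks the whole tree, so nested tables are also reported
    # as separate entries (and their text appears in the enclosing cell).
    for tbl in root.iter(W + "tbl"):
        rows = []
        for tr in tbl.findall(W + "tr"):
            cells = []
            for tc in tr.findall(W + "tc"):
                # A cell's text is spread over one or more <w:t> runs.
                cells.append("".join(t.text or "" for t in tc.iter(W + "t")))
            rows.append(cells)
        tables.append(rows)
    return tables


def tables_to_json(tables):
    """Serialize the extracted tables; the layout is an assumed example."""
    return json.dumps({"tables": tables}, indent=2)
```

Calling `tables_to_json(extract_tables("spec.docx"))` (with `spec.docx` standing in for any local document) would yield a JSON string that could then be checked against a schema with a validator of your choice.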
Download files
Source Distribution
Built Distribution
File details
Details for the file jugmt-1.0.1.tar.gz.
File metadata
- Download URL: jugmt-1.0.1.tar.gz
- Upload date:
- Size: 13.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.10.12
File hashes
Algorithm | Hash digest
---|---
SHA256 | ea4cc20063299d18abb0f17bc7df502b47d5679dd6f42b60a576d0c7974b49c5
MD5 | 426a21858521ab36c8d6858d6cdd9c9f
BLAKE2b-256 | b52e5c64bf898b2da1b14cb29537204e6e977e95c39f1e5734a977c5246cd5a2
File details
Details for the file jugmt-1.0.1-py3-none-any.whl.
File metadata
- Download URL: jugmt-1.0.1-py3-none-any.whl
- Upload date:
- Size: 13.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.10.12
File hashes
Algorithm | Hash digest
---|---
SHA256 | 17517aa4326a3ec95a036ee938a32e1e0751ab6146ac33bb8bd8f58898564ef5
MD5 | 5c19e61c2509e4c937a51604cbae7e73
BLAKE2b-256 | ada5a3d986ff398eb425f9917292c5460fc99760e5a5c10c1d4204d544906fad
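The digests listed for each file can be checked against a downloaded copy. The following is a minimal sketch using Python's standard `hashlib`; the helper names `file_digest` and `matches` are hypothetical, and the label `"blake2b-256"` is an assumed shorthand for BLAKE2b with a 32-byte digest.

```python
import hashlib


def file_digest(path, algorithm):
    """Return the hex digest of the file at path.

    algorithm is "sha256", "md5", or the assumed label "blake2b-256".
    """
    if algorithm == "blake2b-256":
        # BLAKE2b truncated to 256 bits, i.e. a 32-byte digest.
        hasher = hashlib.blake2b(digest_size=32)
    else:
        # Note: md5 may be unavailable on FIPS-restricted Python builds.
        hasher = hashlib.new(algorithm)
    with open(path, "rb") as f:
        # Read in chunks so large files are not loaded into memory at once.
        for chunk in iter(lambda: f.read(65536), b""):
            hasher.update(chunk)
    return hasher.hexdigest()


def matches(path, algorithm, expected_hex):
    """True if the file's digest equals the published hex string."""
    return file_digest(path, algorithm) == expected_hex.strip().lower()
```

For example, `matches("jugmt-1.0.1.tar.gz", "sha256", "ea4cc20063299d18abb0f17bc7df502b47d5679dd6f42b60a576d0c7974b49c5")` should return `True` for an intact download of the source distribution.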