AI-assisted table configuration generation for Tablassert — entity resolution, YAML validation, and Biolink documentation lookup.
Project description
Tablassist CLI
Python CLI tool for AI-assisted Tablassert table configuration generation — entity resolution, YAML validation, and Biolink documentation lookup.
Tablassist ships with two document extraction modes:
extract-textfor fast raw extraction via Textractextract-text-semanticfor richer Docling-backed semantic extraction with Markdown output andocr=autoby default
Installation
pip install tablassist
The base install now includes Docling, so extract-text-semantic works without a separate helper script or optional extra.
An optional extra is available for CPU compatibility:
pip install "tablassist[rtcompat]" # Polars build for CPUs without required instructions
Requirements
- Python >= 3.13
- Environment variables
TABLASSIST_USERNAMEandTABLASSIST_API_KEYfor API-accessing commands
Usage
# Fetch table configuration documentation
tablassist docs-table-config
# Fetch advanced configuration examples
tablassist docs-advanced-examples
# Fetch the CLI tutorial
tablassist docs-tutorial
Entity Resolution
# Search for entity CURIEs by term
tablassist search-curies "breast cancer"
# Get canonical info for a specific CURIE
tablassist get-curie-info "MONDO:0007254"
# Search gene CURIEs within an NCBI taxon
tablassist search-gene-curies "BRCA1" --ncbi-taxon 9606
# Resolve an NCBI Taxon ID from an organism name
tablassist resolve-taxon-id "Homo sapiens"
Biolink Reference
# List all supported categories, predicates, or qualifiers
tablassist list-categories
tablassist list-predicates
tablassist list-qualifiers
# Fetch documentation for a specific Biolink element
tablassist docs-category "Gene"
tablassist docs-predicate "interacts_with"
tablassist docs-qualifier "qualified_predicate"
YAML Validation
Full config validation requires template: as the top-level key, with optional sections:. Use validate-section-str only for individual section mappings, not for whole config files.
# Validate a full config file
tablassist validate-config-file config.yaml
# Validate a single section from a YAML string
tablassist validate-section-str '<yaml>'
# Validate a full config from a YAML string
tablassist validate-config-str '<yaml>'
# Get the Section JSON schema
tablassist section-schema
Data Preview
# List sheets in an Excel file
tablassist excel-sheets data.xlsx
# Preview rows from an Excel sheet
tablassist preview-excel data.xlsx "Sheet1" 10
# Preview rows from a CSV file
tablassist preview-csv data.csv 10
# Extract text from a document (PDF, DOCX, etc.)
tablassist extract-text document.pdf
# Extract semantic Markdown from a document with Docling
tablassist extract-text-semantic document.pdf
# Extract plain text and explicitly disable OCR
tablassist extract-text-semantic document.pdf text off
extract-text is optimized for fast, low-overhead text grabs.
extract-text-semantic runs IBM Docling directly from the CLI module. It is the better choice when reading order, headings, lists, or table-aware Markdown matter more than raw speed.
Arguments for extract-text-semantic:
file— local document pathoutput_format—markdown(default) ortextocr—auto(default),off, oron
Use ocr=auto for the default balance. Use ocr=on for scans and image-heavy PDFs, and ocr=off when you know the source is born-digital and want the lightest path.
PMC Archive Download
# Download and extract a PMC tar archive
tablassist download-pmc-tar 12345 --dest-dir ./output
Development
uv sync # install dependencies
uv run ruff check . # lint
uv run ruff check --fix . # lint with auto-fix
uv run ruff format . # format
uv run pyright # type check
uv run --group dev python -m pytest # run all tests
License
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file tablassist-0.2.6.tar.gz.
File metadata
- Download URL: tablassist-0.2.6.tar.gz
- Upload date:
- Size: 151.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
77d123ed94e00bfc8302445c379f05bfb422d971f2ce84d6e712b42b359bb159
|
|
| MD5 |
1bee05483cfd2dfd316fafae96c176d5
|
|
| BLAKE2b-256 |
3f39522d7af82d5c42c46d3735439258f32f71700aecf0931be9261ec5200c3f
|
Provenance
The following attestation bundles were made for tablassist-0.2.6.tar.gz:
Publisher:
pypi.yml on SkyeAv/Tablassist
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
tablassist-0.2.6.tar.gz -
Subject digest:
77d123ed94e00bfc8302445c379f05bfb422d971f2ce84d6e712b42b359bb159 - Sigstore transparency entry: 1343174033
- Sigstore integration time:
-
Permalink:
SkyeAv/Tablassist@ce93a8c13f1249d8912a7aac2b7588602066c567 -
Branch / Tag:
refs/heads/master - Owner: https://github.com/SkyeAv
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
pypi.yml@ce93a8c13f1249d8912a7aac2b7588602066c567 -
Trigger Event:
workflow_dispatch
-
Statement type:
File details
Details for the file tablassist-0.2.6-py3-none-any.whl.
File metadata
- Download URL: tablassist-0.2.6-py3-none-any.whl
- Upload date:
- Size: 8.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
a32ee771a370868cd906851334f5f79c415d1b81441e6836f11a6e137bd9d788
|
|
| MD5 |
1339bb5b65abf5de0192889c1fcbc23f
|
|
| BLAKE2b-256 |
7015ace8cf16ed87e63a749600755b9d8337d1edabb4a6ada757cbea85c8815d
|
Provenance
The following attestation bundles were made for tablassist-0.2.6-py3-none-any.whl:
Publisher:
pypi.yml on SkyeAv/Tablassist
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
tablassist-0.2.6-py3-none-any.whl -
Subject digest:
a32ee771a370868cd906851334f5f79c415d1b81441e6836f11a6e137bd9d788 - Sigstore transparency entry: 1343174037
- Sigstore integration time:
-
Permalink:
SkyeAv/Tablassist@ce93a8c13f1249d8912a7aac2b7588602066c567 -
Branch / Tag:
refs/heads/master - Owner: https://github.com/SkyeAv
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
pypi.yml@ce93a8c13f1249d8912a7aac2b7588602066c567 -
Trigger Event:
workflow_dispatch
-
Statement type: