Another Excel-to-RDF converter for SKOS vocabs, but one that includes profile-based validation of results.

VocExcel

Another Excel-to-RDF converter for SKOS vocabs, but one that:

  • uses fixed templates to keep it simple
  • meets particular SKOS profile outcomes (VocPub)
  • is under active development, in production use, and commercially supported

An online version of VocExcel is available at https://tools.kurrawong.ai/tools/vocexcel.

Creating vocabularies

The process to create an RDF vocabulary from an Excel template is:

  1. Fill in a copy of an Excel template
  2. Process it
    • Using one of the options, and export an RDF file
    • You can choose to validate the RDF produced while processing

Templates

The template files in this repository’s vocexcel/templates/ folder are to be used to create vocabularies. The templates hopefully contain all the information needed to understand how to fill them in.

Use one Excel workbook per vocabulary.

Latest Template

  • vocexcel/templates/VocExcel-template-100.xlsx

Unless you have a good reason to do something different, please use the latest version of the template.

Older templates still convert, so if you’ve used one and like it, keep using it.

Examples

Example filled-in templates are given in the tests/data/ folder. Make sure you’re looking at examples prefixed with the same template version you are after, e.g. 0.8.10 = 0812.xlsx.

As per semantic versioning, a template of 0.8.5 will work with 0.8.10.
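The compatibility rule above can be sketched in a few lines of Python. This is only an illustration, not part of VocExcel: the helper names parse_version and same_minor_series are hypothetical, and the rule that two versions are compatible when they share major and minor numbers is an assumption drawn from the 0.8.5 / 0.8.10 example.

```python
def parse_version(text: str) -> tuple[int, int, int]:
    """Split a 'major.minor.patch' string into integers.

    Comparing integers avoids the string-ordering trap where
    "0.8.10" sorts before "0.8.5".
    """
    major, minor, patch = (int(part) for part in text.split("."))
    return major, minor, patch


def same_minor_series(template_version: str, other_version: str) -> bool:
    """Assumed rule: versions are compatible if major and minor match."""
    return parse_version(template_version)[:2] == parse_version(other_version)[:2]


# Hypothetical usage with the versions mentioned above
compatible = same_minor_series("0.8.5", "0.8.10")
```

Parsing into integer tuples also gives correct ordering, so the same helper can be used to pick the newest example file in a series.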

Processing

To process an Excel template, you will need to either:

  • run the convert.py Python script, as described under Running below
  • use the online tool

The Python script can also run as a Python module, i.e. within a larger Python workflow.

Installation

VocExcel uses the uv package manager.

You can install VocExcel from PyPI, the Python package index: https://pypi.org/project/vocexcel

Or you can clone this repository and install VocExcel’s dependencies using uv or a similar tool, using the provided pyproject.toml file.

Running

As a command line script

The Python script convert.py in the vocexcel/ directory can be run on Windows/Unix/Linux/Mac systems like this:

python convert.py some-excel-file.xlsx

If you install this program using a Python packaging tool such as uv, then it will run like this:

vocexcel some-excel-file.xlsx

An example, using one of the test data files to convert from Excel to RDF:

python convert.py tests/data/0812.xlsx

To convert the other way, RDF to Excel, from which you’ll get a v0.8.x template result:

python convert.py tests/data/085_rdf.ttl

The command line argument options can be found by typing:

python convert.py -h

They are:

usage: vocexcel [-h] [-i] [-o OUTPUTFILE] [input_file]

positional arguments:
  input_file            The Excel file to convert to a SKOS vocabulary in RDF or an RDF file to convert to an Excel file. (default: None)

options:
  -h, --help            show this help message and exit
  -i, --info            The version and other info of this instance of VocExcel. (default: False)
  -o OUTPUTFILE, --outputfile OUTPUTFILE
                        An optionally-provided output file path. If not provided, output from Excel-> RDF is to standard out and RDF->Excel is input file with .xlsx file ending. (default: None)
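For readers who want to see how an interface like this is typically declared, here is a minimal argparse sketch that reproduces the usage text above. It is a reconstruction from the help output only, not VocExcel's actual source; the function name build_parser and the shortened help strings are assumptions.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build a parser mirroring the usage text shown above (a reconstruction)."""
    parser = argparse.ArgumentParser(
        prog="vocexcel",
        # Appends "(default: ...)" to each help string, as in the output above
        formatter_class=argparse.ArgumentDefaultsHelpFormatter,
    )
    parser.add_argument(
        "input_file",
        nargs="?",  # optional positional, so "-i" can be used on its own
        help="The Excel or RDF file to convert.",
    )
    parser.add_argument(
        "-i",
        "--info",
        action="store_true",
        help="The version and other info of this instance.",
    )
    parser.add_argument(
        "-o",
        "--outputfile",
        help="An optionally-provided output file path.",
    )
    return parser


# Parse a sample command line instead of sys.argv
args = build_parser().parse_args(["-o", "vocab.ttl", "some-excel-file.xlsx"])
```

Making input_file optional with nargs="?" is what allows the bracketed [input_file] in the usage line.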

As a Python library

The convert.py file uses the functions excel_to_rdf() and rdf_to_excel() to do conversions, so you can use them directly in other Python programs by importing them like this:

from vocexcel.convert import excel_to_rdf, rdf_to_excel
from pathlib import Path

# Excel to RDF
excel_to_rdf(Path(".") / "path" / "to" / "vocab-file.xlsx")

# or, RDF to Excel

rdf_to_excel(Path(".") / "path" / "to" / "vocab-file.ttl")
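Within a larger workflow it can be handy to choose the conversion direction from the file extension. The sketch below is one way to do that, under stated assumptions: convert_by_suffix is a hypothetical helper, only the .xlsx and .ttl extensions seen in this README are handled, and both converters are assumed to accept a single Path argument as in the example above. The converters are passed in as parameters so the helper does not itself depend on VocExcel being installed.

```python
from pathlib import Path
from typing import Any, Callable

# Assumed shape of the two converters: one Path in, some result out
Converter = Callable[[Path], Any]


def convert_by_suffix(
    path: Path, excel_to_rdf: Converter, rdf_to_excel: Converter
) -> Any:
    """Dispatch to the right converter based on the file extension."""
    suffix = path.suffix.lower()
    if suffix == ".xlsx":
        return excel_to_rdf(path)
    if suffix == ".ttl":
        return rdf_to_excel(path)
    raise ValueError(f"Unsupported file type: {path.name}")


# Hypothetical usage, with VocExcel installed:
#   from vocexcel.convert import excel_to_rdf, rdf_to_excel
#   convert_by_suffix(Path("vocab-file.xlsx"), excel_to_rdf, rdf_to_excel)
```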

Online

KurrawongAI maintains an online VocExcel tool at https://tools.kurrawong.ai/tools/vocexcel

License

This code is licensed under the BSD 3-Clause license. See the LICENSE file for the deed. Note that Excel is the property of Microsoft.

Contact

Commercial support:
https://docs.kurrawong.ai/products/tools/vocexcel/
info@kurrawong.ai

Lead Developer:
Nicholas Car
Data Architect
KurrawongAI
nick@kurrawong.ai

Release Procedure

  • task format
  • update version number in pyproject.toml
  • update ref to latest template, if changed, in README.md
  • Git commit all changes
    • push to GitHub: git push
  • add a Git tag matching the version number
    • push tags to GitHub: git push --tags
  • make a release on GitHub
    • this will trigger a PyPI release
  • do post release version number update
