Skip to main content

Another Excel to RDF converter for SKOS vocabs, but one that include profile-based validation of results.

Project description

VocExcel

Another Excel-to-RDF converter for SKOS vocabs, but one that:

  • uses fixed templates to keep it simple
  • meets particular SKOS profile outcomes (VocPub)
  • is under active development, production use, and is commercially supported

An online version of VocExcel is available at https://tools.kurrawong.ai/tools/vocexcel.

Creating vocabularies

The process to create an RDF vocabulary from an Excel template is:

  1. Fill in a copy of an Excel template
  2. Process it
    • Using one of the options, and export an RDF file
    • You can choose to validate the RDF produced while processing

Templates

The template files in this repository’s vocexcel/templates/ folder are to be used to create vocabularies. The templates hopefully contain all the information needed to understand how to fill them in.

Use one Excel workbook per vocabulary.

Latest Template

  • vocexcel/templates/VocExcel-template-0812.xlsx

Unless you have a good reason to do something different, please use the latest version of the template.

Older templates still convert, so if you’ve used one and like it, keep using it.

Examples

Example filled-in templates versions are given in the tests/data/ folder. Just ensure you’re looking at examples prefixed with the same template version you are after, e.g. 0.8.10 = 0812.xlsx.

As per semantic versioning, a template of 0.8.5 will work with 0.8.10.

Processing

To process an Excel template, you will need to either:

The Python script can also run as a Python module, i.e. within a larger Python workflow.

Installation

VocExcel uses uv package manager.

You can install VocExcel from PyPI, the Python package index: https://pypi.org/project/vocexcel

Or you can clone this repository and install VocExcel’s dependencies using uv or a similar tool, using the provided pyproject.toml file.

Running

As a command line script

The Python script convert.py in the vocexcel/ directory can be run on Windows/Unix/Linux/Mac systems like this:

python convert.py some-excel-file.xlsx

If you install this program using a Python packaging tool such as uv, then it will run like this:

vocexel some-excel-file.xlsx

An example, using one of the test data files to convert from Excel to RDF:

python convert.py tests/data/0812.xlsx

To convert the other way - RDF to Excel - from with you’ll get a v0.8.x template result:

python convert.py tests/data/085_rdf.ttl

The command line argument options can be found by typing:

python convert.py -h

They are:

usage: vocexcel [-h] [-i] [-o OUTPUTFILE] [input_file]

positional arguments:
  input_file            The Excel file to convert to a SKOS vocabulary in RDF or an RDF file to convert to an Excel file. (default: None)

options:
  -h, --help            show this help message and exit
  -i, --info            The version and other info of this instance of VocExcel. (default: False)
  -o OUTPUTFILE, --outputfile OUTPUTFILE
                        An optionally-provided output file path. If not provided, output from Excel-> RDF is to standard out and RDF->Excel is input file with .xlsx file ending. (default: None)

As a Python library

The convert.py file uses the functions excel_to_rdf() and rdf_to_excel() to do conversions, so you can directly them in other Python programs by importing them like:

from vocexcel.convert import excel_to_rdf, rdf_to_excel
from pathlib import Path

rdf_to_excel(Path(".") / "path" / "to" / "vocab-file.xlsx")

# or

excel_to_rdf(Path(".") / "path" / "to" / "vocab-file.ttl")

Online

KurrawongAI maintains an online VocExcel tool at https://tools.kurrawong.ai/tools/vocexcel

License

This code is licensed using the BSD 3-Clause. See the LICENSE for the deed. Note that Excel is property of Microsoft.

Contact

Commercial support:
https://docs.kurrawong.ai/products/tools/vocexcel/\ info@kurrawong.ai

Lead Developer:
Nicholas Car
Data Architect
KurrawongAI
nick@kurrawong.ai

Release Procedure

  • update version numbers
    • pyprojects.toml
    • vocexcel/init.py
  • update ref to latest template, if changed, in README.adoc
  • reproduce README.md
    • uvx --from Pydowndoc-bin -- downdoc README.adoc -o README.md
  • add a Git tag matching the version number
    • push to GitHub: git push
    • push tags to GitHub: git push --tags
  • build: uv build
  • publish to PyPI: uv publis -u __token__ -p {TOKEN}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vocexcel-0.8.12.tar.gz (4.2 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

vocexcel-0.8.12-py3-none-any.whl (1.2 MB view details)

Uploaded Python 3

File details

Details for the file vocexcel-0.8.12.tar.gz.

File metadata

  • Download URL: vocexcel-0.8.12.tar.gz
  • Upload date:
  • Size: 4.2 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.9.15 {"installer":{"name":"uv","version":"0.9.15","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for vocexcel-0.8.12.tar.gz
Algorithm Hash digest
SHA256 36f38c269a6240595141a30a0e40341f6d723e66796c194c619c02203ed8ce35
MD5 e72170a9c12cbb1cc4792eff91a16263
BLAKE2b-256 8d940c8e00393b81e4cabc1fe9d739139d3988b5aad6586ea060b6e07ff4df01

See more details on using hashes here.

File details

Details for the file vocexcel-0.8.12-py3-none-any.whl.

File metadata

  • Download URL: vocexcel-0.8.12-py3-none-any.whl
  • Upload date:
  • Size: 1.2 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.9.15 {"installer":{"name":"uv","version":"0.9.15","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for vocexcel-0.8.12-py3-none-any.whl
Algorithm Hash digest
SHA256 2d91e53bf5f184bf2fb7abc5f886daa106d3d793988ef878ef28532e76261018
MD5 daa627f2ea0b2fcdfcf45337126dc608
BLAKE2b-256 d9e007ec482487815f3a4188141a3cad3366169095d83d12a29cc8f5a4a63681

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page