Generates OWL, SHACL shapes and SKOS concepts from XML Schema (XSD) files

XSD2RDF

A tool to convert XML Schema (XSD) files into various RDF formats (SHACL, OWL, SKOS) with integrated validation capabilities.

Overview

XSD2RDF allows you to convert XML Schema definitions into:

  • SHACL (Shapes Constraint Language) for RDF data validation
  • OWL (Web Ontology Language) for ontology representation
  • SKOS (Simple Knowledge Organization System) for concept schemes and taxonomies

Features

  • Convert XSD to SHACL, OWL, and SKOS based on integrated principles
  • SHACL shape constraints are linked to SKOS concept schemes when applicable
  • Handle complex XSD structures (choices, unions, complex types, enumerations, etc.)
  • Generated SHACL shapes are validated against SHACL-SHACL (the shapes graph that the SHACL specification provides for validating shapes graphs)

This repository also includes a validation script to check RDF data against the generated SHACL shapes and SKOS concepts.

Installation

From PyPI

pip install xsd2rdf

From Source

git clone https://github.com/YourUsername/xsd2rdf.git
cd xsd2rdf
python -m pip install poetry
poetry install

Basic Usage

Convert an XSD file to all RDF formats (SHACL, OWL, SKOS):

python -m xsd2rdf -x path/to/schema.xsd

This generates the following files:

  • schema.xsd.shape.ttl (SHACL shapes)
  • schema.xsd.owl.ttl (OWL ontology)
  • schema.xsd.*.skos.ttl (SKOS concept schemes, one file per enumeration)
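
The naming scheme above can be sketched in Python, for example to locate the generated files from another script. This is a minimal illustration of the documented file names only. The helper name expected_outputs is hypothetical and not part of xsd2rdf, and the fallback to the XSD file's own directory mirrors the documented default of -o.

```python
from pathlib import Path
from typing import Optional


def expected_outputs(xsd_file: str, output_dir: Optional[str] = None) -> dict:
    """Return the documented output paths for one XSD file.

    Hypothetical helper: it only reproduces the file names listed above
    and does not call xsd2rdf itself.
    """
    xsd = Path(xsd_file)
    # Default output directory: same directory as the XSD file.
    out = Path(output_dir) if output_dir else xsd.parent
    return {
        "shacl": out / f"{xsd.name}.shape.ttl",
        "owl": out / f"{xsd.name}.owl.ttl",
        # One SKOS file per enumeration, so only a glob pattern is known.
        "skos_glob": f"{xsd.name}.*.skos.ttl",
    }
```

For example, expected_outputs("path/to/schema.xsd")["shacl"] is the path path/to/schema.xsd.shape.ttl.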

Command Line Parameters

  • -x, --XSD_FILE: XSD file to be converted
  • -f, --FOLDER: Folder containing unrelated XSD files to be converted
  • -o, --OUTPUT_DIR: Output directory for generated files (default: same as XSD file)
  • -a, --ABBREVIATIONS_FILE: File containing custom abbreviations, one per line
  • -d, --debug: Enable debug output
  • -nc, --namespaced-concepts: Use namespaced IRIs for SKOS concepts

Specify either -x or -f. If both are given, -x takes precedence.

SKOS IRI Options

By default, SKOS concept IRIs are created using a flat structure:

targetnamespace/concepts/conceptschemename_conceptname

With the --namespaced-concepts flag, concepts use a hierarchical structure:

targetnamespace/concepts/conceptschemename/conceptname
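
The difference between the two patterns comes down to the separator between the concept scheme name and the concept name. The following sketch illustrates this. The function concept_iri is hypothetical, and details such as stripping a trailing "/" or "#" from the namespace and leaving names unencoded are assumptions, not confirmed behaviour of xsd2rdf.

```python
def concept_iri(target_namespace: str, scheme: str, concept: str,
                namespaced: bool = False) -> str:
    """Build a SKOS concept IRI following the two documented patterns.

    Hypothetical helper: no case normalisation or IRI escaping is applied,
    which may differ from what xsd2rdf actually does.
    """
    # Assumption: a trailing '/' or '#' on the namespace is dropped first.
    base = target_namespace.rstrip("/#")
    # Flat IRIs join scheme and concept with '_', namespaced IRIs with '/'.
    separator = "/" if namespaced else "_"
    return f"{base}/concepts/{scheme}{separator}{concept}"
```

With the namespace http://example.org/ns, the scheme Colour and the concept red, the flat form ends in Colour_red and the namespaced form ends in Colour/red.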

Examples

With custom output directory:

python -m xsd2rdf -x path/to/schema.xsd -o output/directory

With folder containing multiple unrelated XSD files:

python -m xsd2rdf -f path/to/folder

Using a custom abbreviations file:

python -m xsd2rdf -x path/to/schema.xsd -a path/to/abbreviations.txt

On Windows, a practical way to generate a candidate list of abbreviations is the following PowerShell command (replace the path with the path to your own XSD file):

Select-String -Path "c:\Users\mathi\Git\era\xsd2rdf\debug\SFERA_v3.00.xsd" -Pattern "\b[A-Z]{2,}\b" -AllMatches | ForEach-Object { $_.Matches } | ForEach-Object { $_.Value } | Where-Object { $_ -cmatch "^[A-Z]{2,}$" } | Sort-Object -Unique | Where-Object { $_.Length -ge 2 -and $_.Length -le 10 }
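
On other platforms, roughly the same list can be produced with a few lines of Python. This is a sketch rather than part of xsd2rdf. The function names are hypothetical, and the regular expression is intended to approximate the PowerShell pipeline (whole words consisting of 2 to 10 uppercase ASCII letters, deduplicated and sorted).

```python
import re
from pathlib import Path

# Whole words made of 2 to 10 uppercase ASCII letters.
ABBREVIATION = re.compile(r"\b[A-Z]{2,10}\b")


def extract_abbreviations(text: str) -> list:
    """Return the sorted, unique uppercase words found in the text."""
    return sorted(set(ABBREVIATION.findall(text)))


def write_abbreviations(xsd_path: str, out_path: str) -> None:
    """Write one candidate abbreviation per line, as the -a option expects."""
    content = Path(xsd_path).read_text(encoding="utf-8")
    Path(out_path).write_text("\n".join(extract_abbreviations(content)) + "\n",
                              encoding="utf-8")
```

The result is only a candidate list: review it by hand before passing it to -a, since it will also pick up uppercase words that are not abbreviations.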

Using namespaced concept IRIs:

python -m xsd2rdf -x path/to/schema.xsd --namespaced-concepts

The abbreviations file passed with -a should contain one abbreviation per line. These abbreviations are kept in uppercase when human-readable labels are created from camelCase or PascalCase names.
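
To illustrate the effect of an abbreviations list, here is a minimal sketch of label generation. It is not the algorithm used by xsd2rdf. The function humanize, the word-splitting regular expression and the sentence-case output are all assumptions made for this example.

```python
import re

# Split camelCase / PascalCase into words: an uppercase run that precedes a
# capitalised word, a (capitalised) lowercase word, an uppercase run, or digits.
WORD = re.compile(r"[A-Z]+(?=[A-Z][a-z])|[A-Z]?[a-z]+|[A-Z]+|\d+")


def humanize(identifier: str, abbreviations=frozenset()) -> str:
    """Turn an identifier into a label, keeping known abbreviations uppercase."""
    known = {a.upper() for a in abbreviations}
    words = []
    for index, word in enumerate(WORD.findall(identifier)):
        if word.upper() in known:
            words.append(word.upper())       # preserved abbreviation
        elif index == 0:
            words.append(word.capitalize())  # sentence case: first word only
        else:
            words.append(word.lower())
    return " ".join(words)
```

Under these assumptions, humanize("trainIdNumber", {"ID"}) gives "Train ID number", while the same call without the abbreviation gives "Train id number".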

Validation

This feature is only available when working from a source checkout, as it is intended for development purposes.

Prerequisites:

  • Create sample data for validation as schema.xsd.sample.ttl in the same directory as the XSD file

To validate RDF data against SHACL shapes with SKOS concepts:

python shacl-validation.py path/to/schema.xsd

This will:

  1. Load the data from schema.xsd.sample.ttl
  2. Include all related SKOS files (schema.xsd.*.skos.ttl)
  3. Perform validation using the generated SHACL shapes (schema.xsd.shape.ttl)
  4. Report results in the command line
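
The four steps above can be sketched as follows. This is not the code of shacl-validation.py. It assumes the third-party packages rdflib and pyshacl, which the text above does not name, and it assumes that the SKOS files are merged into the data graph. The function names are hypothetical.

```python
from pathlib import Path


def collect_validation_files(xsd_file: str) -> dict:
    """Locate the sample data, shapes and SKOS files next to the XSD file."""
    xsd = Path(xsd_file)
    folder = xsd.parent
    return {
        "data": folder / f"{xsd.name}.sample.ttl",
        "shapes": folder / f"{xsd.name}.shape.ttl",
        "skos": sorted(folder.glob(f"{xsd.name}.*.skos.ttl")),
    }


def validate(xsd_file: str) -> bool:
    """Run SHACL validation and print the report; return True if conformant."""
    # Assumption: rdflib and pyshacl are installed. They are imported here
    # so that the file discovery above works without them.
    from rdflib import Graph
    from pyshacl import validate as shacl_validate

    files = collect_validation_files(xsd_file)

    # Steps 1 and 2: load the sample data and add the SKOS concepts to it.
    data = Graph()
    data.parse(str(files["data"]), format="turtle")
    for skos_file in files["skos"]:
        data.parse(str(skos_file), format="turtle")

    # Step 3: validate against the generated shapes.
    shapes = Graph()
    shapes.parse(str(files["shapes"]), format="turtle")
    conforms, _report_graph, report_text = shacl_validate(data, shacl_graph=shapes)

    # Step 4: report on the command line.
    print(report_text)
    return conforms
```

Merging the concepts into the data graph is one way to let shape constraints that refer to SKOS concepts find them during validation; the actual script may handle this differently.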

Example

Converting an XSD file with enumerations:

python -m xsd2rdf -x comparison/enumerations.xsd

Validating data using generated shapes and concepts:

python shacl-validation.py comparison/enumerations.xsd

License

EUPL 1.2
