Skip to main content

semantikon - Ontological type system

Project description

semantikon

Push-Pull Coverage

Logo

Motivation

Let's take a look at the following function:

def get_speed(distance: float, time: float) -> float:
    speed = distance / time
    return speed

For you as a human, it is clear that this is a function to calculate the speed for a given distance and a time. But for a computer, it is just a function that takes two floats and returns a float. The computer does not know what the inputs and outputs mean. This is where semantikon comes in. It provides a way to give scientific context to the inputs and outputs, as well as to the function itself.

Installation

You can install semantikon via pip:

pip install semantikon

You can also install semantikon via conda:

conda install -c conda-forge semantikon

Quick insight

In the realm of the workflow management systems, there are well defined inputs and outputs for each node. semantikon is a Python package to give scientific context to python functions by type annotations and decorators, which can then be translated into an rdflib knowledge graph. semantikon utilizes PMD core ontology (PMDco), which is based on the Basic Formal Ontology (BFO) and Ontology for Biomedical Investigations (OBI).

semantikon provides a way to define types for input and output parameters for function via type hinting. Type hinting is done with the function u, which requires the type, and optionally you can define ontological metadata. The type hinting is done in the following way:

>>> from rdflib import Namespace
>>> from semantikon import u
>>>
>>> EX = Namespace("http://example.org/")
>>>
>>> def get_speed(
...     distance: u(float, uri=EX.distance, units="meter"),
...     time: u(float, uri=EX.time, units="second"),
... ) -> u(float, uri=EX.speed, units="meter/second", label="speed"):
...     """some random docstring"""
...     speed = distance / time
...     return speed
>>> 
>>> 
>>> def get_kinetic_energy(
...     mass: u(float, units="kilogram", uri=EX.Mass),
...     velocity: u(float, units="meter/second", uri=EX.Velocity),
... ) -> u(float, units="joule", uri=EX.KineticEnergy):
...     return 0.5 * mass * velocity**2
>>> 
>>> 
>>> def my_kinetic_energy_workflow(
...     distance, time, mass
... ):
...     speed = get_speed(distance, time)
...     kinetic_energy = get_kinetic_energy(mass, speed)
...     return kinetic_energy

The workflow decorator from semantikon.workflow allows you to define a workflow that uses the above functions in semantikon. You can use any workflow management system that can export the workflow in the flowrep-format. Via semantikon.get_knowledge_graph you can extract a knowledge graph from the workflow. The knowledge graph schematically has the following structure:

graph TD
    get_speed["get_speed (obi:planned_process)"] -- bfo:has_part --> speed["speed (pmdco:output_assignment)"]
    speed -- pmdco:has_specified_output --> speed_data["speed_data (obi:value_specification)"]
    speed_data -- obi:specifies_value_of --> EX.speed
    speed_data -- qudt:hasUnit --> meter["meter (qudt:unit)"]
    velocity -- pmdco:has_specified_input --> speed_data
    get_kinetic_energy["get_kinetic_energy (obi:planned_process)"] -- bfo:has_part --> velocity["velocity (pmdco:input_assignment)"]

This is only the first insight into the knowledge graph. You can find the details in the notebook in the notebooks folder.

Side tour: unit conversion with pint

This part has nothing to do with the knowledge graph, but semantikon provides a way to interpret the types of inputs and outputs of a function via a decorator, in order to check consistency of the types and to convert them if necessary. Currently, semantikon provides an interpreter for pint.UnitRegistry objects. The interpreter is applied in the following way:

>>> from semantikon.metadata import u
>>> from semantikon.converter import units
>>> from pint import UnitRegistry
>>>
>>> @units
... def get_speed(
...     distance: u(float, units="meter"),
...     time: u(float, units="second")
... ) -> u(float, units="meter/second", label="speed"):
...     speed = distance / time
...     return speed
>>>
>>> ureg = UnitRegistry()
>>>
>>> print(get_speed(1 * ureg.meter, 1 * ureg.second))
1.0 meter / second

The interpreters check all types and, if necessary, convert them to the expected types before the function is executed, in order for all possible errors would be raised before the function execution. The interpreters convert the types in the way that the underlying function would receive the raw values.

In case there are multiple outputs, the type hints are to be passed as a tuple (e.g. tuple[u(float, units="meter"), u(float, units="second"))).

It is not fully guaranteed as a feature, but relative units as given on this page can be also used.

Interpreters can distinguish between annotated arguments and non-annotated arguments. If the argument is annotated, the interpreter will try to convert the argument to the expected type. If the argument is not annotated, the interpreter will pass the argument as is.

Regardless of whether type hints are provided, the interpreter acts only when the input values contain units and ontological types. If the input values do not contain units and ontological types, the interpreter will pass the input values to the function as is.

License

This project is licensed under the BSD 3-Clause License - see the LICENSE file for details.

Copyright (c) 2025, Max-Planck-Institut für Nachhaltige Materialien GmbH - Computational Materials Design (CM) Department

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

semantikon-1.0.0b3.tar.gz (134.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

semantikon-1.0.0b3-py3-none-any.whl (75.7 kB view details)

Uploaded Python 3

File details

Details for the file semantikon-1.0.0b3.tar.gz.

File metadata

  • Download URL: semantikon-1.0.0b3.tar.gz
  • Upload date:
  • Size: 134.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for semantikon-1.0.0b3.tar.gz
Algorithm Hash digest
SHA256 c629f0828dc20c7fa31e9213ce363d8aaaa4cd1846150768d603f3c4fa8bd877
MD5 a03ffdbf43fd2c36d69757798bee64a0
BLAKE2b-256 70c304f926141064c10d142ae8fd50f7976fedb65b00628d9af58934f080d1b8

See more details on using hashes here.

File details

Details for the file semantikon-1.0.0b3-py3-none-any.whl.

File metadata

  • Download URL: semantikon-1.0.0b3-py3-none-any.whl
  • Upload date:
  • Size: 75.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for semantikon-1.0.0b3-py3-none-any.whl
Algorithm Hash digest
SHA256 d12fae364a910ecb540c3d883399775db7cbfb840c471a8b5455013e2b9bf712
MD5 fce2c169a65431fec83c7a13d6d57289
BLAKE2b-256 9d0dc7207628343f1ac007f7863bd09a6b3fa01fd1b30fb03d833e5d73ab4dfe

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page