Semantiva is a modular and extensible framework designed to enable semantic transparency and ontology-driven processing for data operations.

Project description

Semantiva

Overview

Semantiva is an open-source, Python-based framework that unifies Domain-Driven Design, Type-Oriented Development, and semantic transparency to streamline data operations. It offers a structured way to define and process domain-specific data types and algorithms, ensuring clarity, consistency, and adaptability even in complex data-driven scenarios.

By enforcing type-safe relationships between data and algorithms, Semantiva simplifies the creation of transparent, interpretable workflows—enabling teams to focus on solving domain problems rather than battling ambiguous data models.

Key Principles

Domain-Driven Design (DDD)
- Aligns data types, algorithms, and operations with core domain concepts.
- Ensures each module speaks a consistent “domain language,” reducing misunderstandings and promoting maintainability.
Type-Oriented Development
- Establishes robust contracts between data and algorithms.
- Minimizes errors by validating data structures at definition time, preventing mismatches or incompatible operations.
Semantic Transparency
- Retains full traceability of how data is transformed and why particular operations are invoked.
- Facilitates clear, explainable workflows, valuable for QA, audits, or scientific reproducibility.
Modular & Extensible Architecture
- Supports adding new data types, algorithm types, and domain ontologies without disrupting existing components.
- Adapts naturally to diverse applications—ranging from basic string manipulations to advanced imaging pipelines or HPC-scale workloads.
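
The type-contract and traceability principles above can be illustrated with a minimal, framework-independent sketch. It does not use Semantiva's API: `TypedData`, `TextData`, `TypedOperation`, and `Uppercase` are hypothetical names introduced only for this example. The sketch assumes one plausible reading of "definition time", namely that an operation class missing its type declarations is rejected when the class is defined, while values are checked when an operation runs.

```python
from typing import Any, ClassVar, List, Type


class TypedData:
    """Hypothetical base class: wraps a value and validates it on construction."""

    def __init__(self, data: Any) -> None:
        self.validate(data)
        self.data = data

    def validate(self, data: Any) -> None:
        raise NotImplementedError


class TextData(TypedData):
    def validate(self, data: Any) -> None:
        if not isinstance(data, str):
            raise TypeError(f"TextData expects str, got {type(data).__name__}")


class TypedOperation:
    """Hypothetical operation that declares its input and output types."""

    input_type: ClassVar[Type[TypedData]]
    output_type: ClassVar[Type[TypedData]]

    def __init_subclass__(cls, **kwargs: Any) -> None:
        # Runs when a subclass is defined: reject missing or invalid contracts.
        super().__init_subclass__(**kwargs)
        for attr in ("input_type", "output_type"):
            declared = getattr(cls, attr, None)
            if not (isinstance(declared, type) and issubclass(declared, TypedData)):
                raise TypeError(f"{cls.__name__}.{attr} must be a TypedData subclass")

    def __init__(self) -> None:
        self.trace: List[str] = []  # human-readable record of each call

    def __call__(self, data: TypedData) -> TypedData:
        if not isinstance(data, self.input_type):
            raise TypeError(f"expected {self.input_type.__name__}")
        result = self._operation(data)
        if not isinstance(result, self.output_type):
            raise TypeError(f"expected {self.output_type.__name__} as output")
        self.trace.append(f"{type(self).__name__}: {data.data!r} -> {result.data!r}")
        return result

    def _operation(self, data: TypedData) -> TypedData:
        raise NotImplementedError


class Uppercase(TypedOperation):
    input_type = TextData
    output_type = TextData

    def _operation(self, data: TextData) -> TextData:
        return TextData(data.data.upper())


op = Uppercase()
result = op(TextData("wafer"))
print(result.data)  # WAFER
print(op.trace)     # ["Uppercase: 'wafer' -> 'WAFER'"]
```

The `trace` list is a deliberately simple stand-in for traceability: each call records which operation ran and what it turned into what.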

Why Semantiva?

Clarity & Consistency: Well-defined semantics for data and algorithms ensure that everyone understands precisely how information flows and transforms.
Adaptive Workflows: Easily extend pipelines with new steps or data types, minimizing rework when domain requirements evolve.
Scalability & HPC Integration: Abstract base classes and a pipeline-oriented design let users scale operations seamlessly, whether on local machines or high-performance clusters.
Interdisciplinary Collaboration: A shared language of data and algorithm types fosters better communication across physics, mathematics, engineering, and software teams.

Core Components

Data Operations
- Abstract classes that enforce type-safe transformations, ensuring data flows remain coherent and domain-accurate.
Context Operations
- Manages contextual or environmental information affecting data processing, enhancing adaptability and domain awareness.
Payload Operations (Pipelines)
- Orchestrates the execution of multiple operations, combining data transformations and context adaptations into a coherent workflow.
Data Types & Algorithm Types
- Defines the structure and constraints of domain-specific data, alongside compatible algorithms (e.g., Image ↔ ImageAlgorithm), guaranteeing semantic integrity.
Execution Tools
- Utilities for executing, monitoring, and debugging pipelines, supporting straightforward deployment and scaling.
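
How these components fit together can be sketched in plain Python. The following is not Semantiva's implementation: `MiniPipeline`, `passthrough`, and `count_step` are hypothetical names, and the only detail borrowed from the project is the idea that a pipeline takes data plus a context and returns both. Each node pairs a data operation with a context operation.

```python
from typing import Any, Callable, Dict, List, Tuple

Context = Dict[str, Any]
DataOp = Callable[[Any], Any]
ContextOp = Callable[[Context], Context]


def passthrough(context: Context) -> Context:
    """Context operation that leaves the context unchanged."""
    return context


def count_step(context: Context) -> Context:
    """Context operation that records how many counted nodes have run."""
    return {**context, "steps": context.get("steps", 0) + 1}


class MiniPipeline:
    """Hypothetical orchestrator: runs (data op, context op) pairs in order."""

    def __init__(self, nodes: List[Tuple[DataOp, ContextOp]]) -> None:
        self.nodes = nodes

    def process(self, data: Any, context: Context) -> Tuple[Any, Context]:
        for data_op, context_op in self.nodes:
            data = data_op(data)            # transform the data
            context = context_op(context)   # adapt the surrounding context
        return data, context


pipeline = MiniPipeline([
    (str.strip, passthrough),
    (str.title, count_step),
])
output, final_context = pipeline.process("  hello semantiva ", {})
print(output)         # Hello Semantiva
print(final_context)  # {'steps': 1}
```

Keeping the context separate from the data means a node can change one without touching the other, which is the separation the component list describes.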

License

Semantiva is released under the MIT License, promoting collaborative development and broad adoption.

Getting Started: A Minimal Example

Below is a quick demonstration showing how Semantiva can handle a simple string data type and a matching algorithm. For more advanced domains—like imaging, wafer metrology, or large-scale simulations—users can define new data and algorithm types to match their specific needs.

# 1) Define StringLiteralDataType
from semantiva.data_types import BaseDataType

class StringLiteralDataType(BaseDataType):
    def __init__(self, data: str):
        super().__init__(data)

    def validate(self, data):
        assert isinstance(data, str), "Data must be a string."


# 2) Create a StringLiteralAlgorithm
from semantiva.data_operations import AlgorithmTopologyFactory

StringLiteralAlgorithm = AlgorithmTopologyFactory.create_algorithm(
    input_type=StringLiteralDataType,
    output_type=StringLiteralDataType,
    class_name="StringLiteralAlgorithm",
)


# 3) Define an Operation Extending StringLiteralAlgorithm
class HelloOperation(StringLiteralAlgorithm):
    def _operation(self, data: StringLiteralDataType) -> StringLiteralDataType:
        return StringLiteralDataType(f"Hello, {data.data}")


# 4) Build a Minimal Pipeline
from semantiva.payload_operations import Pipeline
from semantiva.context_operations import ContextPassthrough

node_configurations = [
    {
        "operation": HelloOperation,
        "parameters": {},
        "context_operation": ContextPassthrough,
    },
]

if __name__ == "__main__":
    pipeline = Pipeline(node_configurations)
    input_data = StringLiteralDataType("World!")
    output_data, _ = pipeline.process(input_data, {})
    print("Pipeline completed. Final output:", output_data.data)  # "Hello, World!"

Key Takeaways

Strong Type Contracts: The StringLiteralDataType enforces the string constraint; incompatible data will fail early.
Algorithm-Data Alignment: HelloOperation inherits from StringLiteralAlgorithm, ensuring it can only act on StringLiteralDataType.
Scalable Pipeline: Extend this structure with domain-specific types (e.g., Image, Spectrum, AudioClip) and matching algorithms as needs grow.
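
To make the algorithm-data alignment more concrete, here is a rough sketch of how a factory could bind an algorithm base class to specific types. This is not the code behind `AlgorithmTopologyFactory`: the `create_algorithm` function below is a hypothetical stand-in built on Python's `type()`, and `Spectrum` and `Normalize` are invented for illustration.

```python
from typing import Any, Type


class Spectrum:
    """Hypothetical domain type: a non-empty list of numeric intensities."""

    def __init__(self, data: Any) -> None:
        if not isinstance(data, list) or not data:
            raise TypeError("Spectrum expects a non-empty list")
        if not all(isinstance(x, (int, float)) for x in data):
            raise TypeError("Spectrum values must be numeric")
        self.data = data


def create_algorithm(input_type: Type, output_type: Type, class_name: str) -> Type:
    """Build a base class whose process() checks types around _operation()."""

    def process(self: Any, data: Any) -> Any:
        if not isinstance(data, input_type):
            raise TypeError(
                f"{type(self).__name__} expects {input_type.__name__}, "
                f"got {type(data).__name__}"
            )
        result = self._operation(data)
        if not isinstance(result, output_type):
            raise TypeError(
                f"{type(self).__name__} must return {output_type.__name__}"
            )
        return result

    # type() creates the class dynamically, with process() as its only method.
    return type(class_name, (), {"process": process})


SpectrumAlgorithm = create_algorithm(Spectrum, Spectrum, "SpectrumAlgorithm")


class Normalize(SpectrumAlgorithm):
    def _operation(self, data: Spectrum) -> Spectrum:
        peak = max(data.data)  # assumes a non-zero peak
        return Spectrum([x / peak for x in data.data])


normalized = Normalize().process(Spectrum([1, 2, 4]))
print(normalized.data)  # [0.25, 0.5, 1.0]
```

Passing anything other than a `Spectrum` to `Normalize().process` raises `TypeError` before `_operation` runs, which is the "fail early" behavior described above.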

Summary

Semantiva delivers a structured, type-safe, and domain-driven environment for designing adaptable data pipelines. By emphasizing semantic transparency and explicit domain alignment, it reduces cognitive load, fosters cross-disciplinary collaboration, and enables confident scaling to more complex or HPC-intensive problems—without sacrificing clarity or maintainability. Whether implementing straightforward text operations or tackling sophisticated scientific and industrial tasks, Semantiva equips developers and researchers with the tools to build robust, interpretable, and future-ready data solutions.

Acknowledgments

This framework draws inspiration from the rigorous demands of transparency and traceability in data-driven systems, particularly exemplified by the ALICE O2 project at CERN. The lessons learned from managing large-scale, high-throughput data in that environment—combined with the need for robust, domain-aligned workflows—shaped Semantiva’s emphasis on type-safe design, semantic clarity, and modular extensibility. By blending these concepts with principles of ontology-driven computing, Semantiva aims to deliver the same level of reliability and interpretability for any domain requiring advanced data processing and HPC integration.

Project details

Release history

- 0.5.0 (Nov 15, 2025)
- 0.5.0rc11, pre-release (Nov 10, 2025)
- 0.4.0 (Jun 8, 2025)
- 0.4.0rc1, pre-release (Jun 7, 2025)
- 0.3.0 (Mar 11, 2025)
- 0.2.1.dev0, pre-release (Mar 11, 2025)
- 0.2.0 (Jan 27, 2025), this version
- 0.1.1 (Jan 21, 2025)
- 0.1.0 (Jan 20, 2025)
