Reason this release was yanked: potential conflict
Ontology-Transformer
End-to-end ontology embedding via fine-tuning sentence transformers with hyperbolic geometry and role-based rotation for embedding EL concepts (e.g., ∃r.C).
Features
- One-line training: OntologyTransformer.fit("ontology.owl") → fine-tuned embeddings
- Hyperbolic space: Poincaré ball embeddings for hierarchical structures
- Role-aware existential restrictions: ∃r.C encoded via learned rotation transformations
- LoRA fine-tuning: Parameter-efficient fine-tuning via LoRA adapters (PEFT) — trains <5% of parameters
- Automatic data preparation: Converts OWL/OFN axioms to training data (no manual preprocessing)
- Best lambda auto-tuning: Centripetal weight λ optimized on evaluation data and saved with the model
- Flexible evaluation: Use training ontology samples or separate eval/test ontologies
Installation
pip install ontology-transformer
For LoRA (parameter-efficient fine-tuning) support:
pip install ontology-transformer[lora]
Requirements
- Python ≥ 3.9
- PyTorch ≥ 2.0 (with CUDA recommended)
- sentence-transformers, geoopt, deeponto, datasets
- peft (optional, for LoRA fine-tuning)
Quick Start
1. End-to-end: OWL → Fine-tune → Embeddings
from ont import OntologyTransformer
# Train on any OWL/OFN ontology (all axioms used for training)
model = OntologyTransformer.fit(
owl_path="path/to/ontology.owl",
output_dir="./output",
num_epochs=3,
batch_size=64, # training batch size (sentences per step)
eval_batch_size=32, # evaluation batch size (queries scored per step)
eval_ratio=0.1, # 10% of axioms sampled for evaluation
max_eval=1000, # max 1000 eval samples
)
# The best lambda (centripetal weight) is determined during training
print(f"Best lambda: {model.best_lambda}")
# Encode concepts
emb = model.encode("food product")
# Encode ∃r.C (existential restrictions) via role rotation
exist_emb = model.encode_existence("has ingredient", "sugar")
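The two embeddings above can then be compared with a hyperbolic distance. This is a minimal sketch, assuming encode/encode_existence return points inside the Poincaré ball (as arrays or tensors); it uses geoopt's PoincareBall manifold rather than any built-in scoring function of this package:

import torch
import geoopt

# Sketch: hyperbolic distance between "food product" and ∃ has ingredient.sugar,
# assuming both embeddings are valid points in the Poincaré ball.
ball = geoopt.PoincareBall()
a = torch.as_tensor(emb, dtype=torch.float32)
b = torch.as_tensor(exist_emb, dtype=torch.float32)
print(float(ball.dist(a, b)))  # smaller distance ≈ more closely related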
2. Use separate ontologies for evaluation/testing
model = OntologyTransformer.fit(
owl_path="train_ontology.owl",
eval_owl_path="eval_ontology.owl", # optional: separate eval ontology
test_owl_path="test_ontology.owl", # optional: separate test ontology
output_dir="./output",
num_epochs=3,
)
3. Load a pre-trained model
from ont import OntologyTransformer
# Load model (best_lambda is automatically restored)
model = OntologyTransformer.from_pretrained("./output/final")
print(f"Loaded best_lambda: {model.best_lambda}")
# Encode
emb = model.encode("heart disease")
exist_emb = model.encode_existence("has part", "cell membrane")
4. CLI
# Basic training
ont-train --owl ontology.owl --output ./output --epochs 3
# With explicit batch sizes
ont-train --owl ontology.owl --output ./output \
--epochs 20 --batch-size 256 --eval-batch-size 64
# With separate eval ontology
ont-train --owl train.owl --eval-owl eval.owl --output ./output --epochs 3
# Balanced mode (adds C_neg contrastive loss)
ont-train --owl ontology.owl --output ./output --balanced --epochs 3
# LoRA fine-tuning (parameter-efficient)
ont-train --owl ontology.owl --output ./output --use-lora --epochs 3
# LoRA with custom hyperparameters
ont-train --owl ontology.owl --output ./output --use-lora \
--lora-r 16 --lora-alpha 32 --lora-dropout 0.1 --epochs 3
5. LoRA Fine-tuning (Parameter-Efficient)
LoRA (Low-Rank Adaptation) freezes the base transformer weights and only trains small adapter matrices, reducing trainable parameters to <5% of the full model. This is especially useful for large base models or limited GPU memory.
from ont import OntologyTransformer
# LoRA fine-tuning — same API, just add use_lora=True
model = OntologyTransformer.fit(
owl_path="ontology.owl",
output_dir="./lora_output",
num_epochs=3,
use_lora=True, # enable LoRA
lora_r=16, # rank of low-rank matrices (default: 16)
lora_alpha=32, # scaling factor (default: 32)
lora_dropout=0.1, # dropout for LoRA layers (default: 0.1)
)
# The LoRA model is used exactly like a full fine-tuned model
emb = model.encode("clinical finding")
exist_emb = model.encode_existence("has finding site", "lung structure")
# Save & load works transparently (adapter weights auto-detected)
model.save("./my_lora_model")
loaded = OntologyTransformer.from_pretrained("./my_lora_model")
When to use LoRA vs. full fine-tuning:
| Aspect | Full fine-tuning | LoRA |
|---|---|---|
| Trainable params | 100% | <5% |
| Memory usage | High | Low |
| Training speed | Baseline | Faster |
| Best for | Small/medium ontologies, maximum quality | Large ontologies, limited GPU, quick experiments |
Key training parameters
| Parameter (Python) | CLI flag | Default | Description |
|---|---|---|---|
| num_epochs | --epochs | 1 | Number of training epochs |
| batch_size | --batch-size | 64 | Sentences per training step. Increase for larger GPUs (e.g. 256 on 40+ GB). |
| eval_batch_size | --eval-batch-size | 32 | Queries scored per evaluation step. Increase to speed up evaluation when GPU memory allows. |
| learning_rate | --lr | 1e-5 | Learning rate |
| balanced | --balanced | False | Add C_neg contrastive loss for existential restrictions |
| balanced_negatives | --balanced-negatives | 1 | Number of negative samples in balanced mode |
| eval_ratio | — | 0.1 | Fraction of axioms sampled for eval (Python API only) |
| max_eval | — | 1000 | Max number of eval samples (Python API only) |
| use_lora | --use-lora | False | Enable LoRA parameter-efficient fine-tuning |
| lora_r | --lora-r | 16 | LoRA rank (dimension of low-rank matrices) |
| lora_alpha | --lora-alpha | 32 | LoRA scaling factor |
| lora_dropout | --lora-dropout | 0.1 | LoRA dropout probability |
Data Preparation Flow
By default (no separate eval/test ontologies):
- All axioms from the input ontology → training data (train.jsonl, train_exist.jsonl, train_conj.jsonl)
- 10% of axioms (max 1000) randomly sampled → evaluation data (val.json)
- No test split created (unless test_owl_path is provided)
With external eval/test ontologies:
- eval_owl_path: evaluation data prepared from this ontology
- test_owl_path: test evaluation performed after training
This design ensures all available training data is used while still enabling hyperparameter tuning (best lambda) via evaluation.
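As a quick sanity check after training, the prepared records can be counted directly. This sketch assumes the prepared files land in a data/ subdirectory of the output directory (the directory that pipeline.fit() reuses on restart, per the 0.1.4 changelog), that each .jsonl line is one training record, and that val.json holds the eval samples as a single JSON collection:

import json
from pathlib import Path

# Hypothetical location of the prepared files; adjust to your output_dir layout.
data_dir = Path("./output/data")

for name in ["train.jsonl", "train_exist.jsonl", "train_conj.jsonl"]:
    path = data_dir / name
    if path.exists():
        n = len(path.read_text().splitlines())
        print(f"{name}: {n} records")

val_path = data_dir / "val.json"
if val_path.exists():
    print(f"val.json: {len(json.loads(val_path.read_text()))} eval samples")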
Training Modes
Non-balanced (default)
Standard contrastive loss on taxonomy + existential axioms:
- Clustering loss: push child closer to parent
- Centripetal loss: pull child away from non-ancestors
- Conjunction loss: C₁ ⊓ C₂ ⊑ D
- Existential loss: ∃r.C encoded via rotation
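Schematically, and as an assumption about how these terms are combined (the exact internal weighting is not documented here), the overall objective has the form L_total ≈ L_clustering + λ · L_centripetal + L_conjunction + L_existential, where λ is the centripetal weight that is tuned during training and stored as best_lambda.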
Balanced
Adds extra contrastive loss with negative concept samples (C_neg) for existential restrictions:
model = OntologyTransformer.fit(
owl_path="ontology.owl",
balanced=True,
balanced_negatives=5, # number of negative samples
)
Architecture
- Base model: SentenceTransformer fine-tuned in the Poincaré ball (hyperbolic space)
- Role model: Linear layer mapping role embeddings to rotation angles (rotation or transition mode)
- Existential encoding: ∃r.C = rotate(embed(C), f_r(embed(r)))
- Best lambda: Centripetal weight λ optimized on eval data, saved in wrapper_config.json
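To make the existential encoding concrete, here is a minimal, hypothetical sketch of rotating a concept embedding blockwise by role-derived angles. The function rotate_by_angles and the exact rotation scheme (one planar rotation per coordinate pair) are illustrative assumptions, not the package's internal implementation:

import torch

def rotate_by_angles(x: torch.Tensor, angles: torch.Tensor) -> torch.Tensor:
    # Rotate consecutive coordinate pairs (x[2i], x[2i+1]) of the concept
    # embedding by one angle per pair (a planar/Givens rotation per 2-D block).
    pairs = x.view(*x.shape[:-1], x.shape[-1] // 2, 2)
    cos, sin = torch.cos(angles), torch.sin(angles)
    x0, x1 = pairs[..., 0], pairs[..., 1]
    rotated = torch.stack((cos * x0 - sin * x1, sin * x0 + cos * x1), dim=-1)
    return rotated.view_as(x)

# Illustration with toy dimensions (d = 8):
concept = torch.randn(1, 8)    # stand-in for embed(C)
angles = torch.randn(1, 4)     # stand-in for f_r(embed(r)), one angle per pair
exist_embedding = rotate_by_angles(concept, angles)  # stand-in for ∃r.C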
Model Saving & Loading
Models are saved with:
- Base sentence transformer weights
- Role model weights (role_model.pt)
- Configuration (wrapper_config.json) including best_lambda
- Concept/role vocabularies
# Save
model.save("./my_model")
# Load (best_lambda automatically restored)
loaded = OntologyTransformer.from_pretrained("./my_model")
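The saved configuration can also be inspected directly. A small sketch, assuming wrapper_config.json is plain JSON and stores the tuned weight under a "best_lambda" key:

import json

with open("./my_model/wrapper_config.json") as f:
    cfg = json.load(f)
print(cfg.get("best_lambda"))  # same value as loaded.best_lambda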
Running Tests
# Install with test dependencies
pip install -e ".[test]"
# Run all tests
pytest tests/ -v
# Skip integration tests (large ontologies)
pytest tests/ -v -m "not integration"
# Run specific test
pytest tests/test_pipeline.py::TestPipeline::test_fit_tiny_owl -v
Examples
See examples/ directory for:
- Training on FoodOn, SNOMED CT, GALEN ontologies
- Evaluating embeddings for subsumption prediction
- Using external eval/test ontologies
Citation
If you use this package, please cite:
@inproceedings{yang2025language,
title={Language Models as Ontology Encoder},
author={Yang, Hui and Chen, Jiaoyan and Horrocks, Ian},
booktitle={International Semantic Web Conference (ISWC)},
year={2025},
organization={Springer}
}
GitHub: https://github.com/HuiYang1997/OnT
Changelog
0.2.0 (2026-04-09)
- Feature: LoRA fine-tuning support — parameter-efficient fine-tuning via PEFT LoRA adapters. Use use_lora=True in the Python API or --use-lora in the CLI. Trains <5% of parameters with minimal quality loss. Requires the peft package (pip install ontology-transformer[lora]).
- Feature: SNOMED CT sub-ontology extraction — new script scripts/extract_snomed_sub.py to extract medium-sized sub-ontologies from SNOMED CT for development and testing.
- LoRA models auto-detected during loading (from_pretrained handles both full and adapter models transparently).
- CLI updated with --use-lora, --lora-r, --lora-alpha, --lora-dropout options.
0.1.5 (2026-04-02)
- Updated README
0.1.4
- Fix: axiom duplication in data preparation — create_dataset() previously counted every axiom twice because getImportsClosure() already includes the ontology itself. The duplicated data inflated the training set size 2–3× and degraded embedding quality.
- Fix: OOM during evaluation on large ontologies — OnTEvaluator now scores candidates in GPU chunks (cand_chunk_size=4096, configurable) instead of broadcasting the full (batch, N, dim) tensor, eliminating OOM errors for ontologies with 100K+ concepts (e.g. SNOMED CT, ~364K concepts).
- Improvement: skip repeated data preparation — pipeline.fit() reuses the already-prepared data/ directory on restart, avoiding the ~5-minute OWL parsing step when resuming crashed runs.
0.1.2
- Initial public release.
License
Apache License 2.0 - see LICENSE file for details.