OntoLearner: A Modular Python Library for Ontology Learning with LLMs


OntoLearner is a modular and extensible framework designed to support ontology learning and reuse. Its conceptual and functional architecture, shown in the figure below, comprises three core components: Ontologizers, Learning Tasks, and Learner Models, structured to enable reusable and customizable ontology engineering workflows.

🧪 Installation

OntoLearner is available on PyPI and can be installed with pip:

pip install ontolearner

Next, verify the installation:

import ontolearner

print(ontolearner.__version__)

Please refer to the Installation page for further options.

🔗 Essential Resources

Resource Info
📚 OntoLearner Documentation OntoLearner's extensive documentation website.
🤗 Datasets on Hugging Face Access curated, machine-readable ontologies.
Quick Tour on OntoLearner OntoLearner hands-on Colab tutorials (Open in Colab).
🚀 Quickstart Get started quickly with OntoLearner’s main features and workflow.
🕸️ Learning Tasks Explore supported ontology learning tasks like LLMs4OL Paradigm tasks and Text2Onto.
🧠 Learner Models Browse and configure various learner models, including LLMs, Retrieval, or RAG approaches.
📚 Ontologies Documentations Review benchmark ontologies and datasets used for evaluation and training.
🧩 How to work with Ontologizer? Learn how to modularize and preprocess ontologies using the Ontologizer module.

🚀 Quick Tour

Get started with OntoLearner in just a few lines of code. This guide demonstrates how to initialize ontologies, load datasets, and train an LLM-assisted learner for ontology engineering tasks.

Basic Usage - Automatic Download from Hugging Face:

from ontolearner import Wine

# 1. Initialize an ontologizer from OntoLearner
ontology = Wine()

# 2. Load the ontology automatically from HuggingFace
ontology.load()

# 3. Extract the learning task dataset
data = ontology.extract()

To see the ontology metadata, print the ontology object:

print(ontology)

Now, explore 150+ ready-to-use ontologies or read about how to work with ontologizers.
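Extracted datasets are plain Python objects that can be split and fed to learners. As a rough, hypothetical illustration (this is not the library's actual data model), a relation dataset can be pictured as (head, relation, tail) triples, and a seeded 80/20 split in the style of train_test_split(..., test_size=0.2, random_state=42) can be sketched in pure Python as:

```python
import random

# Toy stand-in for an extracted relation dataset: (head, relation, tail) triples.
# The real extract() output is task-specific; this only illustrates the shape.
triples = [
    ("Wine", "hasColor", "Red"),
    ("Wine", "madeFrom", "Grape"),
    ("Riesling", "isA", "WhiteWine"),
    ("Merlot", "isA", "RedWine"),
    ("Grape", "grownIn", "Region"),
]

def seeded_split(data, test_size=0.2, random_state=42):
    """Shuffle deterministically, then slice off the last test_size fraction."""
    rng = random.Random(random_state)
    shuffled = data[:]  # copy so the input list is untouched
    rng.shuffle(shuffled)
    cut = int(len(shuffled) * (1 - test_size))
    return shuffled[:cut], shuffled[cut:]

train, test = seeded_split(triples)
print(len(train), len(test))  # 4 1
```

Fixing random_state makes the split reproducible across runs, which is why the examples in the next section pass random_state=42.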

Learner Models:

from ontolearner import AutoRetrieverLearner, AgrO, train_test_split, evaluation_report

# 1. Programmatic import of an ontology
ontology = AgrO()
ontology.load()

# 2. Load tasks datasets
ontological_data = ontology.extract()

# 3. Split into train and test sets
train_data, test_data = train_test_split(ontological_data, test_size=0.2, random_state=42)

# 4. Initialize Learner
task = 'non-taxonomic-re'
ret_learner = AutoRetrieverLearner(top_k=5)
ret_learner.load(model_id='sentence-transformers/all-MiniLM-L6-v2')

# 5. Fit the model to the training data and predict on the test set
ret_learner.fit(train_data, task=task)
predicts = ret_learner.predict(test_data, task=task)

# 6. Evaluation
truth = ret_learner.tasks_ground_truth_former(data=test_data, task=task)
metrics = evaluation_report(y_true=truth, y_pred=predicts, task=task)
print(metrics)
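The evaluation_report call above returns task-appropriate metrics. As a simplified, hypothetical stand-in (not the library's implementation), micro-averaged precision, recall, and F1 over predicted labels reduce to the following arithmetic:

```python
# Simplified stand-in for an evaluation_report-style summary: micro-averaged
# precision, recall, and F1 computed from predicted vs. gold labels.
def micro_prf(y_true, y_pred):
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == p)
    # With exactly one prediction per instance, micro precision equals recall;
    # the general formulas are kept here for clarity.
    precision = tp / len(y_pred) if y_pred else 0.0
    recall = tp / len(y_true) if y_true else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return {"precision": precision, "recall": recall, "f1": f1}

truth = ["isA", "hasPart", "isA", "relatedTo"]
preds = ["isA", "isA", "isA", "relatedTo"]
print(micro_prf(truth, preds))  # {'precision': 0.75, 'recall': 0.75, 'f1': 0.75}
```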

Other learners:

LearnerPipeline: OntoLearner also offers a streamlined LearnerPipeline class that consolidates initializing, training, predicting, and evaluating a RAG setup into a single call.

# Import core components from the OntoLearner library
from ontolearner import LearnerPipeline, AgrO, train_test_split

# Load the AgrO ontology, which includes structured agricultural knowledge
ontology = AgrO()
ontology.load()  # Load ontology data (e.g., entities, relations, metadata)

# Extract relation instances from the ontology and split them into training and test sets
train_data, test_data = train_test_split(
    ontology.extract(),      # Extract annotated (head, tail, relation) triples
    test_size=0.2,           # 20% for evaluation
    random_state=42          # Ensures reproducible splits
)

# Initialize the learning pipeline using a dense retriever
pipeline = LearnerPipeline(
    retriever_id='sentence-transformers/all-MiniLM-L6-v2',  # Hugging Face model ID for retrieval
    batch_size=10,       # Number of samples to process per batch (if batching is enabled internally)
    top_k=5              # Retrieve the top-5 most relevant support instances per query
)

# Run the pipeline on the training and test data
# The pipeline performs: fit() → predict() → evaluate() in sequence
outputs = pipeline(
    train_data=train_data,
    test_data=test_data,
    evaluate=True,           # If True, computes precision, recall, and F1-score
    task='non-taxonomic-re'  # Specifies that we are doing non-taxonomic relation prediction
)

# Print the evaluation metrics (precision, recall, F1)
print("Metrics:", outputs['metrics'])

# Print the total elapsed time for training and evaluation
print("Elapsed time:", outputs['elapsed_time'])

# Print the full output dictionary (includes predictions)
print(outputs)
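The retriever at the heart of this pipeline scores candidate support instances by embedding similarity and keeps the top-k. A minimal sketch of that idea, using toy bag-of-words vectors instead of a sentence-transformers model so it runs without extra dependencies (the real retriever's scoring and data structures will differ):

```python
import math
from collections import Counter

# Stand-in embedding: a bag-of-words count vector. A real dense retriever
# would call a sentence-transformers model here instead.
def embed(text):
    return Counter(text.lower().split())

def cosine(a, b):
    dot = sum(a[w] * b[w] for w in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def retrieve(query, corpus, top_k=2):
    """Rank corpus documents by similarity to the query; keep the top-k."""
    q = embed(query)
    scored = sorted(corpus, key=lambda doc: cosine(q, embed(doc)), reverse=True)
    return scored[:top_k]

corpus = [
    "merlot is a red wine",
    "riesling is a white wine",
    "grapes are grown in regions",
]
print(retrieve("which red wine", corpus, top_k=1))  # ['merlot is a red wine']
```

The top_k parameter in the pipeline above plays the same role: it bounds how many support instances are handed to the downstream predictor per query.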

⭐ Contribution

We welcome contributions to enhance OntoLearner and make it even better! Please review our contribution guidelines in CONTRIBUTING.md before getting started. You are also welcome to assist with the ongoing maintenance by referring to MAINTENANCE.md. Your support is greatly appreciated.

If you encounter any issues or have questions, please submit them in the GitHub issues tracker.

💡 Acknowledgements

If you find this repository helpful or use OntoLearner in your work or research, please cite our publications:

@inproceedings{babaei2023llms4ol,
  title={LLMs4OL: Large language models for ontology learning},
  author={Babaei Giglou, Hamed and D'Souza, Jennifer and Auer, S{\"o}ren},
  booktitle={International Semantic Web Conference},
  pages={408--427},
  year={2023},
  organization={Springer}
}

or:

@software{babaei_giglou_2025_15399783,
  author       = {Babaei Giglou, Hamed and D'Souza, Jennifer and Aioanei, Andrei and Mihindukulasooriya, Nandana and Auer, Sören},
  title        = {OntoLearner: A Modular Python Library for Ontology Learning with LLMs},
  month        = may,
  year         = 2025,
  publisher    = {Zenodo},
  version      = {v1.3.0},
  doi          = {10.5281/zenodo.15399783},
  url          = {https://doi.org/10.5281/zenodo.15399783},
}

This software is archived on Zenodo under DOI 10.5281/zenodo.15399783 and is licensed under the MIT License.
