
OntoGPT


Introduction

OntoGPT is a Python package for extracting structured information from text with large language models (LLMs), instruction prompts, and ontology-based grounding.

For more details, please see the full documentation.

Quick Start

OntoGPT runs on the command line, though there is also a minimal web app interface (see the Web Application section below).

  1. Ensure you have Python 3.10 or greater installed.

  2. Install with pip:

    pip install ontogpt
    
  3. Set your OpenAI API key:

    runoak set-apikey -e openai <your openai api key>
    
  4. See the list of all OntoGPT commands:

    ontogpt --help
    
  5. Try a simple example of information extraction:

    echo "One treatment for high blood pressure is carvedilol." > example.txt
    ontogpt extract -i example.txt -t drug
    

    OntoGPT will retrieve the necessary ontologies and write the results to the command line. All extracted objects will appear in the output under the heading extracted_object.
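
To keep the results, you can also write them to a file. A minimal sketch, assuming the extract command's -o/--output option (run ontogpt extract --help to confirm the available options):

# Write the extraction results to a file instead of the terminal
ontogpt extract -i example.txt -t drug -o results.yaml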

Web Application

There is a bare-bones web application for running OntoGPT and viewing results.

First, install the required dependencies with pip by running the following command:

pip install ontogpt[web]

Then run this command to start the web application:

web-ontogpt

NOTE: We do not recommend hosting this webapp publicly without authentication.

Model APIs

OntoGPT uses LiteLLM to interface with LLMs.

This means OntoGPT can work with a much broader range of providers than just OpenAI. If a provider and model are supported by the installed LiteLLM version, they will generally work in OntoGPT as well. This includes OpenAI, Azure OpenAI, Anthropic, Mistral, Groq, Cohere, Vertex AI, Replicate, and many others.

To find the name of a model, run ontogpt list-models and use the value in the first column with the --model option. In most cases, the most reliable form is a provider-qualified LiteLLM model name such as openai/gpt-4o, anthropic/claude-3-5-sonnet, groq/llama-3.1-8b-instant, or mistral/mistral-large-latest.
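
For example, the Quick Start extraction can be rerun against another provider by passing one of these names to --model. A minimal sketch, assuming the corresponding API key has already been configured as described below:

# Same extraction as in the Quick Start, using a provider-qualified model name
ontogpt extract -i example.txt -t drug --model anthropic/claude-3-5-sonnet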

Credential handling now follows LiteLLM first. Standard LiteLLM environment variables such as OPENAI_API_KEY, ANTHROPIC_API_KEY, GROQ_API_KEY, MISTRAL_API_KEY, AZURE_API_KEY, AZURE_API_BASE, and AZURE_API_VERSION are supported directly. For backward compatibility, OntoGPT also checks Oaklib credentials created with runoak set-apikey and passes them through to LiteLLM when the corresponding provider settings are missing.
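
For example, the LiteLLM-style variables may be exported directly before running OntoGPT (the key values below are placeholders):

export OPENAI_API_KEY="my-openai-api-key"
export ANTHROPIC_API_KEY="my-anthropic-api-key"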

Examples of setting Oaklib credentials with runoak:

runoak set-apikey -e openai <your openai api key>
runoak set-apikey -e anthropic-key <your anthropic api key>
runoak set-apikey -e mistral-key <your mistral api key>
runoak set-apikey -e groq-key <your groq api key>

Some endpoints, such as Azure OpenAI, require additional details. These may be set similarly:

runoak set-apikey -e azure-key <your azure api key>
runoak set-apikey -e azure-base <your azure endpoint url>
runoak set-apikey -e azure-version <your azure api version, e.g. "2023-05-15">

These details may also be set as environment variables as follows:

export AZURE_API_KEY="my-azure-api-key"
export AZURE_API_BASE="https://example-endpoint.openai.azure.com"
export AZURE_API_VERSION="2023-05-15"

If the provider is not encoded in the model name, use --model-provider to specify it explicitly. This is most common for OpenAI-compatible proxy endpoints.
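
A sketch of this pattern, with a hypothetical model name served from an OpenAI-compatible proxy (the proxy's credentials would still be supplied through the provider's environment variables):

# "my-proxied-model" is a placeholder; --model-provider tells LiteLLM how to route it
ontogpt extract -i example.txt -t drug --model my-proxied-model --model-provider openai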

For the current list of supported providers, model naming rules, and credential environment variables, see the LiteLLM documentation.

Open Models

Open LLMs may be retrieved and run locally with ollama (https://ollama.com/).

You will need to install ollama (see the GitHub repo), and you may need to start it as a service with a command like ollama serve or sudo systemctl start ollama.

Then retrieve a model with ollama pull <modelname>, e.g., ollama pull llama3.

To use the model in OntoGPT, prefix its name with ollama/ and pass it to the --model option, e.g., --model ollama/llama3.
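
For example, after pulling llama3 as above, the Quick Start extraction can be rerun locally:

# Run the Quick Start example against a local model served by ollama
ontogpt extract -i example.txt -t drug --model ollama/llama3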

Some ollama models may not be listed in ontogpt list-models, but the full list of downloaded LLMs can be seen with the ollama list command.

Evaluations

OntoGPT's functions have been evaluated on test data. Please see the full documentation for details on these evaluations and how to reproduce them.

Related Projects

  • TALISMAN, a tool for generating summaries of functions enriched within a gene set. TALISMAN uses OntoGPT to work with LLMs.

Tutorials and Presentations

  • Presentation: "Staying grounded: assembling structured biological knowledge with help from large language models" - presented by Harry Caufield as part of the AgBioData Consortium webinar series (September 2023)
  • Presentation: "Transforming unstructured biomedical texts with large language models" - presented by Harry Caufield as part of the BOSC track at ISMB/ECCB 2023 (July 2023)
  • Presentation: "OntoGPT: A framework for working with ontologies and large language models" - talk by Chris Mungall at Joint Food Ontology Workgroup (May 2023)

Citation

The information extraction approach used in OntoGPT, SPIRES, is described further in: Caufield JH, Hegde H, Emonet V, Harris NL, Joachimiak MP, Matentzoglu N, et al. Structured prompt interrogation and recursive extraction of semantics (SPIRES): A method for populating knowledge bases using zero-shot learning. Bioinformatics, Volume 40, Issue 3, March 2024, btae104, https://doi.org/10.1093/bioinformatics/btae104.

Acknowledgements

This project is part of the Monarch Initiative. We also gratefully acknowledge Bosch Research for their support of this research project.
