LLM plugin providing access to models running on an Ollama server

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

taketwo

These details have not been verified by PyPI

Project description

llm-ollama

LLM plugin providing access to models running on an Ollama server.

Installation

Install this plugin in the same environment as LLM.

llm install llm-ollama

Quickstart

By default, the plugin connects to a local Ollama server. Ensure the server is running and has some models available. Alternatively, the plugin can connect to a remote or managed Ollama cloud server; see Connecting to Ollama server for configuration instructions.

The plugin automatically discovers all models available on the server and registers them with LLM. To see the list of models and their capabilities, run:

llm ollama models

model                             digest          capabilities
gemma3:latest                     c0494fe00251    completion, vision
gpt-oss:120b-cloud                569662207105    completion, tools, thinking
mxbai-embed-large:latest          468836162de7    embedding
qwen3:4b                          2bfd38a7daaf    completion, tools, thinking

Once registered, models are available for prompting, chatting, and embedding. Assuming you have gemma3:latest available, you can run a prompt using:

llm -m gemma3:latest 'How much is 2+2?'

The plugin automatically creates shorter aliases for models that have :latest in the name, so the previous command is equivalent to running:

llm -m gemma3 'How much is 2+2?'

To start an interactive chat session instead of a one-shot prompt, run:

llm chat -m gemma3

Chatting with gemma3:latest
Type 'exit' or 'quit' to exit
Type '!multi' to enter multiple lines, then '!end' to finish
Type '!edit' to open your default editor and modify the prompt
Type '!fragment <my_fragment> [<another_fragment> ...]' to insert one or more fragments
>

Features

Image attachments

Multi-modal Ollama models can accept image attachments using the LLM attachments option:

llm -m llava "Describe this image" -a https://static.simonwillison.net/static/2024/pelicans.jpg

Tools

Ollama models with tools support can make use of LLM tools passed to them:

llm -m llama3.2 -T llm_time 'What is the time?' --td

The plugin also registers ollama_web_search and ollama_web_fetch tools that wrap the web search API provided by ollama.com. These tools augment models with the latest information to reduce hallucinations and improve accuracy. To use these tools, an API key must be configured (see Ollama cloud).

Embeddings

The plugin supports LLM embeddings. Both regular and specialized embedding models (such as mxbai-embed-large) can be used:

llm embed -m mxbai-embed-large -i README.md

By default, the input will be truncated from the end to fit within the context length. This behavior can be changed by setting OLLAMA_EMBED_TRUNCATE=no environment variable. In such cases, embedding operation will fail if the context length is exceeded.

JSON schemas

Ollama's built-in support for structured outputs can be accessed through LLM schemas, for example:

llm -m llama3.2 --schema "name, age int, one_sentence_bio" "invent a cool dog"

Async models

The plugin registers async LLM models suitable for use with Python asyncio.

To utilize an async model, retrieve it using llm.get_async_model() function instead of llm.get_model() and then await the response:

import asyncio, llm

async def run():
    model = llm.get_async_model("llama3.2:latest")
    response = model.prompt("A short poem about tea")
    print(await response.text())

asyncio.run(run())

Model aliases

The same Ollama model may be referenced by several names with different tags. For example, in the following list, there is a single unique model with three different names:

ollama list
NAME                    ID              SIZE    MODIFIED
stable-code:3b          aa5ab8afb862    1.6 GB  9 hours ago
stable-code:code        aa5ab8afb862    1.6 GB  9 seconds ago
stable-code:latest      aa5ab8afb862    1.6 GB  14 seconds ago

In such cases, the plugin will register a single model and create additional aliases. Continuing the previous example, this is what LLM will have:

llm models
...

Ollama: stable-code:3b (aliases: stable-code:code, stable-code:latest, stable-code)

Model options

All models accept Ollama modelfile parameters as options. Use the -o name value syntax to specify them, for example:

-o temperature 0.8: set the temperature of the model
-o num_ctx 256000: set the size of the context window used to generate the next token

See the referenced page for the complete list with descriptions and default values.

Additionally, the -o flag supports plugin-specific options:

-o json_object 1 forces the model to reply with a valid JSON object. Note that your prompt must mention JSON for this to work;
-o think false disables the intermediate reasoning step for thinking-capable models.

Connecting to Ollama server

The plugin connects to an Ollama server to list and run models. Three deployment scenarios are supported: a local server, a self-hosted remote server, and Ollama's hosted cloud service.

Local server

By default, the plugin connects to a local Ollama server at localhost:11434. If your local server runs on a non-default port, set OLLAMA_HOST:

export OLLAMA_HOST=http://localhost:8080

Remote server

To connect to a self-hosted Ollama server on your network or in the cloud, set OLLAMA_HOST to its address:

export OLLAMA_HOST=https://192.168.1.13:11434

Authentication

If the server is protected with Basic Authentication, include the credentials in the URL:

export OLLAMA_HOST=https://username:password@192.168.1.13:11434

Special characters in usernames or passwords must be URL-encoded:

# For username "user@domain" and password "p@ssw0rd"
export OLLAMA_HOST=https://user%40domain:p%40ssw0rd@192.168.1.13:11434

If the server is behind a reverse proxy that requires custom headers, use the OLLAMA_HEADERS environment variable with a comma-separated list of key=value pairs:

# JWT token auth (e.g. Open-WebUI's Ollama endpoint)
export OLLAMA_HEADERS='Authorization=Bearer mytoken,User-Agent=custom-client'

# Cloudflare Tunnel with a Service Token
export OLLAMA_HEADERS='CF-Access-Client-Id=abcdef.access,CF-Access-Client-Secret=123456789'

Ollama cloud

Ollama cloud is a hosted service that lets you run models without installing or operating a local server. It offers a range of open models and does not require any local setup beyond configuring the plugin.

To use it, point the plugin at the cloud endpoint and provide your API key:

export OLLAMA_HOST=https://ollama.com
llm keys set ollama # paste your API key when prompted

The API key is stored securely by llm and used automatically. Alternatively, you can set it as an environment variable:

export OLLAMA_API_KEY=your-api-key

Development

Setup

To set up this plugin locally, first checkout the code. Then create a new virtual environment and install the dependencies. If you are using uv:

cd llm-ollama
uv venv
uv pip install -e '.[test,lint]'

Otherwise, if you prefer using standard tools:

cd llm-ollama
python3 -m venv .venv
pip install -e '.[test,lint]'

Testing and linting

To test or lint the code, first activate the environment:

source .venv/bin/activate

The environment includes llm; it will pick up the local version of the plugin, which is useful for manual testing.

To run automated unit and integration tests:

python -m pytest

Integration tests require a running Ollama server and will be:

Enabled automatically if an Ollama server is available;
Skipped if Ollama server is unavailable;
Force-enabled with --integration (but fail if Ollama server is unavailable);
Force-disabled with --no-integration.

To format the code:

python -m ruff format .

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

taketwo

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.16.0

Apr 19, 2026

0.15.1

Oct 29, 2025

0.15.0

Oct 13, 2025

0.14.0

Sep 12, 2025

0.13.0

Jul 28, 2025

0.12.0

Jul 5, 2025

0.11

May 29, 2025

0.11a0 pre-release

May 14, 2025

0.10.0

May 6, 2025

0.9.1

Mar 10, 2025

0.9.0

Mar 2, 2025

0.8.2

Jan 22, 2025

0.8.1

Dec 20, 2024

0.8.0

Dec 11, 2024

0.7.1

Nov 22, 2024

0.7.0

Nov 6, 2024

0.6.0

Oct 30, 2024

0.5.0

Jul 31, 2024

0.4.3

Jul 2, 2024

0.4.2

Jun 12, 2024

0.4.1

May 29, 2024

0.4.0

May 22, 2024

0.3.0

May 7, 2024

0.2.0

Jan 28, 2024

0.1.0

Jan 20, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

llm_ollama-0.16.0.tar.gz (25.1 kB view details)

Uploaded Apr 19, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

llm_ollama-0.16.0-py3-none-any.whl (16.9 kB view details)

Uploaded Apr 19, 2026 Python 3

File details

Details for the file llm_ollama-0.16.0.tar.gz.

File metadata

Download URL: llm_ollama-0.16.0.tar.gz
Upload date: Apr 19, 2026
Size: 25.1 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for llm_ollama-0.16.0.tar.gz
Algorithm	Hash digest
SHA256	`32b8ba5edc122ddaf87e6be07cc8e8bc795379fc727facf8e4ad6cd875072df2`
MD5	`a89e30c156524429618cc3a7e0df5d49`
BLAKE2b-256	`9bb3f19dbc6d940f84f15ca7f44b328169166326186995db09e0db12858875f4`

See more details on using hashes here.

Provenance

The following attestation bundles were made for llm_ollama-0.16.0.tar.gz:

Publisher: publish.yml on taketwo/llm-ollama

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: llm_ollama-0.16.0.tar.gz
- Subject digest: 32b8ba5edc122ddaf87e6be07cc8e8bc795379fc727facf8e4ad6cd875072df2
- Sigstore transparency entry: 1340773051
- Sigstore integration time: Apr 19, 2026
Source repository:
- Permalink: taketwo/llm-ollama@5c803bfdcbaf343a65a0b6823f35ab2771b98d57
- Branch / Tag: refs/tags/0.16.0
- Owner: https://github.com/taketwo
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@5c803bfdcbaf343a65a0b6823f35ab2771b98d57
- Trigger Event: release

File details

Details for the file llm_ollama-0.16.0-py3-none-any.whl.

File metadata

Download URL: llm_ollama-0.16.0-py3-none-any.whl
Upload date: Apr 19, 2026
Size: 16.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for llm_ollama-0.16.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`797e0fbb8bf226e1f9512fe40ad82bddaf057addd4e582003bf9c19189d60fd7`
MD5	`b60ffebc44326da91062601221b93125`
BLAKE2b-256	`1e027ffa5beaaabc101a950c792d349c008b9d3edd58f9670275c4b1e23f4a3a`

See more details on using hashes here.

Provenance

The following attestation bundles were made for llm_ollama-0.16.0-py3-none-any.whl:

Publisher: publish.yml on taketwo/llm-ollama

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: llm_ollama-0.16.0-py3-none-any.whl
- Subject digest: 797e0fbb8bf226e1f9512fe40ad82bddaf057addd4e582003bf9c19189d60fd7
- Sigstore transparency entry: 1340773063
- Sigstore integration time: Apr 19, 2026
Source repository:
- Permalink: taketwo/llm-ollama@5c803bfdcbaf343a65a0b6823f35ab2771b98d57
- Branch / Tag: refs/tags/0.16.0
- Owner: https://github.com/taketwo
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@5c803bfdcbaf343a65a0b6823f35ab2771b98d57
- Trigger Event: release

llm-ollama 0.16.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

llm-ollama

Installation

Quickstart

Features

Image attachments

Tools

Embeddings

JSON schemas

Async models

Model aliases

Model options

Connecting to Ollama server

Local server

Remote server

Authentication

Ollama cloud

Development

Setup

Testing and linting

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance