A Python library that meters Ollama usage to Revenium.

Revenium Middleware for Ollama

A middleware library for metering and monitoring Ollama API usage in Python applications.

Features

Precise Usage Tracking: Monitor tokens, costs, and request counts across all Ollama API endpoints
Seamless Integration: Drop-in middleware that works with minimal code changes
Flexible Configuration: Customize metering behavior to suit your application needs
Rich Metadata Support: Track usage by subscriber, organization, task type, and more

Installation

pip install revenium-middleware-ollama

Quick Start

For complete examples and setup instructions, see examples/README.md.

1. Create Project Directory

# Create project directory and navigate to it
mkdir my-ollama-project
cd my-ollama-project

2. Create Virtual Environment

# Create virtual environment
python -m venv .venv

# Activate virtual environment
source .venv/bin/activate  # On Windows: .venv\Scripts\activate

3. Install Package

pip install revenium-middleware-ollama

4. Configure Environment Variables

Create a .env file in your project directory:

# Revenium API keys
REVENIUM_METERING_API_KEY="hak_..."
REVENIUM_METERING_BASE_URL="https://api.revenium.ai"

# Ollama is local - no API key needed
# Ensure Ollama is running: ollama serve

# Optional: Enable debug logging
# REVENIUM_LOG_LEVEL="DEBUG"
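Because Ollama runs locally, it can help to confirm the server is reachable before making the first metered call. The following is a minimal sketch and not part of the middleware: `is_ollama_running` is a hypothetical helper, and the default address `http://localhost:11434` is assumed to be Ollama's standard local endpoint.

```python
import urllib.error
import urllib.request


def is_ollama_running(base_url="http://localhost:11434",
                      opener=urllib.request.urlopen, timeout=2.0):
    """Return True if an HTTP server answers successfully at base_url.

    Hypothetical helper; `opener` is injectable so the check can be tested
    without a live server.
    """
    try:
        with opener(base_url, timeout=timeout) as resp:
            return 200 <= resp.status < 300
    except (urllib.error.URLError, OSError):
        # Connection refused, timeout, DNS failure, or an HTTP error status
        return False
```

If this returns `False`, start the server with `ollama serve` and try again.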

5. Run Your First Example

Download and run an example from the repository:

curl -O https://raw.githubusercontent.com/revenium/revenium-middleware-ollama-python/main/examples/getting_started.py
python getting_started.py

Or use this simple code:

from dotenv import load_dotenv
load_dotenv()

import ollama
import revenium_middleware_ollama

# Ensure REVENIUM_METERING_API_KEY is set in your .env file
# Ensure Ollama is running (ollama serve)

response: ollama.ChatResponse = ollama.chat(
    model='qwen2.5:0.5b',
    messages=[{'role': 'user', 'content': 'Please verify you are ready to assist me.'}]
)
print(response['message']['content'])

That's it! Once the middleware is imported, it automatically meters all Ollama API calls. No other code changes are required.

The repository's examples directory includes:

examples/getting_started.py - Minimal example with automatic metering
examples/example_simple.py - Basic middleware usage
examples/example_metadata.py - Enhanced tracking with metadata
examples/example_streaming.py - Streaming responses
examples/embeddings_example.py - Text embeddings (single and batch)

Usage

Zero-Config Integration

Simply set your REVENIUM_METERING_API_KEY in your .env file and import the middleware. Your Ollama calls will be metered automatically:

from dotenv import load_dotenv
load_dotenv()

import ollama
import revenium_middleware_ollama

# Ensure REVENIUM_METERING_API_KEY is set in your .env file

response: ollama.ChatResponse = ollama.chat(
    model='qwen2.5:0.5b', messages=[
        {
            'role': 'user',
            'content': 'Why is the sky blue?',
        },
    ])
print(response['message']['content'])

The middleware automatically intercepts Ollama API calls and sends metering data to Revenium without requiring any changes to your existing code.
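To illustrate what "intercepting" a call can mean, here is a minimal sketch of the general wrapping technique. It is not the middleware's actual implementation: `FakeClient`, `install_metering`, and the `records` list are hypothetical names, and the `prompt_eval_count` and `eval_count` keys mirror the token counts that Ollama reports in its chat responses.

```python
import functools


class FakeClient:
    """Stand-in for an LLM client; returns a canned response with token counts."""

    def chat(self, model, messages):
        return {
            "message": {"role": "assistant", "content": "ok"},
            "prompt_eval_count": 5,
            "eval_count": 2,
        }


def install_metering(client, records):
    """Replace client.chat with a wrapper that records usage after each call."""
    original = client.chat

    @functools.wraps(original)
    def metered_chat(*args, usage_metadata=None, **kwargs):
        # usage_metadata is consumed here and never forwarded to the client
        response = original(*args, **kwargs)
        records.append({
            "model": kwargs.get("model", args[0] if args else None),
            "input_tokens": response.get("prompt_eval_count", 0),
            "output_tokens": response.get("eval_count", 0),
            "metadata": usage_metadata or {},
        })
        return response

    client.chat = metered_chat
    return client


records = []
client = install_metering(FakeClient(), records)
client.chat(model="qwen2.5:0.5b", messages=[{"role": "user", "content": "hi"}])
```

The caller's code is unchanged; only the function it reaches has been swapped, which is why existing call sites need no edits.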

Enhanced Tracking with Metadata

For more granular usage tracking and detailed reporting, add the usage_metadata parameter:

from dotenv import load_dotenv
load_dotenv()

import ollama
import revenium_middleware_ollama

response = ollama.chat(
    model='qwen2.5:0.5b', messages=[
        {
            'role': 'user',
            'content': 'Why is the sky blue?',
        },
    ],
    usage_metadata={
        "trace_id": "conv-28a7e9d4",
        "task_type": "summarize-customer-issue",
        "subscriber": {
            "id": "subscriberid-1234567890",
            "email": "user@example.com",
            "credential": {
                "name": "engineering-api-key",
                "value": "sk-1234567890abcdef"
            }
        },
        "organization_id": "acme-corp",
        "subscription_id": "startup-plan-Q1",
        "product_id": "saas-app-gold-tier",
        "agent": "support-agent",
    },
)
print(response['message']['content'])
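Since the metadata fields are optional, one convenient pattern is to assemble the dictionary from whatever identifiers are available and drop the rest. This is only a sketch: `build_usage_metadata` is a hypothetical helper, not part of the middleware, and it covers just a subset of the documented keys.

```python
def build_usage_metadata(trace_id=None, task_type=None, subscriber_id=None,
                         subscriber_email=None, organization_id=None):
    """Build a usage_metadata dict, omitting any field that was not provided."""
    subscriber = {"id": subscriber_id, "email": subscriber_email}
    subscriber = {k: v for k, v in subscriber.items() if v is not None}

    metadata = {
        "trace_id": trace_id,
        "task_type": task_type,
        "subscriber": subscriber or None,  # drop the object if it is empty
        "organization_id": organization_id,
    }
    return {k: v for k, v in metadata.items() if v is not None}


metadata = build_usage_metadata(task_type="summarize-customer-issue",
                                subscriber_email="user@example.com")
print(metadata)
```

The result can be passed directly as `usage_metadata=metadata` in a call like the one above.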

Decorator Support

The middleware provides powerful decorators for automatic metadata injection and selective metering, just like the OpenAI middleware.

`@revenium_metadata`

Automatically injects metadata into all Ollama API calls within a function:

import ollama
import revenium_middleware_ollama  # importing the middleware enables metering
from revenium_middleware import revenium_metadata

@revenium_metadata(
    trace_id="session-12345",
    task_type="customer-support",
    organization_id="acme-corp"
)
def handle_customer_query(question: str):
    response = ollama.chat(
        model="qwen2.5:0.5b",
        messages=[{"role": "user", "content": question}]
    )
    return response["message"]["content"]
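One common way to make decorator-supplied metadata visible to calls made deeper in the stack is a context variable. The sketch below shows that general technique only. It is an assumption about how such a decorator could work, not the library's implementation, and `with_metadata` and `current_metadata` are hypothetical names.

```python
import contextvars
import functools

_metadata_var = contextvars.ContextVar("metadata", default=None)


def current_metadata():
    """Return a copy of the metadata active in the current context."""
    return dict(_metadata_var.get() or {})


def with_metadata(**fields):
    """Decorator: merge fields into the active metadata while the function runs."""
    def decorator(func):
        @functools.wraps(func)
        def wrapper(*args, **kwargs):
            merged = {**current_metadata(), **fields}
            token = _metadata_var.set(merged)
            try:
                return func(*args, **kwargs)
            finally:
                _metadata_var.reset(token)  # restore the outer metadata
        return wrapper
    return decorator


@with_metadata(trace_id="session-12345", task_type="customer-support")
def handle_customer_query(question):
    # A metering layer could read current_metadata() at call time.
    return current_metadata()


seen = handle_customer_query("Where is my order?")
```

Resetting the token in `finally` keeps metadata from leaking out of the decorated function, even when it raises.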

`@revenium_meter()` - Selective Metering

Control which functions are metered by enabling selective metering mode. When REVENIUM_SELECTIVE_METERING=true, only calls made inside functions decorated with @revenium_meter() are metered.

IMPORTANT: To use selective metering, you MUST set the environment variable:

# In your .env file or environment
REVENIUM_SELECTIVE_METERING=true

How it works:

When REVENIUM_SELECTIVE_METERING=false (default): ALL Ollama API calls are automatically metered
When REVENIUM_SELECTIVE_METERING=true: ONLY calls inside @revenium_meter() decorated functions are metered

Example:

import ollama
import revenium_middleware_ollama  # importing the middleware enables metering
from revenium_middleware import revenium_meter, revenium_metadata

# Set in .env file:
# REVENIUM_SELECTIVE_METERING=true

@revenium_meter()
@revenium_metadata(task_type="premium-feature")
def premium_feature(prompt: str):
    # Metered: the call runs inside a @revenium_meter() function
    response = ollama.chat(
        model="qwen2.5:0.5b",
        messages=[{"role": "user", "content": prompt}]
    )
    return response["message"]["content"]

def free_feature(prompt: str):
    # Not metered while REVENIUM_SELECTIVE_METERING=true
    response = ollama.chat(
        model="qwen2.5:0.5b",
        messages=[{"role": "user", "content": prompt}]
    )
    return response["message"]["content"]
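The two metering modes reduce to a small decision rule. The sketch below restates it as a pure function. `should_meter` is a hypothetical helper, and treating only the case-insensitive string `true` as enabling selective mode is an assumption, since this README does not say how other values are parsed.

```python
def should_meter(env, inside_metered_function):
    """Decide whether a call is metered under the documented rule."""
    raw = env.get("REVENIUM_SELECTIVE_METERING", "false")
    selective = raw.strip().lower() == "true"
    if not selective:
        return True  # default mode: every call is metered
    return inside_metered_function  # selective mode: decorated functions only
```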

The middleware also supports Ollama embeddings for both single and batch text inputs:

from dotenv import load_dotenv
load_dotenv()

import ollama
import revenium_middleware_ollama

# Single text embedding
response = ollama.embed(
    model='nomic-embed-text',
    input='The quick brown fox jumps over the lazy dog'
)
print(f"Embedding dimensions: {len(response['embeddings'][0])}")

# Batch embeddings
texts = [
    "Machine learning is a subset of artificial intelligence",
    "Deep learning uses neural networks with multiple layers",
    "Natural language processing enables computers to understand text"
]
response = ollama.embed(
    model='nomic-embed-text',
    input=texts,
    usage_metadata={
        "organization_id": "acme-corp",
        "task_type": "semantic-search"
    }
)
print(f"Generated {len(response['embeddings'])} embeddings")

OpenAI Compatibility Mode

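Revenium metering can also be used with Ollama's OpenAI compatibility mode. Note that the example below imports the OpenAI middleware (revenium_middleware_openai) rather than revenium_middleware_ollama: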

import openai
import revenium_middleware_openai

openai.api_key = 'ollama'
openai.base_url = 'http://localhost:11434/v1/'
question = "Why is the sky blue?"

response = openai.chat.completions.create(
    model="gemma2:2b",
    messages=[
       {"role": "system", "content": "You are a helpful assistant."},
       {"role": "user", "content": question}
    ],
    usage_metadata={
        "trace_id": "conv-28a7e9d4",
        "task_type": "summarize-customer-issue",
        "subscriber": {
            "id": "subscriberid-1234567890",
            "email": "user@example.com",
            "credential": {
                "name": "engineering-api-key",
                "value": "sk-1234567890abcdef"
            }
        },
        "organization_id": "acme-corp",
        "subscription_id": "startup-plan-Q1",
        "product_id": "saas-app-gold-tier",
        "agent": "support-agent",
    }
)

print(response)

Metadata Fields

The usage_metadata parameter supports the following fields:

Field	Description	Use Case
`trace_id`	Unique identifier for a conversation or session	Group multi-turn conversations into a single event for performance & cost tracking
`task_type`	Classification of the AI operation by type of work	Track cost & performance by purpose (e.g., classification, summarization)
`subscriber`	Object containing subscriber information	Track cost & performance by individual users or API keys
`subscriber.id`	The id of the subscriber from non-Revenium systems	Track cost & performance by individual users (if customers are anonymous or tracking by emails is not desired)
`subscriber.email`	The email address of the subscriber	Track cost & performance by individual users (if customer e-mail addresses are known)
`subscriber.credential`	Object containing credential information	Track cost & performance by API key
`subscriber.credential.name`	An alias for an API key used by one or more users	Track cost & performance by individual API keys
`subscriber.credential.value`	The key value associated with the subscriber (i.e., an API key)	Track cost & performance by API key value (normally used when the only identifier for a user is an API key)
`organization_id`	Customer or department ID from non-Revenium systems	Track cost & performance by customers or business units
`subscription_id`	Reference to a billing plan in non-Revenium systems	Track cost & performance by a specific subscription
`product_id`	Your product or feature making the AI call	Track cost & performance across different products
`agent`	Identifier for the specific AI agent	Track cost & performance by AI agent
`response_quality_score`	Custom quality rating for the AI response (0.0-1.0 scale)	Track user satisfaction or automated quality metrics (e.g., RAGAS, human feedback) for model performance analysis

All metadata fields are optional. Adding them enables more detailed reporting and analytics in Revenium.
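A lightweight check before sending can catch a misspelled key or an out-of-range score early. This is a sketch under assumptions: `validate_usage_metadata` is hypothetical, the allowed keys are the top-level fields from the table above, and whether the middleware itself rejects or ignores unknown keys is not stated here.

```python
# Top-level keys from the metadata table. Extend this set if you also pass
# trace visualization fields through usage_metadata.
ALLOWED_FIELDS = {
    "trace_id", "task_type", "subscriber", "organization_id",
    "subscription_id", "product_id", "agent", "response_quality_score",
}


def validate_usage_metadata(metadata):
    """Return a list of problems found in a usage_metadata dict (empty if none)."""
    unknown = sorted(set(metadata) - ALLOWED_FIELDS)
    problems = [f"unknown field: {key}" for key in unknown]

    score = metadata.get("response_quality_score")
    if score is not None and not 0.0 <= score <= 1.0:
        problems.append("response_quality_score must be between 0.0 and 1.0")
    return problems
```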

Trace Visualization Fields (v0.2.0+)

Enhanced observability fields for distributed tracing and analytics. These can be set via environment variables or passed in usage_metadata:

Field	Environment Variable	Description	Use Case
`environment`	`REVENIUM_ENVIRONMENT`	Deployment environment (e.g., "production", "staging")	Track usage across different deployment environments; auto-detects from `ENVIRONMENT`, `DEPLOYMENT_ENV`
`region`	`REVENIUM_REGION`	Cloud region identifier (e.g., "us-east-1", "eastus")	Multi-region deployment tracking; auto-detects from `AWS_REGION`, `AZURE_REGION`, `GCP_REGION`
`credential_alias`	`REVENIUM_CREDENTIAL_ALIAS`	Human-readable API key name (e.g., "prod-ollama-key")	Track which credential was used for credential rotation and security auditing
`trace_type`	`REVENIUM_TRACE_TYPE`	Workflow category identifier (max 128 chars)	Group similar workflows (e.g., "customer-support", "data-analysis") for analytics
`trace_name`	`REVENIUM_TRACE_NAME`	Human-readable trace label (max 256 chars)	Label trace instances (e.g., "Customer Support Chat", "Document Analysis")
`parent_transaction_id`	`REVENIUM_PARENT_TRANSACTION_ID`	Parent transaction ID for distributed tracing	Link child operations to parent transactions across services
`transaction_name`	`REVENIUM_TRANSACTION_NAME`	Human-friendly operation name	Label individual operations (e.g., "Generate Response", "Analyze Sentiment")

Note: operation_type and operation_subtype are automatically detected by the middleware based on the API endpoint and request parameters.
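The table describes several sources for `region`: a value passed in usage_metadata, `REVENIUM_REGION`, and the cloud provider variables. A sketch of such a lookup follows. `resolve_region` is a hypothetical helper, and the precedence shown (explicit metadata first, then the variables in the order listed) is an assumption rather than documented behavior.

```python
import os

REGION_ENV_VARS = ("REVENIUM_REGION", "AWS_REGION", "AZURE_REGION", "GCP_REGION")


def resolve_region(usage_metadata=None, env=None):
    """Return the first region found, or None if nothing is set."""
    env = os.environ if env is None else env

    explicit = (usage_metadata or {}).get("region")
    if explicit:
        return explicit

    for name in REGION_ENV_VARS:
        value = env.get(name)
        if value:
            return value
    return None
```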

Resources:

API Reference - Complete metadata field documentation
.env.example - Environment variable configuration examples
examples/trace_visualization_example.py - Comprehensive trace visualization examples

Response Attributes

Response objects include a _revenium_transaction_id attribute for correlating requests with Revenium metering records:

response = ollama.chat(
    model='qwen2.5:0.5b',
    messages=[{'role': 'user', 'content': 'Hello!'}]
)

# Access transaction ID if needed for debugging/correlation
transaction_id = response._revenium_transaction_id
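If the same code may also run without the middleware loaded, reading the attribute defensively avoids an `AttributeError`. A minimal sketch, in which `transaction_id_of` is a hypothetical helper and `SimpleNamespace` stands in for a real response object:

```python
from types import SimpleNamespace


def transaction_id_of(response):
    """Return the Revenium transaction ID, or None if the attribute is absent."""
    return getattr(response, "_revenium_transaction_id", None)


metered = SimpleNamespace(_revenium_transaction_id="txn-123")  # placeholder value
plain = SimpleNamespace()
print(transaction_id_of(metered), transaction_id_of(plain))
```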

Configuration

Configuration Variables

Variable	Required	Description
`REVENIUM_METERING_API_KEY`	Yes	Your Revenium API key for authentication with the metering service
`REVENIUM_METERING_BASE_URL`	No	Revenium API base URL. Defaults to `https://api.revenium.ai`
`REVENIUM_LOG_LEVEL`	No	Log level for middleware output. Options: `DEBUG`, `INFO` (default), `WARNING`, `ERROR`, `CRITICAL`
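The defaults in the table can be mirrored in a small loader, which is handy for failing fast when the required key is missing. This is a sketch: `MeteringConfig` and `load_config` are hypothetical names, not part of the middleware.

```python
import os
from dataclasses import dataclass


@dataclass(frozen=True)
class MeteringConfig:
    api_key: str
    base_url: str = "https://api.revenium.ai"
    log_level: str = "INFO"


def load_config(env=None):
    """Read the documented variables, applying the documented defaults."""
    env = os.environ if env is None else env

    api_key = env.get("REVENIUM_METERING_API_KEY")
    if not api_key:
        raise ValueError("REVENIUM_METERING_API_KEY is required")

    return MeteringConfig(
        api_key=api_key,
        base_url=env.get("REVENIUM_METERING_BASE_URL", MeteringConfig.base_url),
        log_level=env.get("REVENIUM_LOG_LEVEL", MeteringConfig.log_level).upper(),
    )
```

Calling `load_config()` with no arguments reads from the process environment, so it should run after `load_dotenv()`.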

Environment Setup Examples

Using a .env file (Recommended):

First, copy the example file:

cp .env.example .env

Then edit .env with your actual API key:

# Revenium API keys
REVENIUM_METERING_API_KEY="hak_..."
REVENIUM_METERING_BASE_URL="https://api.revenium.ai"

# Ollama is local - no API key needed
# Ensure Ollama is running: ollama serve

# Optional: Enable debug logging
# REVENIUM_LOG_LEVEL="DEBUG"

⚠️ Security Note: Never commit your .env file to version control. It's already included in .gitignore.

Then load it in your Python code:

from dotenv import load_dotenv
load_dotenv()

import ollama
import revenium_middleware_ollama

# Your Ollama calls will now be metered

Compatibility

Python 3.8+
Ollama Python SDK 1.0.0+

Supported Endpoints

The middleware automatically tracks usage for the following Ollama API endpoints:

ollama.chat() - Chat completions (streaming and non-streaming)
ollama.generate() - Text generation (streaming and non-streaming)
ollama.embed() - Text embeddings (single and batch)

All endpoints support the usage_metadata parameter for enhanced tracking.

Supported Models

This middleware works with any Ollama model. Examples in this package use:

qwen2.5:0.5b, qwen2.5:1.5b (Qwen models)
llama3.1, llama3.2 (Llama models)
gemma2, codellama (Other popular models)

For the complete list of available models, see the Ollama Model Library.

For cost tracking across providers, see the Revenium Model Catalog.

Logging

This module uses Python's standard logging system. You can control the log level by setting the REVENIUM_LOG_LEVEL environment variable in your .env file:

# Add to your .env file
REVENIUM_LOG_LEVEL="DEBUG"

Or when running your script:

REVENIUM_LOG_LEVEL=DEBUG python your_script.py

Available log levels:

DEBUG: Detailed debugging information
INFO: General information (default)
WARNING: Warning messages only
ERROR: Error messages only
CRITICAL: Critical error messages only
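For reference, level names like these map onto Python's `logging` constants as sketched below. `parse_log_level` is a hypothetical helper, and falling back to `INFO` for unrecognized values is an assumption based on the documented default, not confirmed middleware behavior.

```python
import logging

_LEVELS = {
    "DEBUG": logging.DEBUG,
    "INFO": logging.INFO,
    "WARNING": logging.WARNING,
    "ERROR": logging.ERROR,
    "CRITICAL": logging.CRITICAL,
}


def parse_log_level(value):
    """Translate a level name into a logging constant, defaulting to INFO."""
    return _LEVELS.get((value or "").strip().upper(), logging.INFO)


# Apply a parsed level to an illustrative logger
logging.getLogger("example").setLevel(parse_log_level("debug"))
```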

Documentation

For detailed documentation, visit docs.revenium.io

Contributing

See CONTRIBUTING.md

Code of Conduct

See CODE_OF_CONDUCT.md

Security

See SECURITY.md

License

This project is licensed under the MIT License - see the LICENSE file for details.

Acknowledgments

Built by the Revenium team

Release history

0.4.0 - Feb 12, 2026
0.3.0 - Dec 30, 2025 (this version)
0.2.0 - Dec 5, 2025
0.1.17 - Nov 7, 2025
0.1.16 - Jun 18, 2025
0.1.15 - Jun 18, 2025
0.1.14 - Jun 18, 2025
0.1.13 - Jun 17, 2025
0.1.11 - Jun 17, 2025
0.1.10 - Apr 21, 2025
0.1.9 - Apr 19, 2025
0.1.8 - Apr 18, 2025
0.1.7 - Apr 10, 2025
0.1.5 - Apr 4, 2025
0.1.4 - Mar 26, 2025
0.1.3 - Mar 23, 2025
0.1.2 - Mar 22, 2025
0.1.1 - Mar 22, 2025
0.1.0 - Mar 22, 2025

