Skip to main content

Run ollama on Google Collab or Kaggle

Project description

collab-ollama

Run Ollama on a Google Colab GPU and access it from your local machine via a Cloudflare Tunnel — no account or configuration required.

This is useful when models run too slow on your local machine, or when you need GPU-accelerated inference for synthetic data generation in large batches.

Quick Start

Run directly in a Colab cell with uvx — no install step needed:

!uvx collab-ollama

Or with a specific model:

!uvx collab-ollama -m gemma:2b

Alternative: pip install

If you prefer a traditional install:

!pip install collab-ollama
!collab-ollama

Specifying a Model

By default, phi3:mini is pulled and served. Use the -m / --model flag to choose a different model:

!uvx collab-ollama --model llama3:8b
!uvx collab-ollama -m gemma:2b

Once setup is complete, you'll see output like:

Setup is complete!

  Base URL : https://xxxx-xxxxx-xxxxx-xxxxx.trycloudflare.com/v1/
  API Key  : No key required — leave it blank or use any string
  Model    : gemma:2b

Usage

Use the printed Base URL and Model with any OpenAI-compatible client. No API key is needed — leave it blank or pass any arbitrary string.

Ollama CLI

On your local machine, set OLLAMA_HOST to the base URL (without /v1/) and use the Ollama CLI as usual. Inference runs on the Colab GPU, but the experience feels local. Make sure you have the Ollama CLI installed locally.

export OLLAMA_HOST='https://xxxx-xxxxx-xxxxx-xxxxx.trycloudflare.com'
ollama run gemma:2b --verbose

You can pull and run any model that fits in the Colab GPU memory:

ollama pull llama3:8b
ollama run llama3:8b

Python (OpenAI SDK)

Ollama exposes an OpenAI-compatible API. Install the SDK and use the Base URL directly:

pip install openai
from openai import OpenAI

client = OpenAI(
    base_url="https://xxxx-xxxxx-xxxxx-xxxxx.trycloudflare.com/v1/",
    api_key="ollama",  # any string works, or leave blank
)

response = client.chat.completions.create(
    model="gemma:2b",
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Hello!"},
    ],
)
print(response.choices[0].message.content)

Node.js (OpenAI SDK)

Install the SDK:

npm install openai
import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://xxxx-xxxxx-xxxxx-xxxxx.trycloudflare.com/v1/",
  apiKey: "ollama", // any string works, or leave blank
});

const response = await client.chat.completions.create({
  model: "gemma:2b",
  messages: [
    { role: "system", content: "You are a helpful assistant." },
    { role: "user", content: "Hello!" },
  ],
});
console.log(response.choices[0].message.content);

How It Works

  1. Installs Ollama if not already present.
  2. Installs Cloudflared if not already present.
  3. Starts ollama serve with OLLAMA_ORIGINS=* for broad CORS support.
  4. Pulls the specified model (default phi3:mini).
  5. Opens a Cloudflare quick tunnel to localhost:11434 and prints the Base URL, API Key info, and Model name.

Requirements

  • Colab: A Google Colab notebook with a GPU runtime.
  • Local machine: Ollama CLI (for CLI usage), Python with openai, or Node.js with openai.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

collab_ollama-1.2.3.tar.gz (3.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

collab_ollama-1.2.3-py3-none-any.whl (4.2 kB view details)

Uploaded Python 3

File details

Details for the file collab_ollama-1.2.3.tar.gz.

File metadata

  • Download URL: collab_ollama-1.2.3.tar.gz
  • Upload date:
  • Size: 3.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.6.12

File hashes

Hashes for collab_ollama-1.2.3.tar.gz
Algorithm Hash digest
SHA256 ef26de80f6628c73a7f17330c2a735d6513c9094b521c83bcc3d02bd7018a4d2
MD5 fedf6be7b5240e34c00496e17b0f7911
BLAKE2b-256 b86a8ac39921ebfd1c16bba17df35d7ec4f5fa85f2d46f9dba0e18af9566031f

See more details on using hashes here.

File details

Details for the file collab_ollama-1.2.3-py3-none-any.whl.

File metadata

File hashes

Hashes for collab_ollama-1.2.3-py3-none-any.whl
Algorithm Hash digest
SHA256 157cf76ac37f564cf8914704fc02df8c574d972ed3f563ad2cc3aec0acfeedcb
MD5 37e3eab9e74d3fbd1de343e11946cc7d
BLAKE2b-256 21aa721ccd42e3a900f518ef5ecdc63048528ddab789c77449251619ccd296d3

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page