# oci-genai-service

Production-ready Python toolkit for Oracle Cloud Infrastructure Generative AI Service.
## Features
- Unified client that auto-routes between OpenAI-compatible and native OCI APIs
- Chat, streaming, and vision with Llama 4, Grok, GPT-OSS, and Cohere models
- Function calling with a `@tool` decorator that auto-generates OpenAI-format schemas
- Text embeddings via Cohere Embed v3 models (native OCI SDK)
- Oracle AI Vector Search integration for production-grade vector storage
- RAG pipeline with Docling document loading, recursive chunking, and retrieval-augmented generation
- Tool-calling agent with memory, multi-turn sessions, and configurable reasoning loops
- Guardrails for content moderation, PII detection, and prompt injection defense
- Thinking trace extraction for reasoning models (Grok Code, GPT-OSS)
- CLI for interactive chat, model browsing, embeddings, and RAG operations
- 5 auth methods: User Principal, API Key, Instance Principal, Resource Principal, Session Token
## Quick Install

```bash
pip install oci-genai-service
```

For Oracle Vector Search support (requires Oracle Database 23ai):

```bash
pip install oci-genai-service[oracle]
```
## Quick Start

### Chat Completion

```python
from oci_genai_service import GenAIClient

client = GenAIClient(compartment_id="ocid1.compartment.oc1..xxx")
response = client.chat("Explain Oracle AI Vector Search in 3 sentences.")
print(response.text)
```
### Streaming

```python
from oci_genai_service import GenAIClient

client = GenAIClient(compartment_id="ocid1.compartment.oc1..xxx")
for chunk in client.chat("Write a haiku about databases.", stream=True):
    print(chunk, end="", flush=True)
```
### Vision (Image + Text)

```python
from oci_genai_service import GenAIClient

client = GenAIClient(compartment_id="ocid1.compartment.oc1..xxx")
response = client.chat(
    "Describe this architecture diagram.",
    model="meta.llama-3.2-90b-vision-instruct",
    images=["diagram.png"],
)
print(response.text)
```
### RAG with Oracle Vector Search

```python
from oci_genai_service import GenAIClient, RAGPipeline, OracleVectorStore

client = GenAIClient(compartment_id="ocid1.compartment.oc1..xxx")
store = OracleVectorStore(
    dsn="localhost:1521/FREEPDB1",
    user="genai",
    password="genai",
)
pipeline = RAGPipeline(client=client, vector_store=store)

pipeline.ingest("quarterly-report.pdf")
result = pipeline.query("What was revenue in Q4?")

print(result.text)
for source in result.sources:
    print(f"  [{source['score']:.2f}] {source['chunk'][:80]}...")
```
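The recursive chunking step mentioned in the feature list can be illustrated with a minimal, self-contained sketch. This shows the general technique only, not the library's actual implementation; the separator order and chunk size here are assumptions:

```python
def recursive_chunk(text: str, max_len: int = 200,
                    seps=("\n\n", "\n", ". ", " ")) -> list[str]:
    """Split text on the coarsest separator that yields pieces <= max_len,
    recursing with finer separators for pieces that are still too long."""
    if len(text) <= max_len:
        return [text] if text.strip() else []
    if not seps:
        # No separators left: hard-split at max_len.
        return [text[i:i + max_len] for i in range(0, len(text), max_len)]
    head, *rest = seps
    chunks: list[str] = []
    for piece in text.split(head):
        if len(piece) <= max_len:
            if piece.strip():
                chunks.append(piece)
        else:
            chunks.extend(recursive_chunk(piece, max_len, tuple(rest)))
    return chunks

doc = ("Oracle AI Vector Search stores embeddings next to relational data.\n\n"
       + "That lets RAG pipelines run similarity search and SQL in one database. " * 3)
chunks = recursive_chunk(doc, max_len=120)
print(len(chunks), max(len(c) for c in chunks))
```

Paragraph breaks are preferred over sentence breaks, and sentence breaks over word breaks, so chunks stay as semantically coherent as the budget allows.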
### Agent with Tools

```python
from oci_genai_service import GenAIClient, Agent, tool

@tool
def get_weather(city: str) -> str:
    """Get current weather for a city."""
    return f"72F and sunny in {city}"

@tool
def get_population(city: str) -> int:
    """Get population of a city."""
    return 1_000_000

client = GenAIClient(compartment_id="ocid1.compartment.oc1..xxx")
agent = Agent(client=client, tools=[get_weather, get_population])

result = agent.run("What's the weather and population in Austin?")
print(result.text)
print(f"Tool calls: {len(result.tool_calls)}, Steps: {result.steps}")
```
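The schema auto-generation that a decorator like `@tool` performs can be sketched in plain Python: derive an OpenAI-format function schema from a function's signature and docstring. This is a simplified illustration of the technique, not the library's actual decorator:

```python
import inspect

# Minimal mapping from Python annotations to JSON Schema types.
_JSON_TYPES = {str: "string", int: "integer", float: "number", bool: "boolean"}

def tool_schema(fn) -> dict:
    """Build an OpenAI-format function schema from a function's signature."""
    sig = inspect.signature(fn)
    properties = {}
    required = []
    for name, param in sig.parameters.items():
        properties[name] = {"type": _JSON_TYPES.get(param.annotation, "string")}
        if param.default is inspect.Parameter.empty:
            required.append(name)  # parameters without defaults are required
    return {
        "type": "function",
        "function": {
            "name": fn.__name__,
            "description": (fn.__doc__ or "").strip(),
            "parameters": {
                "type": "object",
                "properties": properties,
                "required": required,
            },
        },
    }

def get_weather(city: str, units: str = "F") -> str:
    """Get current weather for a city."""
    return f"72{units} and sunny in {city}"

schema = tool_schema(get_weather)
print(schema["function"]["name"])                     # get_weather
print(schema["function"]["parameters"]["required"])   # ['city']
```

The generated dict is what gets sent to the model as the tool definition, so the function's type hints and docstring directly shape how the model decides to call it.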
## Authentication

| Method | `auth_type` | When to Use |
|---|---|---|
| User Principal | `user_principal` | Local development with `~/.oci/config` (default) |
| API Key | `api_key` | Simple token-based auth, no OCI config needed |
| Instance Principal | `instance_principal` | Running on OCI compute instances |
| Resource Principal | `resource_principal` | OCI Functions, Data Science jobs |
| Session Token | `session` | Temporary session-based authentication |
### Environment Variables

```bash
export OCI_COMPARTMENT_ID="ocid1.compartment.oc1..xxx"
export OCI_REGION="us-chicago-1"          # default
export OCI_PROFILE="DEFAULT"              # OCI config profile
export OCI_CONFIG_FILE="~/.oci/config"    # OCI config path
export OCI_GENAI_API_KEY="your-api-key"   # triggers api_key auth
```
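Environment variables conventionally layer under explicit arguments. The sketch below shows the common precedence pattern (explicit argument, then environment variable, then default); it illustrates the pattern, not the library's guaranteed behavior:

```python
import os

def resolve_setting(explicit, env_var: str, default=None):
    """Common precedence: explicit argument > environment variable > default."""
    if explicit is not None:
        return explicit
    return os.environ.get(env_var, default)

os.environ["OCI_REGION"] = "us-chicago-1"
print(resolve_setting(None, "OCI_REGION", "us-ashburn-1"))   # us-chicago-1
print(resolve_setting("eu-frankfurt-1", "OCI_REGION"))       # eu-frankfurt-1
```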
### Explicit Config

```python
from oci_genai_service import GenAIClient, AuthConfig

# User Principal (default)
client = GenAIClient(compartment_id="ocid1.compartment.oc1..xxx")

# API Key
client = GenAIClient(api_key="your-key", compartment_id="ocid1.compartment.oc1..xxx")

# Instance Principal
config = AuthConfig(auth_type="instance_principal", compartment_id="ocid1.compartment.oc1..xxx")
client = GenAIClient(config=config)
```
## CLI

The `oci-genai` command is installed automatically.

### Interactive Chat

```bash
oci-genai chat
# Model: meta.llama-4-maverick
# You> What is Oracle Cloud?
# Assistant> ...
```
### One-shot Chat

```bash
oci-genai chat "Explain RAG in one paragraph" --model meta.llama-3.3-70b-instruct
```

### Vision

```bash
oci-genai chat "Describe this image" --image photo.jpg --model meta.llama-3.2-90b-vision-instruct
```
### Browse Models

```bash
# List all models
oci-genai models list

# Filter by vendor
oci-genai models list --vendor meta

# Filter by capability
oci-genai models list --capability vision

# Model details
oci-genai models info meta.llama-4-maverick
```
### Embeddings

```bash
oci-genai embed "Oracle AI Vector Search" --model cohere.embed-english-v3.0
```
### RAG

```bash
# Ingest documents
oci-genai rag ingest ./docs/ --dsn localhost:1521/FREEPDB1 --user genai --password genai

# Query
oci-genai rag query "What are the main findings?" --dsn localhost:1521/FREEPDB1
```
## Oracle Vector Search Setup

Start Oracle Database 23ai Free with Docker Compose:

```yaml
# docker-compose.yml
services:
  oracle-db:
    image: gvenzl/oracle-free:23-slim-faststart
    ports:
      - "1521:1521"
    environment:
      ORACLE_PASSWORD: genai
      APP_USER: genai
      APP_USER_PASSWORD: genai
    volumes:
      - oracle-data:/opt/oracle/oradata

volumes:
  oracle-data:
```

```bash
docker compose up -d

# Verify it's ready
docker compose logs -f oracle-db  # wait for "DATABASE IS READY TO USE"
```

The `OracleVectorStore` will auto-create the vector table on first use.
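At query time, a vector store ranks stored chunks by the distance between their embeddings and the query embedding. The in-memory sketch below illustrates cosine-similarity top-k retrieval; in production this ranking is delegated to the database, and the three-dimensional vectors here are toy stand-ins for real 1024-dimensional embeddings:

```python
import math

def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine of the angle between two vectors: 1.0 = same direction."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm

def top_k(query_vec, store, k=2):
    """store: list of (chunk_text, embedding) pairs; return best k by similarity."""
    scored = [(cosine_similarity(query_vec, emb), text) for text, emb in store]
    return sorted(scored, reverse=True)[:k]

store = [
    ("Q4 revenue grew 12%", [0.9, 0.1, 0.0]),
    ("The office moved to Austin", [0.1, 0.9, 0.2]),
    ("Revenue guidance for next year", [0.8, 0.2, 0.1]),
]
for score, text in top_k([1.0, 0.0, 0.0], store):
    print(f"[{score:.2f}] {text}")
```

The scores printed by the RAG example's `result.sources` loop are the database-side equivalent of these similarity values.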
## Supported Models

### Chat Models (OpenAI-Compatible API)

| Model ID | Name | Vendor | Capabilities | Context |
|---|---|---|---|---|
| `meta.llama-4-scout` | Llama 4 Scout | Meta | chat, function_calling, streaming | 131K |
| `meta.llama-4-maverick` | Llama 4 Maverick | Meta | chat, function_calling, streaming | 131K |
| `meta.llama-3.3-70b-instruct` | Llama 3.3 70B Instruct | Meta | chat, function_calling, streaming | 131K |
| `meta.llama-3.1-405b-instruct` | Llama 3.1 405B Instruct | Meta | chat, function_calling, streaming | 131K |
| `meta.llama-3.1-70b-instruct` | Llama 3.1 70B Instruct | Meta | chat, function_calling, streaming | 131K |
| `xai.grok-4.1-fast` | Grok 4.1 Fast | xAI | chat, streaming | 131K |
| `xai.grok-4` | Grok 4 | xAI | chat, streaming | 131K |
| `xai.grok-4-fast` | Grok 4 Fast | xAI | chat, streaming | 131K |
| `xai.grok-3` | Grok 3 | xAI | chat, streaming | 131K |
| `xai.grok-3-mini` | Grok 3 Mini | xAI | chat, streaming | 131K |
| `xai.grok-3-mini-fast` | Grok 3 Mini Fast | xAI | chat, streaming | 131K |
| `xai.grok-code-fast-1` | Grok Code Fast 1 | xAI | chat, streaming, thinking | 131K |
| `openai.gpt-oss-120b` | GPT-OSS 120B | OpenAI | chat, streaming, thinking | 131K |
| `openai.gpt-oss-20b` | GPT-OSS 20B | OpenAI | chat, streaming | 131K |
### Vision Models (OpenAI-Compatible API)

| Model ID | Name | Vendor | Capabilities | Context |
|---|---|---|---|---|
| `meta.llama-3.2-90b-vision-instruct` | Llama 3.2 90B Vision | Meta | chat, vision, streaming | 131K |
| `meta.llama-3.2-11b-vision-instruct` | Llama 3.2 11B Vision | Meta | chat, vision, streaming | 131K |
### Chat Models (Native OCI API)

| Model ID | Name | Vendor | Capabilities | Context |
|---|---|---|---|---|
| `cohere.command-a` | Command A | Cohere | chat, streaming | 262K |
| `cohere.command-r-plus` | Command R+ | Cohere | chat, streaming | 131K |
| `cohere.command-r` | Command R | Cohere | chat, streaming | 131K |
### Embedding Models (Native OCI API)

| Model ID | Name | Vendor | Dimensions |
|---|---|---|---|
| `cohere.embed-english-v3.0` | Embed English v3.0 | Cohere | 1024 |
| `cohere.embed-multilingual-v3.0` | Embed Multilingual v3.0 | Cohere | 1024 |
| `cohere.embed-english-light-v3.0` | Embed English Light v3.0 | Cohere | 384 |
| `cohere.embed-multilingual-light-v3.0` | Embed Multilingual Light v3.0 | Cohere | 384 |
## Examples

See the `examples/` directory for complete working examples:

- `chat_basic.py`: simple chat completion
- `chat_streaming.py`: streaming output
- `vision.py`: image analysis with vision models
- `function_calling.py`: tool use with the `@tool` decorator
- `embeddings.py`: text embeddings
- `rag_pipeline.py`: full RAG with Oracle Vector Search
- `agent.py`: tool-calling agent with memory
- `guardrails.py`: content moderation and PII detection
- `thinking.py`: reasoning trace extraction
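For a flavor of what reasoning trace extraction involves: thinking models interleave delimited reasoning with the final answer, and extraction is a matter of parsing out the delimited span. The `<think>` tags in this sketch are an assumption for illustration only; the actual format varies by model (Grok Code and GPT-OSS use their own conventions), and the library handles that for you:

```python
import re

def split_thinking(raw: str, tag: str = "think") -> tuple[str, str]:
    """Return (thinking_trace, answer) from a raw model response.

    Assumes reasoning is wrapped in <think>...</think>; real delimiters
    vary by model, so this is an illustration of the technique only.
    """
    match = re.search(rf"<{tag}>(.*?)</{tag}>", raw, flags=re.DOTALL)
    if not match:
        return "", raw.strip()
    thinking = match.group(1).strip()
    answer = (raw[:match.start()] + raw[match.end():]).strip()
    return thinking, answer

raw = ("<think>User wants a one-line answer. Keep it short.</think>"
       "Oracle 23ai adds VECTOR columns.")
trace, answer = split_thinking(raw)
print(trace)   # User wants a one-line answer. Keep it short.
print(answer)  # Oracle 23ai adds VECTOR columns.
```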
## License

Copyright (c) 2024, 2025 Oracle and/or its affiliates.

Licensed under the Universal Permissive License v1.0 (UPL-1.0). See LICENSE for details.
## File details

Details for the file `oci_genai_service-2.0.0.tar.gz`.

### File metadata

- Download URL: oci_genai_service-2.0.0.tar.gz
- Upload date:
- Size: 41.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.12

### File hashes

| Algorithm | Hash digest |
|---|---|
| SHA256 | `6056ccc587676137f994796736b628bba6bf5cca833c3fba7fc0dfc5a9cdc3f3` |
| MD5 | `34224738df9856d608a91f10c50f272a` |
| BLAKE2b-256 | `14963b89c9106743b2163c0d7f7b8a50c0b0fa795feb10e03335dca43813c2f4` |
## File details

Details for the file `oci_genai_service-2.0.0-py3-none-any.whl`.

### File metadata

- Download URL: oci_genai_service-2.0.0-py3-none-any.whl
- Upload date:
- Size: 30.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.12

### File hashes

| Algorithm | Hash digest |
|---|---|
| SHA256 | `2b4734c1c57b6615aa9dd7e9578fbb9efde733b1a15692d11f466ff11b7fe668` |
| MD5 | `a29dbbb22c1f8682363a87e29dd982ce` |
| BLAKE2b-256 | `284c8b7f52542990250e8a1fbbd7e320a75cc7b750cd23a860b3c57c48334a79` |