An integration package connecting OCI and LangChain

These details have not been verified by PyPI

Project links

Project description

langchain-oci

This package contains the LangChain integrations with oci.

Installation

pip install -U langchain-oci

All integrations in this package assume that you have the credentials setup to connect with oci services.

Quick Start

This repository includes two main integration categories:

OCI Generative AI
OCI Data Science (Model Deployment)

OCI Generative AI Examples

OCI Generative AI supports two types of models:

On-Demand Models: Pre-hosted foundation models.
DAC Models: Models hosted on Dedicated AI Clusters (DAC), including custom models imported from Hugging Face or Object Storage

1a. Use a Chat Model (On-Demand)

ChatOCIGenAI class exposes chat models from OCI Generative AI.

from langchain_oci import ChatOCIGenAI

# Using a pre-hosted on-demand model
llm = ChatOCIGenAI(
    model_id="MY_MODEL_ID",  # Pre-hosted model ID
    service_endpoint="https://inference.generativeai.us-chicago-1.oci.oraclecloud.com",  # Regional endpoint
    compartment_id="ocid1.compartment.oc1..xxxxx",  # Your compartment OCID
    model_kwargs={"max_tokens": 1024},  # Use max_completion_tokens for OpenAI models
    auth_profile="MY_AUTH_PROFILE",
    is_stream=True,
    auth_type="SECURITY_TOKEN"
)

response = llm.invoke("Sing a ballad of LangChain.")

1b. Use a Chat Model (Imported Model on DAC)

For models you've imported and deployed on a Dedicated AI Cluster:

from langchain_oci import ChatOCIGenAI

# Using an imported model on Dedicated AI Cluster
llm = ChatOCIGenAI(
    model_id="ocid1.generativeaiendpoint.oc1.us-chicago-1.xxxxx",  # Endpoint OCID from your DAC
    provider="generic",  # Provider type: "cohere", "google", "meta", or "generic"
    service_endpoint="https://inference.generativeai.us-chicago-1.oci.oraclecloud.com",  # Regional endpoint
    compartment_id="ocid1.compartment.oc1..xxxxx",  # Your compartment OCID
    auth_type="SECURITY_TOKEN",  # Authentication type
    auth_profile="MY_AUTH_PROFILE",
    model_kwargs={"temperature": 0.7, "max_tokens": 500},
)

response = llm.invoke("Hello, what is your name?")

Additional Arguments for Imported Models:

model_id: Use the endpoint OCID (starts with ocid1.generativeaiendpoint)
provider: Provider type for your model. Available providers:
- "cohere": For Cohere models (CohereProvider)
- "google": For Google Gemini models (GeminiProvider) - automatically handles max_output_tokens to max_tokens parameter mapping
- "meta": For Meta Llama models (MetaProvider)
- "generic": Default for other models including OpenAI (GenericProvider) If not specified, the provider is auto-detected from the model_id prefix.
service_endpoint: Use regional API endpoint (not the internal cluster URL)

2. Use a Completion Model

OCIGenAI class exposes LLMs from OCI Generative AI.

from langchain_oci import OCIGenAI

llm = OCIGenAI()
llm.invoke("The meaning of life is")

3. Use an Embedding Model

OCIGenAIEmbeddings class exposes embeddings from OCI Generative AI.

from langchain_oci import OCIGenAIEmbeddings

embeddings = OCIGenAIEmbeddings()
embeddings.embed_query("What is the meaning of life?")

3b. Use Image Embeddings (Multimodal)

OCIGenAIEmbeddings supports image embeddings with multimodal models like cohere.embed-v4.0.

from langchain_oci import OCIGenAIEmbeddings

embeddings = OCIGenAIEmbeddings(
    model_id="cohere.embed-v4.0",
    service_endpoint="https://inference.generativeai.us-chicago-1.oci.oraclecloud.com",
    compartment_id="ocid1.compartment.oc1..xxxxx",
)

# Embed a single image (from file path, bytes, or data URI)
image_vector = embeddings.embed_image("path/to/image.png")

# Embed multiple images in a batch
image_vectors = embeddings.embed_image_batch([
    "path/to/image1.png",
    "path/to/image2.jpg",
    b"\x89PNG...",  # raw bytes
])

# Image and text embeddings share the same vector space for cross-modal retrieval
text_vector = embeddings.embed_query("a photo of a cat")

_{Note: Image embeddings require a multimodal model. Use IMAGE_EMBEDDING_MODELS to check supported models.}

4. Use Structured Output

ChatOCIGenAI supports structured output.

_{Note: The default method is function_calling. If default method returns None (e.g., for Google Gemini models using GeminiProvider), try json_schema or json_mode.}

from langchain_oci import ChatOCIGenAI
from pydantic import BaseModel

class Joke(BaseModel):
    setup: str
    punchline: str

llm = ChatOCIGenAI()
structured_llm = llm.with_structured_output(Joke)
structured_llm.invoke("Tell me a joke about programming")

5. Use OpenAI Responses API

ChatOCIOpenAI supports OpenAI Responses API.

from oci_openai import (
    OciSessionAuth,
)
from langchain_oci import ChatOCIOpenAI
client = ChatOCIOpenAI(
        auth=OciSessionAuth(profile_name="MY_PROFILE_NAME"),
        compartment_id="MY_COMPARTMENT_ID",
        region="us-chicago-1",
        model="openai.gpt-4.1",
        conversation_store_id="MY_CONVERSATION_STORE_ID"
    )
messages = [
        (
            "system",
            "You are a helpful translator. Translate the user sentence to French.",
        ),
        ("human", "I love programming."),
    ]
response = client.invoke(messages)

NOTE: By default store argument is set to True which requires passing conversation_store_id. You can set store to False and not pass conversation_store_id.

from oci_openai import (
    OciSessionAuth,
)
from langchain_oci import ChatOCIOpenAI
client = ChatOCIOpenAI(
        auth=OciSessionAuth(profile_name="MY_PROFILE_NAME"),
        compartment_id="MY_COMPARTMENT_ID",
        region="us-chicago-1",
        model="openai.gpt-4.1",
        store=False
    )
messages = [
        (
            "system",
            "You are a helpful translator. Translate the user sentence to French.",
        ),
        ("human", "I love programming."),
    ]
response = client.invoke(messages)

6. Use Parallel Tool Calling (Meta/Llama 4+ models only)

Enable parallel tool calling to execute multiple tools simultaneously, improving performance for multi-tool workflows.

from langchain_oci import ChatOCIGenAI

llm = ChatOCIGenAI(
    model_id="meta.llama-4-maverick-17b-128e-instruct-fp8",
    service_endpoint="https://inference.generativeai.us-chicago-1.oci.oraclecloud.com",
    compartment_id="MY_COMPARTMENT_ID",
)

# Enable parallel tool calling in bind_tools
llm_with_tools = llm.bind_tools(
    [get_weather, calculate_tip, get_population],
    parallel_tool_calls=True  # Tools can execute simultaneously
)

_{Note: Parallel tool calling is only supported for Llama 4+ models. Llama 3.x (including 3.3) and Cohere models will raise an error if this parameter is used.}

OCI Data Science Model Deployment Examples

1. Use a Chat Model

You may instantiate the OCI Data Science model with the generic ChatOCIModelDeployment or framework specific class like ChatOCIModelDeploymentVLLM.

from langchain_oci.chat_models import ChatOCIModelDeployment, ChatOCIModelDeploymentVLLM

# Create an instance of OCI Model Deployment Endpoint
# Replace the endpoint uri with your own
endpoint = "https://modeldeployment.<region>.oci.customer-oci.com/<ocid>/predict"

messages = [
    (
        "system",
        "You are a helpful assistant that translates English to French. Translate the user sentence.",
    ),
    ("human", "I love programming."),
]

chat = ChatOCIModelDeployment(
    endpoint=endpoint,
    streaming=True,
    max_retries=1,
    model_kwargs={
        "temperature": 0.2,
        "max_tokens": 512,
    },  # other model params...
    default_headers={
        "route": "/v1/chat/completions",
        # other request headers ...
    },
)
chat.invoke(messages)

chat_vllm = ChatOCIModelDeploymentVLLM(endpoint=endpoint)
chat_vllm.invoke(messages)

2. Use a Completion Model

You may instantiate the OCI Data Science model with OCIModelDeploymentLLM or OCIModelDeploymentVLLM.

from langchain_oci.llms import OCIModelDeploymentLLM, OCIModelDeploymentVLLM

# Create an instance of OCI Model Deployment Endpoint
# Replace the endpoint uri and model name with your own
endpoint = "https://modeldeployment.<region>.oci.customer-oci.com/<ocid>/predict"

llm = OCIModelDeploymentLLM(
    endpoint=endpoint,
    model="odsc-llm",
)
llm.invoke("Who is the first president of United States?")

vllm = OCIModelDeploymentVLLM(
    endpoint=endpoint,
)
vllm.invoke("Who is the first president of United States?")

3. Use an Embedding Model

You may instantiate the OCI Data Science model with the OCIModelDeploymentEndpointEmbeddings.

from langchain_oci.embeddings import OCIModelDeploymentEndpointEmbeddings

# Create an instance of OCI Model Deployment Endpoint
# Replace the endpoint uri with your own
endpoint = "https://modeldeployment.<region>.oci.customer-oci.com/<ocid>/predict"

embeddings = OCIModelDeploymentEndpointEmbeddings(
    endpoint=endpoint,
)

query = "Hello World!"
embeddings.embed_query(query)

documents = ["This is a sample document", "and here is another one"]
embeddings.embed_documents(documents)

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.2.6

May 13, 2026

0.2.5

Mar 6, 2026

This version

0.2.4

Feb 17, 2026

0.2.3

Feb 3, 2026

0.2.2

Jan 15, 2026

0.2.1

Dec 11, 2025

0.2.0

Nov 25, 2025

0.1.6

Nov 5, 2025

0.1.6.dev1 pre-release

Nov 5, 2025

0.1.5

Sep 9, 2025

0.1.3

Aug 14, 2025

0.1.2

Aug 12, 2025

0.1.1

Aug 8, 2025

0.1.0

Jul 29, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

langchain_oci-0.2.4.tar.gz (54.9 kB view details)

Uploaded Feb 17, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

langchain_oci-0.2.4-py3-none-any.whl (69.1 kB view details)

Uploaded Feb 17, 2026 Python 3

File details

Details for the file langchain_oci-0.2.4.tar.gz.

File metadata

Download URL: langchain_oci-0.2.4.tar.gz
Upload date: Feb 17, 2026
Size: 54.9 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.3

File hashes

Hashes for langchain_oci-0.2.4.tar.gz
Algorithm	Hash digest
SHA256	`e55b5eafe86f4acbf023ebca811d35fa8f5471a840e32b5a9c913ac61a52fdde`
MD5	`95d7a0b59f1a1c5a3d09436e50e73d4b`
BLAKE2b-256	`57d704f1fa0d21191d9b3f1366cb5caccfc6229ae3c931dc55415351fc613abb`

See more details on using hashes here.

File details

Details for the file langchain_oci-0.2.4-py3-none-any.whl.

File metadata

Download URL: langchain_oci-0.2.4-py3-none-any.whl
Upload date: Feb 17, 2026
Size: 69.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.3

File hashes

Hashes for langchain_oci-0.2.4-py3-none-any.whl
Algorithm	Hash digest
SHA256	`7cf2a707bd7a8ff5e2a1ecb406ef75e7205a044ce82b0a639def8c699bca2400`
MD5	`300b9f0f71513dd6f17bc723a2b8d77d`
BLAKE2b-256	`4d4732070168d9beb1535a6ad2a27cde6ef33c3e8ac4cba6e260c16ef534684f`

See more details on using hashes here.

langchain-oci 0.2.4

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

langchain-oci

Installation

Quick Start

OCI Generative AI Examples

1a. Use a Chat Model (On-Demand)

1b. Use a Chat Model (Imported Model on DAC)

2. Use a Completion Model

3. Use an Embedding Model

3b. Use Image Embeddings (Multimodal)

4. Use Structured Output

5. Use OpenAI Responses API

6. Use Parallel Tool Calling (Meta/Llama 4+ models only)

OCI Data Science Model Deployment Examples

1. Use a Chat Model

2. Use a Completion Model

3. Use an Embedding Model

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes