KAITO RAG Engine Client

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

These details have not been verified by PyPI

Project description

KAITO RAG Engine Client

A Python client library for interacting with the KAITO RAGEngine API. This client is generated using the openapi-python-client project against the KAITO RAGEngine OpenAPI spec.

Client Generation

The OpenAPI spec for the KAITO RAGEngine is generated from the FastAPI service the RAGEngine runs. To regenerate this client, first download the openapi.json file from a running RAGEngine Service at <RAG_Engine_Service_Endpoint>/openapi.json. Save the file into the repo and run make generate-client

About KAITO

KAITO (Kubernetes AI Toolchain Operator) is an operator that automates AI/ML model inference workloads in Kubernetes clusters. The RAGEngine component provides powerful Retrieval-Augmented Generation capabilities, combining large language models with information retrieval systems for enhanced, context-aware responses.

Features

🔍 Document Management: Index, update, delete, and list documents
📊 Index Operations: Create, persist, load, and delete indexes
🤖 RAG Queries: Perform retrieval-augmented generation queries
💬 Chat Completions: OpenAI-compatible chat interface with RAG support
🔧 Code-Aware Splitting: Support for code documents with language-specific chunking
🎯 Metadata Filtering: Filter documents by custom metadata
📖 Comprehensive API: Full coverage of KAITO RAGEngine REST API

Integration with KAITO

This client is designed to work with KAITO RAGEngine deployments. To set up a KAITO RAGEngine in your Kubernetes cluster:

Install KAITO operator in your cluster
Deploy a RAGEngine custom resource
Use the service endpoint as your base_url

For detailed setup instructions, see the KAITO documentation.

Usage

For sync usage, you need to create a Client and pass it into the desired api's as follows:

from kaito_rag_engine_client import Client
from kaito_rag_engine_client.api.chat import chat
from kaito_rag_engine_client.models import IndexRequest, Document, UpdateDocumentRequest, DeleteDocumentRequest, ChatRequest
from kaito_rag_engine_client.api.index import list_indexes, create_index, delete_index, list_documents_in_index, delete_documents_in_index, update_documents_in_index, persist_index, load_index

client = Client(base_url="http://api.example.com")

# List all indexes
indexes = list_indexes.sync(client=client)

#Create Index Request
resp = create_index.sync(client=client,body=IndexRequest(
    index_name="test_index",
    documents=[
        Document(
            text="Sample document text",
            metadata={"source": "unit_test"}
        )
    ]
))

# Delete Index Request
delete_resp = delete_index.sync(client=client, index_name="test_index")

# Persist Index Request
persist_index_resp = persist_index.sync(
    client=client,
    index_name="test_index"
)

# Load Index Request
load_index_resp = load_index.sync(
    client=client,
    index_name="test_index",
    overwrite=True
)

# List Documents In Index Request
list_resp = list_documents_in_index.sync(client=client, index_name="test_index")

# Update a Documents text
test_doc = list_resp.documents[0]
test_doc.text = "Updated document text"

# Update Documents Request
update_resp = update_documents_in_index.sync(
    client=client,
    index_name="test_index",
    body=UpdateDocumentRequest(
        documents=[
            test_doc
        ]
    )
)

# Delete Documents Request
delete_doc_resp = delete_documents_in_index.sync(
    client=client,
    index_name="test_index",
    body=DeleteDocumentRequest(
        doc_ids=[
            test_doc.doc_id
        ]
    )
)

# Chat Completions Request
chat_resp = chat.sync(client=client, body=ChatRequest.from_dict({
        "index_name": "test_index",
        "model": "<Your Model>",
        "messages": [{"role": "user", "content": "What can you tell me about AI?"}],
        "temperature": 0.7,
        "max_tokens": 100,
    }
))

Or do the same thing with an async version:

from kaito_rag_engine_client import Client
from kaito_rag_engine_client.api.chat import chat
from kaito_rag_engine_client.models import IndexRequest, Document, UpdateDocumentRequest, DeleteDocumentRequest, ChatRequest
from kaito_rag_engine_client.api.index import list_indexes, create_index, delete_index, list_documents_in_index, delete_documents_in_index, update_documents_in_index, persist_index, load_index

client = Client(base_url="http://api.example.com")

# List all indexes
indexes = await list_indexes.asyncio(client=client)

#Create Index Request
resp = await create_index.asyncio(client=client,body=IndexRequest(
    index_name="test_index",
    documents=[
        Document(
            text="Sample document text",
            metadata={"source": "unit_test"}
        )
    ]
))

# Delete Index Request
delete_resp = await delete_index.asyncio(client=client, index_name="test_index")

# Persist Index Request
persist_index_resp = await persist_index.asyncio(
    client=client,
    index_name="test_index"
)

# Load Index Request
load_index_resp = await load_index.asyncio(
    client=client,
    index_name="test_index",
    overwrite=True
)

# List Documents In Index Request
list_resp = await list_documents_in_index.asyncio(client=client, index_name="test_index")

# Update a Documents text
test_doc = list_resp.documents[0]
test_doc.text = "Updated document text"

# Update Documents Request
update_resp = await update_documents_in_index.asyncio(
    client=client,
    index_name="test_index",
    body=UpdateDocumentRequest(
        documents=[
            test_doc
        ]
    )
)

# Delete Documents Request
delete_doc_resp = await delete_documents_in_index.asyncio(
    client=client,
    index_name="test_index",
    body=DeleteDocumentRequest(
        doc_ids=[
            test_doc.doc_id
        ]
    )
)

# Chat Completions Request
chat_resp = await chat.asyncio(client=client, body=ChatRequest.from_dict({
        "index_name": "test_index",
        "model": "<YOUR_MODEL>",
        "messages": [{"role": "user", "content": "What can you tell me about AI?"}],
        "temperature": 0.7,
        "max_tokens": 100,
    }
))

By default, when you're calling an HTTPS API it will attempt to verify that SSL is working correctly. Using certificate verification is highly recommended most of the time, but sometimes you may need to authenticate to a server (especially an internal server) using a custom certificate bundle.

client = AuthenticatedClient(
    base_url="https://internal_api.example.com", 
    token="SuperSecretToken",
    verify_ssl="/path/to/certificate_bundle.pem",
)

You can also disable certificate validation altogether, but beware that this is a security risk.

client = AuthenticatedClient(
    base_url="https://internal_api.example.com", 
    token="SuperSecretToken", 
    verify_ssl=False
)

Things to know:

Every path/method combo becomes a Python module with four functions:
1. sync: Blocking request that returns parsed data (if successful) or None
2. sync_detailed: Blocking request that always returns a Request, optionally with parsed set if the request was successful.
3. asyncio: Like sync but async instead of blocking
4. asyncio_detailed: Like sync_detailed but async instead of blocking
All path/query params, and bodies become method arguments.
If your endpoint had any tags on it, the first tag will be used as a module name for the function (my_tag above)
Any endpoint which did not have a tag will be in kaito_rag_client.api.default

Advanced customizations

There are more settings on the generated Client class which let you control more runtime behavior, check out the docstring on that class for more info. You can also customize the underlying httpx.Client or httpx.AsyncClient (depending on your use-case):

from kaito_rag_client import Client

def log_request(request):
    print(f"Request event hook: {request.method} {request.url} - Waiting for response")

def log_response(response):
    request = response.request
    print(f"Response event hook: {request.method} {request.url} - Status {response.status_code}")

client = Client(
    base_url="https://api.example.com",
    httpx_args={"event_hooks": {"request": [log_request], "response": [log_response]}},
)

# Or get the underlying httpx client to modify directly with client.get_httpx_client() or client.get_async_httpx_client()

You can even set the httpx client directly, but beware that this will override any existing settings (e.g., base_url):

import httpx
from kaito_rag_engine_client import Client

client = Client(
    base_url="https://api.example.com",
)
# Note that base_url needs to be re-set, as would any shared cookies, headers, etc.
client.set_httpx_client(httpx.Client(base_url="https://api.example.com", proxies="http://localhost:8030"))

License

This project is licensed under the Apache License 2.0. See the LICENSE file for details.

Support

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

brfole.microsoft

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.9.0

Mar 2, 2026

This version

0.8.1

Feb 5, 2026

0.7.0

Nov 3, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

kaito_rag_engine_client-0.8.1.tar.gz (32.6 kB view details)

Uploaded Feb 5, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

kaito_rag_engine_client-0.8.1-py3-none-any.whl (59.0 kB view details)

Uploaded Feb 5, 2026 Python 3

File details

Details for the file kaito_rag_engine_client-0.8.1.tar.gz.

File metadata

Download URL: kaito_rag_engine_client-0.8.1.tar.gz
Upload date: Feb 5, 2026
Size: 32.6 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for kaito_rag_engine_client-0.8.1.tar.gz
Algorithm	Hash digest
SHA256	`7a2109b5e6805a84b8dc2a06feeab99089fd98f2153a2285ef48ded5542b1302`
MD5	`527420b6a2db8b50afa10a135e017c3f`
BLAKE2b-256	`a1de6799f9964de58a4120d87e8243f38e8d7c42353dd53dfd45556b3117532c`

See more details on using hashes here.

Provenance

The following attestation bundles were made for kaito_rag_engine_client-0.8.1.tar.gz:

Publisher: create-release.yaml on kaito-project/kaito-rag-api

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: kaito_rag_engine_client-0.8.1.tar.gz
- Subject digest: 7a2109b5e6805a84b8dc2a06feeab99089fd98f2153a2285ef48ded5542b1302
- Sigstore transparency entry: 920067747
- Sigstore integration time: Feb 5, 2026
Source repository:
- Permalink: kaito-project/kaito-rag-api@289b4bcbb1f66818f0fcea1994c96f0176400cd1
- Branch / Tag: refs/heads/main
- Owner: https://github.com/kaito-project
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: create-release.yaml@289b4bcbb1f66818f0fcea1994c96f0176400cd1
- Trigger Event: workflow_dispatch

File details

Details for the file kaito_rag_engine_client-0.8.1-py3-none-any.whl.

File metadata

Download URL: kaito_rag_engine_client-0.8.1-py3-none-any.whl
Upload date: Feb 5, 2026
Size: 59.0 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for kaito_rag_engine_client-0.8.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`1bf5a3dcd92146e9e29ccbe060b52d45b36acf8a4e79ae04846f8b893079475f`
MD5	`2334129bd9ac37cb4077c1984d1920e8`
BLAKE2b-256	`73a9ca18b61d0a7e2b65550ec7bf5d50bc38ea92ea17519645fa22e9565177ac`

See more details on using hashes here.

Provenance

The following attestation bundles were made for kaito_rag_engine_client-0.8.1-py3-none-any.whl:

Publisher: create-release.yaml on kaito-project/kaito-rag-api

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: kaito_rag_engine_client-0.8.1-py3-none-any.whl
- Subject digest: 1bf5a3dcd92146e9e29ccbe060b52d45b36acf8a4e79ae04846f8b893079475f
- Sigstore transparency entry: 920067754
- Sigstore integration time: Feb 5, 2026
Source repository:
- Permalink: kaito-project/kaito-rag-api@289b4bcbb1f66818f0fcea1994c96f0176400cd1
- Branch / Tag: refs/heads/main
- Owner: https://github.com/kaito-project
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: create-release.yaml@289b4bcbb1f66818f0fcea1994c96f0176400cd1
- Trigger Event: workflow_dispatch

kaito-rag-engine-client 0.8.1

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

KAITO RAG Engine Client

Client Generation

About KAITO

Features

Integration with KAITO

Usage

Advanced customizations

License

Support

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance