A package that enables interaction with a Kobai tenant.
Project description
Kobai SDK for Python (Alpha)
Alpha: This SDK is not currently supported for production use while we stabilize the interface.
The Kobai SDK for Python includes functionality to accelerate development with Python on the Kobai Semantic Layer. It does not cover all Kobai Studio features, but rather focuses on integrating a Kobai tenant with data science and AI activities on the backend.
Getting Started
This exercise demonstrates using the Kobai SDK to create a Databricks "Genie" Data Room environment, enabling users to interact with Kobai data in an AI Chat interface.
- Please install Kobai SDK for Python via
pip install kobai-sdk, gather some configuration details of the Kobai instance and tenant to connect to, and instantiateTenantClient:
from kobai import tenant_client, spark_client, databricks_client
schema = 'main.demo'
uri = 'https://demo.kobai.io'
tenant_id = '1'
tenant_name = 'My Demo Tenant'
k = tenant_client.TenantClient(tenant_name, tenant_id, uri, schema)
- Authenticate with the Kobai instance:
client_id = 'your_Entra_app_id_here'
tenant_id = 'your_Entra_directory_id_here'
k.authenticate(client_id, tenant_id)
- Initialize a Spark client using your current
SparkSession, and generate semantically-rich SQL views describing this Kobai tenant:
k.spark_init_session(spark)
k.spark_generate_genie_views()
- Initialize a Databricks API client using your Notebook context, and create a Genie Data Rooms environment for this Kobai tenant.
notebook_context = dbutils.notebook.entry_point.getDbutils().notebook().getContext()
sql_warehouse = '8834d98a8agffa76'
k.databricks_init_notebook(notebook_context, sql_warehouse)
k.databricks_build_genie()
AI Functionality
The Kobai SDK enables users to ask follow-up questions based on the results of previous queries. This functionality currently supports models hosted on Databricks and Azure OpenAI.
Prerequisites
Before asking a follow-up question, ensure that you have instantiated the TenantClient and completed the authentication process.
Steps to Ask a Follow-Up Question
- List Questions: Retrieve the questionId or questionName. You can list all questions or filter by domain.
k.list_questions() # List all questions
k.list_domains() # To get the domain labels
k.list_questions(domain_label="LegoCollecting") # List questions by domain
- Ask a Question: Use either the questionId or questionName to submit your query.
question_json = k.run_question_remote(2927) # By questionId
kobai_query_name = "Set ownership"
question_json = k.run_question_remote(k.get_question_id(kobai_query_name)) # By questionName
- Ask a Follow-Up Question: Based on the initial results, you can ask a follow-up question using either Azure OpenAI, Databricks or a user-provided chat model.
Using Azure OpenAI
Authentication Methods:
- ApiKey
from kobai import ai_query, llm_config
import json
followup_question = "Which owner owns the most sets?"
llm_config = llm_config.LLMConfig(endpoint="https://kobaipoc.openai.azure.com/", api_key="YOUR_API_KEY", deployment="gpt-4o-mini", llm_provider="azure_openai")
output = ai_query.followup_question(followup_question, json.dumps(question_json), kobai_query_name, llm_config=llm_config)
print(output)
- Azure Active Directory Authentication
Ensure that the logged-in tenant has access to Azure OpenAI. In case of databricks notebook, the logged in service principal should have access to Azure OpenAI.
from kobai import ai_query, llm_config
import json
followup_question = "Which owner owns the most sets?"
llm_config = llm_config.LLMConfig(endpoint="https://kobaipoc.openai.azure.com/", deployment="gpt-4o-mini", llm_provider="azure_openai")
llm_config.get_azure_ad_token()
output = ai_query.followup_question(followup_question, json.dumps(question_json), kobai_query_name, llm_config=llm_config)
print(output)
Using Databricks (Default Configuration)
from kobai import ai_query, llm_config
import json
followup_question = "Which owner owns the most sets?"
llm_config = llm_config.LLMConfig()
output = ai_query.followup_question(followup_question, json.dumps(question_json), kobai_query_name, llm_config=llm_config)
print(output)
User Provided Chat Model
from kobai import ai_query, llm_config
import json
from langchain_openai import AzureChatOpenAI
followup_question = "Which owner owns the most sets?"
llm_config = llm_config.LLMConfig(debug=True)
chat_model = AzureChatOpenAI(
azure_endpoint="https://kobaipoc.openai.azure.com/", azure_deployment="gpt-4o-mini",
api_key = "YOUR_API_KEY",
openai_api_version="2024-02-15-preview",
temperature=0.5,
max_tokens=150,)
output = ai_query.followup_question(followup_question, json.dumps(question_json), kobai_query_name, override_model=chat_model, llm_config=llm_config)
print(output)
Limitations
This version of the SDK is limited to use in certain contexts, as described below:
- Authentication is limited to MS Entra AD.
- Functionality limited to Databricks Notebook environments at this time.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file kobai_sdk-0.2.8rc10.tar.gz.
File metadata
- Download URL: kobai_sdk-0.2.8rc10.tar.gz
- Upload date:
- Size: 31.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.12.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
141edd55ddcbea6ae6d4248efe32b0808b6b6a0abe89e410524e72b805385cde
|
|
| MD5 |
aaffeb7deb37bc998056102072c4d6ea
|
|
| BLAKE2b-256 |
18aba408a44697ec76394cd630a9326219b4718ab683d3ccb134617f44a02b52
|
File details
Details for the file kobai_sdk-0.2.8rc10-py3-none-any.whl.
File metadata
- Download URL: kobai_sdk-0.2.8rc10-py3-none-any.whl
- Upload date:
- Size: 31.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.12.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
f0676cfa41182d3dddc03e23cf4f74ab1a57d6873818e1f84e7a5323bc85db2e
|
|
| MD5 |
d36bf842cd93158f0a568e2571ba60f3
|
|
| BLAKE2b-256 |
ba39b625724c0bd747848678b596a6d7f63d66a8996896b18255be7a3b14c5a1
|