KitchenAI Whisk - Whisk Up Your Bento Box. A tool for running kitchenai apps.

Whisk - KitchenAI Task Management SDK

Whisk is an SDK for building AI applications with KitchenAI. It provides a clean interface for registering query, storage, and embedding handlers and for managing the dependencies they share.

Features

Simple CLI interface for managing BentoML services
Seamless integration with KitchenAI's infrastructure
Built-in NATS messaging support
Easy configuration management

Configuration

Whisk can be configured either through a YAML file or environment variables.

Using a Config File

Create a config.yml file:

nats:
  url: "nats://localhost:4222"
  user: "playground"
  password: "kitchenai_playground"
client:
  id: "whisk_client"
llm:
  cloud_api_key: ""  # Set via environment variable LLAMA_CLOUD_API_KEY
chroma:
  path: "chroma_db"

Using Environment Variables

Alternatively, you can configure Whisk using environment variables:

export WHISK_NATS_URL="nats://localhost:4222"
export WHISK_NATS_USER="playground"
export WHISK_NATS_PASSWORD="kitchenai_playground"
export WHISK_CLIENT_ID="whisk_client"
export LLAMA_CLOUD_API_KEY="your-key"
export WHISK_CHROMA_PATH="chroma_db"
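
The relationship between the two configuration sources can be sketched as a simple merge. The snippet below is an illustration only, not Whisk's actual loader: the ENV_TO_CONFIG mapping and merge_settings function are hypothetical names, and it assumes that a set environment variable takes precedence over the value from config.yml.

```python
import os

# Hypothetical mapping from the environment variables listed above to
# (section, key) pairs in config.yml. Not taken from Whisk's source.
ENV_TO_CONFIG = {
    "WHISK_NATS_URL": ("nats", "url"),
    "WHISK_NATS_USER": ("nats", "user"),
    "WHISK_NATS_PASSWORD": ("nats", "password"),
    "WHISK_CLIENT_ID": ("client", "id"),
    "LLAMA_CLOUD_API_KEY": ("llm", "cloud_api_key"),
    "WHISK_CHROMA_PATH": ("chroma", "path"),
}


def merge_settings(file_config, environ=None):
    """Return a copy of file_config with any set environment variables applied."""
    environ = os.environ if environ is None else environ
    merged = {section: dict(values) for section, values in file_config.items()}
    for name, (section, key) in ENV_TO_CONFIG.items():
        value = environ.get(name)
        if value:  # ignore unset or empty variables
            merged.setdefault(section, {})[key] = value
    return merged


# A dict stands in for the parsed contents of config.yml
file_config = {"nats": {"url": "nats://localhost:4222"}, "client": {"id": "whisk_client"}}
settings = merge_settings(file_config, {"WHISK_NATS_URL": "nats://nats.internal:4222"})
print(settings["nats"]["url"])
```

Passing the environment as an argument keeps the function easy to test without touching the real process environment.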

Installation

pip install kitchenai-whisk

Quick Start

from whisk.kitchenai_sdk.kitchenai import KitchenAIApp
from whisk.kitchenai_sdk.schema import WhiskQuerySchema, WhiskQueryBaseResponseSchema

# Initialize app
kitchen = KitchenAIApp(namespace="quickstart")

# Create a simple query handler
@kitchen.query.handler("query")
async def query_handler(data: WhiskQuerySchema) -> WhiskQueryBaseResponseSchema:
    return WhiskQueryBaseResponseSchema(
        input=data.query,
        output="Response to: " + data.query
    )

Dependency Management

Whisk provides a type-based dependency injection system similar to FastAPI. Dependencies are automatically injected based on type annotations:

from whisk.kitchenai_sdk.kitchenai import KitchenAIApp
from whisk.kitchenai_sdk.schema import WhiskQuerySchema, WhiskQueryBaseResponseSchema
from llama_index.llms.openai import OpenAI
from llama_index.vector_stores.chroma import ChromaVectorStore
from llama_index.core.prompts import PromptTemplate

# Initialize app
kitchen = KitchenAIApp(namespace="rag-app")

# Initialize and register dependencies
llm = OpenAI(model="gpt-3.5-turbo")
vector_store = ChromaVectorStore(...)
system_prompt = PromptTemplate("You are a helpful assistant...")

kitchen.register_dependency(OpenAI, llm)  # Register by type
kitchen.register_dependency(ChromaVectorStore, vector_store)
kitchen.register_dependency(PromptTemplate, system_prompt)

# Dependencies are injected based on type annotations
@kitchen.query.handler("query")
async def query_handler(
    data: WhiskQuerySchema,
    llm: OpenAI,                    # Injected automatically
    vector_store: ChromaVectorStore,  # Injected automatically
    system_prompt: PromptTemplate     # Injected automatically
) -> WhiskQueryBaseResponseSchema:
    # Use dependencies directly
    # acomplete() takes a single prompt string, so prepend the system prompt text
    prompt = f"{system_prompt.format()}\n\n{data.query}"
    response = await llm.acomplete(prompt)
    return WhiskQueryBaseResponseSchema.from_llm_invoke(
        data.query,
        response.text
    )

# You can also use the DependencyType enum if you prefer
from whisk.kitchenai_sdk.schema import DependencyType

@kitchen.query.handler("query")
async def another_handler(
    data: WhiskQuerySchema,
    llm: DependencyType.LLM,              # Also works with enum types
    vector_store: DependencyType.VECTOR_STORE,
    system_prompt: DependencyType.SYSTEM_PROMPT
) -> WhiskQueryBaseResponseSchema:
    # Dependencies are still injected automatically
    response = await llm.acomplete(data.query)
    return WhiskQueryBaseResponseSchema.from_llm_invoke(
        data.query,
        response.text
    )
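
To make the idea of annotation-driven injection concrete, here is a minimal, framework-free sketch. It is not Whisk's implementation: MiniContainer, EchoLLM, and their methods are hypothetical names used only to show how a registry keyed by type can fill in handler arguments by reading the function signature.

```python
import asyncio
import inspect


class MiniContainer:
    """Toy registry that resolves handler arguments from their annotations."""

    def __init__(self):
        self._deps = {}

    def register_dependency(self, key, instance):
        self._deps[key] = instance

    def resolve_kwargs(self, func):
        """Build keyword arguments for every parameter whose annotation is registered."""
        kwargs = {}
        for name, param in inspect.signature(func).parameters.items():
            if param.annotation in self._deps:
                kwargs[name] = self._deps[param.annotation]
        return kwargs

    async def call(self, func, data):
        """Call the handler with data plus any resolved dependencies."""
        return await func(data, **self.resolve_kwargs(func))


class EchoLLM:
    """Stand-in for a real LLM client."""

    async def acomplete(self, prompt):
        return f"echo: {prompt}"


async def query_handler(data: str, llm: EchoLLM) -> str:
    return await llm.acomplete(data)


container = MiniContainer()
container.register_dependency(EchoLLM, EchoLLM())
result = asyncio.run(container.call(query_handler, "hello"))
print(result)  # echo: hello
```

Because the lookup key is the annotation itself, the same registry could just as well be keyed by enum members or strings, which is presumably why both styles are offered.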

Available Dependency Types

You can inject dependencies either by their actual types or using the DependencyType enum:

# Using actual types
def handler(
    llm: OpenAI,
    vector_store: ChromaVectorStore,
    embeddings: OpenAIEmbeddings,
    prompt: PromptTemplate
): ...

# Using enum types
def handler(
    llm: DependencyType.LLM,
    vector_store: DependencyType.VECTOR_STORE,
    embeddings: DependencyType.EMBEDDINGS,
    system_prompt: DependencyType.SYSTEM_PROMPT,
    retriever: DependencyType.RETRIEVER
): ...

Registering Dependencies

Dependencies can be registered in several ways:

# By type (recommended)
kitchen.register_dependency(OpenAI, llm)
kitchen.register_dependency(ChromaVectorStore, vector_store)

# By enum
kitchen.register_dependency(DependencyType.LLM, llm)
kitchen.register_dependency(DependencyType.VECTOR_STORE, vector_store)

# With custom keys
kitchen.register_dependency("my_llm", llm)
kitchen.register_dependency("my_store", vector_store)

Best Practices

Use Type Annotations: Prefer using actual types over enum types for better IDE support
Register at Startup: Register all dependencies when initializing your app
Type Safety: Use type hints consistently for better error detection
Single Responsibility: Each handler should only request dependencies it actually needs
Documentation: Document any special dependency requirements in handler docstrings

Handler Types

Query Handlers

Query handlers process text queries and return responses:

@kitchen.query.handler("query", DependencyType.LLM)
async def query_handler(data: WhiskQuerySchema) -> WhiskQueryBaseResponseSchema:
    """
    Args:
        data.query: The input query string
        data.metadata: Optional metadata dictionary
        data.label: Handler label
        data.stream: Whether to stream response
    """
    return WhiskQueryBaseResponseSchema(
        input=data.query,
        output="response",
        token_counts=token_counts,  # Optional; assumes token_counts was computed earlier in the handler
        metadata={"key": "value"}   # Optional
    )

Storage Handlers

Storage handlers manage document ingestion and storage:

@kitchen.storage.handler("storage", DependencyType.VECTOR_STORE)
async def storage_handler(data: WhiskStorageSchema) -> WhiskStorageResponseSchema:
    """
    Args:
        data.id: Document ID
        data.name: Document name
        data.data: Binary document data
        data.label: Handler label
        data.metadata: Optional metadata
    """
    return WhiskStorageResponseSchema(
        id=data.id,
        status="complete",
        metadata={"stored": True}  # Optional
    )

Embedding Handlers

Embedding handlers process text into vector embeddings:

@kitchen.embeddings.handler("embed", DependencyType.EMBEDDINGS)
async def embed_handler(data: WhiskEmbedSchema) -> WhiskEmbedResponseSchema:
    """
    Args:
        data.text: Text to embed
        data.label: Handler label
        data.metadata: Optional metadata
    """
    return WhiskEmbedResponseSchema(
        metadata={"embedded": True},
        token_counts=token_counts  # Optional; assumes token_counts was computed earlier in the handler
    )

Running Your App

Start your Whisk app using the CLI:

# Development with auto-reload
whisk run app:kitchen --reload

# Production
whisk run app:kitchen
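
The app:kitchen argument looks like the common "module:attribute" convention, where app is the module (app.py) and kitchen is the KitchenAIApp instance defined in it. Assuming that is how the CLI reads it, resolving such a target can be sketched as follows. load_target is a hypothetical helper, not part of Whisk, and a standard-library target is used so the example runs anywhere.

```python
import importlib


def load_target(target):
    """Resolve a "module:attribute" string to the object it names."""
    module_name, sep, attr_name = target.partition(":")
    if not sep or not module_name or not attr_name:
        raise ValueError(f"expected 'module:attribute', got {target!r}")
    module = importlib.import_module(module_name)
    return getattr(module, attr_name)


# Standard-library stand-in for "app:kitchen"
dumps = load_target("json:dumps")
print(dumps({"ok": True}))
```

Under this reading, the module must be importable from the directory where the command is run.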

Best Practices

Dependency Organization: Register all dependencies at startup
Error Handling: Always return proper response schemas
Metadata: Use metadata for tracking and debugging
Token Counting: Track token usage when possible
Type Hints: Use type hints for better code clarity

Configuration

Configure your app using environment variables or config files:

# config.yml
nats:
  url: nats://localhost:4222
  user: your-user
  password: your-password

For more examples and details, see the KitchenAI documentation.

Usage

whisk --help

Project Structure

For larger projects, it's recommended to organize your handlers into modules. Here are some recommended patterns:

Pattern 1: Module-based Organization

my_whisk_app/
├── app.py              # Main app initialization
├── config.yml          # Configuration
├── handlers/
│   ├── __init__.py
│   ├── query/          # Group query handlers by domain
│   │   ├── __init__.py
│   │   ├── chat.py
│   │   ├── rag.py
│   │   └── tools.py
│   ├── storage/        # Storage handlers
│   │   ├── __init__.py
│   │   └── documents.py
│   └── embed/          # Embedding handlers
│       ├── __init__.py
│       └── text.py
└── dependencies/       # Dependency initialization
    ├── __init__.py
    ├── llm.py
    └── vector_store.py

# app.py
from whisk.kitchenai_sdk.kitchenai import KitchenAIApp
from handlers.query import chat, rag, tools
from handlers.storage import documents
from handlers.embed import text

# Initialize app
kitchen = KitchenAIApp(namespace="large-app")

# Register all handlers
chat.register_handlers(kitchen)
rag.register_handlers(kitchen)
tools.register_handlers(kitchen)
documents.register_handlers(kitchen)
text.register_handlers(kitchen)

# handlers/query/chat.py
from whisk.kitchenai_sdk.schema import WhiskQuerySchema, WhiskQueryBaseResponseSchema, DependencyType

def register_handlers(kitchen):
    @kitchen.query.handler("chat")
    async def chat_handler(data: WhiskQuerySchema, llm=None) -> WhiskQueryBaseResponseSchema:
        """Basic chat handler"""
        ...

    @kitchen.query.handler("chat_stream")
    async def stream_handler(data: WhiskQuerySchema, llm=None) -> WhiskQueryBaseResponseSchema:
        """Streaming chat handler"""
        ...

Pattern 2: Class-based Handlers

For more complex handlers that share state or utilities:

# handlers/query/rag.py
from whisk.kitchenai_sdk.schema import WhiskQuerySchema, WhiskQueryBaseResponseSchema, DependencyType

class RAGHandlers:
    def __init__(self, kitchen):
        self.kitchen = kitchen
        self.register_handlers()
    
    def register_handlers(self):
        # Use instance method to share utilities
        self.kitchen.query.handler("rag")(self.rag_query)
        self.kitchen.query.handler("rag_stream")(self.rag_stream)
    
    async def rag_query(self, data: WhiskQuerySchema, llm=None, vector_store=None) -> WhiskQueryBaseResponseSchema:
        """RAG query handler"""
        docs = await self._get_relevant_docs(data.query, vector_store)
        return await self._generate_response(data.query, docs, llm)
    
    async def rag_stream(self, data: WhiskQuerySchema, llm=None, vector_store=None) -> WhiskQueryBaseResponseSchema:
        """Streaming RAG handler"""
        docs = await self._get_relevant_docs(data.query, vector_store)
        return await self._stream_response(data.query, docs, llm)
    
    async def _get_relevant_docs(self, query, vector_store):
        """Shared utility for document retrieval"""
        ...
    
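    async def _generate_response(self, query, docs, llm):
        """Shared response generation logic"""
        ...
    
    async def _stream_response(self, query, docs, llm):
        """Shared streaming logic used by rag_stream"""
        ...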

# app.py
from handlers.query.rag import RAGHandlers

rag_handlers = RAGHandlers(kitchen)

Pattern 3: Router-based Organization

For grouping related handlers with shared dependencies:

# handlers/query/tools.py
from whisk.kitchenai_sdk.schema import WhiskQuerySchema, WhiskQueryBaseResponseSchema

class ToolRouter:
    def __init__(self, kitchen):
        self.kitchen = kitchen
    
    def register_handlers(self):
        # Register all tool handlers with shared prefix
        @self.kitchen.query.handler("tools/calculator")
        async def calculator(data: WhiskQuerySchema, llm=None):
            """Math calculation tool"""
            ...
        
        @self.kitchen.query.handler("tools/weather")
        async def weather(data: WhiskQuerySchema, llm=None):
            """Weather lookup tool"""
            ...
        
        @self.kitchen.query.handler("tools/search")
        async def search(data: WhiskQuerySchema, llm=None):
            """Web search tool"""
            ...

# app.py
from handlers.query.tools import ToolRouter

tool_router = ToolRouter(kitchen)
tool_router.register_handlers()

Best Practices

Handler Organization:
- Group related handlers in modules
- Use clear naming conventions
- Keep handler files focused and single-purpose
Dependency Management:
- Initialize dependencies at app startup
- Share dependencies across related handlers
- Use dependency injection for testing
Code Structure:
- Use classes for complex handlers with shared logic
- Use routers for grouping related endpoints
- Keep handler registration clear and explicit
Testing:
- Test handlers in isolation
- Use dependency injection for mocking
- Group tests by handler module
Documentation:
- Document handler purposes and requirements
- Include example requests/responses
- Document any special dependencies

This structure makes it easy to:

Add new handlers without touching existing code
Share utilities and dependencies between handlers
Test handlers in isolation
Maintain clear separation of concerns
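
As an illustration of testing a handler in isolation, the sketch below calls the handler function directly with a fake LLM instead of going through Whisk or NATS. FakeQuery, FakeCompletion, and FakeLLM are hypothetical stand-ins written for this example, and the handler returns a plain dict so the snippet runs without the SDK installed.

```python
import asyncio
from dataclasses import dataclass


@dataclass
class FakeQuery:
    """Hypothetical stand-in for WhiskQuerySchema with only the field used here."""
    query: str


@dataclass
class FakeCompletion:
    text: str


class FakeLLM:
    """Records prompts and returns a canned completion instead of calling a model."""

    def __init__(self, reply):
        self.reply = reply
        self.prompts = []

    async def acomplete(self, prompt):
        self.prompts.append(prompt)
        return FakeCompletion(text=self.reply)


async def chat_handler(data, llm):
    """Handler logic under test; returns a plain dict to stay framework-free."""
    response = await llm.acomplete(data.query)
    return {"input": data.query, "output": response.text}


def test_chat_handler_uses_injected_llm():
    llm = FakeLLM(reply="pong")
    result = asyncio.run(chat_handler(FakeQuery(query="ping"), llm))
    assert result == {"input": "ping", "output": "pong"}
    assert llm.prompts == ["ping"]


test_chat_handler_uses_injected_llm()
```

Keeping the handler body in a plain async function and registering it separately, as in the patterns above, is what makes this kind of direct call possible.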

Sub-Apps and Modular Organization

Whisk supports a modular application structure through sub-apps, allowing you to organize handlers by domain and compose them together:

from whisk.kitchenai_sdk.kitchenai import KitchenAIApp
from whisk.kitchenai_sdk.schema import (
    WhiskQuerySchema, 
    WhiskQueryBaseResponseSchema,
    DependencyType
)

# Create domain-specific sub-apps
chat_app = KitchenAIApp(namespace="chat")
rag_app = KitchenAIApp(namespace="rag")
tools_app = KitchenAIApp(namespace="tools")

# Define handlers in each sub-app
@chat_app.query.handler("basic")
async def basic_chat(data: WhiskQuerySchema, llm=None) -> WhiskQueryBaseResponseSchema:
    """Basic chat handler"""
    response = await llm.acomplete(data.query)
    return WhiskQueryBaseResponseSchema.from_llm_invoke(
        data.query,
        response.text
    )

@rag_app.query.handler("search")
async def rag_search(data: WhiskQuerySchema, llm=None, vector_store=None) -> WhiskQueryBaseResponseSchema:
    """RAG search handler"""
    docs = await vector_store.similarity_search(data.query)
    response = await llm.acomplete(data.query, context=docs)
    return WhiskQueryBaseResponseSchema.from_llm_invoke(
        data.query,
        response.text
    )

# Create main app and mount sub-apps
main_app = KitchenAIApp(namespace="main")
main_app.mount_app("chat", chat_app)    # Creates handler "chat.basic"
main_app.mount_app("rag", rag_app)      # Creates handler "rag.search"
main_app.mount_app("tools", tools_app)  # Would create "tools.calculator" once tools_app defines a "calculator" handler

Large Project Structure

For larger projects, organize sub-apps in separate modules:

my_project/
├── apps/
│   ├── __init__.py
│   ├── chat/
│   │   ├── __init__.py
│   │   ├── app.py          # chat_app definition
│   │   ├── handlers.py     # Chat handlers
│   │   └── utils.py        # Chat-specific utilities
│   ├── rag/
│   │   ├── __init__.py
│   │   ├── app.py          # rag_app definition
│   │   ├── handlers.py     # RAG handlers
│   │   └── retriever.py    # RAG-specific utilities
│   └── tools/
│       ├── __init__.py
│       ├── app.py          # tools_app definition
│       └── handlers.py     # Tool handlers
├── main.py                 # Main app composition
└── dependencies.py         # Shared dependencies

# apps/chat/app.py
from whisk.kitchenai_sdk.kitchenai import KitchenAIApp
from .handlers import basic_chat, stream_chat

chat_app = KitchenAIApp(namespace="chat")
chat_app.query.handler("basic")(basic_chat)
chat_app.query.handler("stream")(stream_chat)

# main.py
from whisk.kitchenai_sdk.kitchenai import KitchenAIApp
from apps.chat.app import chat_app
from apps.rag.app import rag_app
from apps.tools.app import tools_app
from dependencies import setup_dependencies

# Create main app
kitchen = KitchenAIApp(namespace="main")

# Setup shared dependencies
setup_dependencies(kitchen)

# Mount sub-apps
kitchen.mount_app("chat", chat_app)
kitchen.mount_app("rag", rag_app)
kitchen.mount_app("tools", tools_app)

# The resulting NATS subjects will be:
# - kitchenai.service.{client_id}.query.chat.basic
# - kitchenai.service.{client_id}.query.chat.stream
# - kitchenai.service.{client_id}.query.rag.search
# - kitchenai.service.{client_id}.query.tools.calculator

Benefits of Sub-Apps

Modularity: Each sub-app can be developed and tested independently
Organization: Group related handlers and their dependencies
Reusability: Sub-apps can be reused across different projects
Maintainability: Easier to manage large codebases
Isolation: Each sub-app maintains its own namespace

Dependency Management

Dependencies can be:

Registered at the sub-app level for domain-specific dependencies
Registered at the main app level for shared dependencies
Automatically propagated to sub-apps when mounted

# Register dependencies at sub-app level
chat_app.register_dependency(DependencyType.LLM, chat_llm)
rag_app.register_dependency(DependencyType.VECTOR_STORE, vector_store)

# Or register shared dependencies at main app level
main_app.register_dependency(DependencyType.LLM, shared_llm)
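
One way to picture how sub-app and main-app registrations could combine is a layered lookup. The sketch below is an assumption, not a description of Whisk's internals: it uses plain dicts as registries and assumes a sub-app's own registration wins over a shared one from the main app.

```python
from collections import ChainMap

# Hypothetical registries: plain dicts keyed by dependency name
main_deps = {"llm": "shared_llm", "vector_store": "shared_store"}
chat_deps = {"llm": "chat_llm"}

# ChainMap searches chat_deps first, then falls back to main_deps
chat_view = ChainMap(chat_deps, main_deps)

print(chat_view["llm"])           # chat_llm
print(chat_view["vector_store"])  # shared_store
```

If the actual precedence matters for your app, verify it against the installed version rather than relying on this sketch.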

Handler Labels

When mounting sub-apps, handler labels are automatically prefixed with the sub-app name:

Original label: "basic" in chat_app
Mounted label: "chat.basic" in main_app
NATS subject: kitchenai.service.{client_id}.query.chat.basic
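
The naming scheme above can be written out as two small string helpers. These functions are hypothetical and only reproduce the pattern shown in this section; whether storage and embedding handlers use the same layout is an assumption.

```python
def mounted_label(prefix, label):
    """Label a sub-app handler receives once mounted under prefix."""
    return f"{prefix}.{label}"


def nats_subject(client_id, handler_type, label):
    """Subject following the kitchenai.service.{client_id}.{type}.{label} pattern."""
    return f"kitchenai.service.{client_id}.{handler_type}.{label}"


label = mounted_label("chat", "basic")
subject = nats_subject("whisk_client", "query", label)
print(subject)  # kitchenai.service.whisk_client.query.chat.basic
```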

Inter-Handler Dependencies

Handlers can depend on other handlers using the built-in NATS client for inter-handler communication:

from whisk.client import WhiskClient
from whisk.kitchenai_sdk.schema import (
    WhiskQuerySchema,
    WhiskStorageSchema,
    WhiskEmbedSchema,
    WhiskQueryBaseResponseSchema
)

# Create a RAG handler that depends on storage and embedding handlers
@rag_app.query.handler("answer")
async def rag_answer(
    data: WhiskQuerySchema, 
    llm=None, 
    client: WhiskClient=None  # Inject the NATS client
) -> WhiskQueryBaseResponseSchema:
    """RAG handler that uses other handlers for storage and embeddings"""
    
    # Use storage handler through NATS
    storage_response = await client.request(
        "storage.ingest",  # Will be prefixed with proper NATS subject
        WhiskStorageSchema(
            id="doc1",
            data=data.metadata.get("document"),
            metadata={"source": "rag_handler"}
        )
    )
    
    # Use embedding handler through NATS
    embed_response = await client.request(
        "embeddings.create",
        WhiskEmbedSchema(
            text=data.query,
            metadata={"type": "query"}
        )
    )
    
    # Use the results to generate final response
    response = await llm.acomplete(
        data.query,
        context=storage_response.data,
        embeddings=embed_response.embeddings
    )
    
    return WhiskQueryBaseResponseSchema.from_llm_invoke(
        data.query,
        response.text
    )

# The client is automatically injected by the WhiskClient when running the app
client = WhiskClient(
    nats_url="nats://localhost:4222",
    kitchen=kitchen
)

This pattern allows you to:

Compose complex handlers from simpler ones
Maintain loose coupling between handlers
Reuse functionality across different handlers
Scale handlers independently
Handle failures gracefully

Best Practices for Inter-Handler Dependencies

Error Handling: Always handle potential failures from dependent handlers
Timeouts: Set appropriate timeouts for inter-handler requests
Circuit Breaking: Implement fallbacks for when dependent handlers fail
Monitoring: Track inter-handler dependencies for observability
Documentation: Document handler dependencies clearly

# Example with better error handling and timeouts
import logging

logger = logging.getLogger(__name__)

@rag_app.query.handler("answer")
async def rag_answer(data: WhiskQuerySchema, llm=None, client: WhiskClient=None):
    try:
        # Set timeout for storage request
        storage_response = await client.request(
            "storage.ingest",
            WhiskStorageSchema(...),
            timeout=5.0  # 5 second timeout
        )
    except TimeoutError:
        # Fallback behavior; default_storage_response() stands in for your own fallback
        logger.error("Storage handler timeout")
        storage_response = default_storage_response()
    except Exception as e:
        logger.error(f"Storage handler error: {e}")
        raise

    # Continue with embedding and response generation...
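
The timeout and fallback advice can also be factored into a reusable helper. The sketch below uses only asyncio and does not touch WhiskClient: request_with_fallback and the two stand-in request functions are hypothetical, and the fallback is a plain dict chosen for illustration.

```python
import asyncio


async def request_with_fallback(make_request, fallback, timeout=5.0):
    """Await make_request(); return fallback if it does not finish in time."""
    try:
        return await asyncio.wait_for(make_request(), timeout=timeout)
    except asyncio.TimeoutError:
        return fallback


async def slow_storage_request():
    """Stand-in for a dependent handler that responds too slowly."""
    await asyncio.sleep(1.0)
    return {"status": "complete"}


async def fast_storage_request():
    """Stand-in for a dependent handler that responds immediately."""
    return {"status": "complete"}


async def main():
    degraded = await request_with_fallback(
        slow_storage_request, {"status": "unavailable"}, timeout=0.05
    )
    healthy = await request_with_fallback(
        fast_storage_request, {"status": "unavailable"}, timeout=0.05
    )
    return degraded, healthy


degraded, healthy = asyncio.run(main())
print(degraded["status"], healthy["status"])  # unavailable complete
```

Only the timeout is converted into a fallback here; other exceptions still propagate, which matches the advice to handle failures explicitly rather than hide them.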

Release history

0.2.3 (Feb 20, 2025)
0.2.2 (Feb 18, 2025)
0.2.1 (Feb 18, 2025)
0.2.0 (Feb 17, 2025)
0.1.6 (Feb 12, 2025, this version)
0.1.5 (Feb 7, 2025)
0.1.4 (Feb 5, 2025)
0.1.3 (Jan 31, 2025)
0.1.2 (Jan 22, 2025)
0.1.1 (Jan 21, 2025)
0.1.0 (Jan 16, 2025)
