Python SDK for the Kailash container-node architecture

These details have not been verified by PyPI

Project links

Development Status
- 3 - Alpha
Intended Audience
- Developers
Programming Language

Project description

Kailash Python SDK

MIT License Code style: black Tests: 746 passing Coverage: 100%

A Pythonic SDK for the Kailash container-node architecture

Build workflows that seamlessly integrate with Kailash's production environment while maintaining the flexibility to prototype quickly and iterate locally.

✨ Highlights

🚀 Rapid Prototyping: Create and test workflows locally without containerization
🏗️ Architecture-Aligned: Automatically ensures compliance with Kailash standards
🔄 Seamless Handoff: Export prototypes directly to production-ready formats
📊 Real-time Monitoring: Live dashboards with WebSocket streaming and performance metrics
🧩 Extensible: Easy to create custom nodes for domain-specific operations
⚡ Fast Installation: Uses uv for lightning-fast Python package management
🤖 AI-Powered: Complete LLM agents, embeddings, and hierarchical RAG architecture
🧠 Retrieval-Augmented Generation: Full RAG pipeline with intelligent document processing
🌐 REST API Wrapper: Expose any workflow as a production-ready API in 3 lines

🎯 Who Is This For?

The Kailash Python SDK is designed for:

AI Business Coaches (ABCs) who need to prototype workflows quickly
Data Scientists building ML pipelines compatible with production infrastructure
Engineers who want to test Kailash workflows locally before deployment
Teams looking to standardize their workflow development process

🚀 Quick Start

Installation

Requirements: Python 3.11 or higher

# Install uv if you haven't already
curl -LsSf https://astral.sh/uv/install.sh | sh

# For users: Install from PyPI
pip install kailash

# For developers: Clone and sync
git clone https://github.com/integrum/kailash-python-sdk.git
cd kailash-python-sdk
uv sync

Your First Workflow

from kailash.workflow import Workflow
from kailash.nodes.data import CSVReader
from kailash.nodes.code import PythonCodeNode
from kailash.runtime.local import LocalRuntime
import pandas as pd

# Create a workflow
workflow = Workflow("customer_analysis", name="customer_analysis")

# Add data reader
reader = CSVReader(file_path="customers.csv")
workflow.add_node("read_customers", reader)

# Add custom processing using Python code
def analyze_customers(data):
    """Analyze customer data and compute metrics."""
    df = pd.DataFrame(data)
    # Convert total_spent to numeric
    df['total_spent'] = pd.to_numeric(df['total_spent'])
    return {
        "result": {
            "total_customers": len(df),
            "avg_spend": df["total_spent"].mean(),
            "top_customers": df.nlargest(10, "total_spent").to_dict("records")
        }
    }

analyzer = PythonCodeNode.from_function(analyze_customers, name="analyzer")
workflow.add_node("analyze", analyzer)

# Connect nodes
workflow.connect("read_customers", "analyze", {"data": "data"})

# Run locally
runtime = LocalRuntime()
results, run_id = runtime.execute(workflow)
print(f"Analysis complete! Results: {results}")

# Export for production
from kailash.utils.export import WorkflowExporter
exporter = WorkflowExporter()
workflow.save("customer_analysis.yaml", format="yaml")

SharePoint Integration Example

from kailash.workflow import Workflow
from kailash.nodes.data import SharePointGraphReader, CSVWriter
import os

# Create workflow for SharePoint file processing
workflow = Workflow("sharepoint_processor", name="sharepoint_processor")

# Configure SharePoint reader (using environment variables)
sharepoint = SharePointGraphReader()
workflow.add_node("read_sharepoint", sharepoint)

# Process downloaded files
csv_writer = CSVWriter(file_path="sharepoint_output.csv")
workflow.add_node("save_locally", csv_writer)

# Connect nodes
workflow.connect("read_sharepoint", "save_locally")

# Execute with credentials
from kailash.runtime.local import LocalRuntime

inputs = {
    "read_sharepoint": {
        "tenant_id": os.getenv("SHAREPOINT_TENANT_ID"),
        "client_id": os.getenv("SHAREPOINT_CLIENT_ID"),
        "client_secret": os.getenv("SHAREPOINT_CLIENT_SECRET"),
        "site_url": "https://yourcompany.sharepoint.com/sites/YourSite",
        "operation": "list_files",
        "library_name": "Documents"
    }
}

runtime = LocalRuntime()
results, run_id = runtime.execute(workflow, inputs=inputs)

Hierarchical RAG Example

from kailash.workflow import Workflow
from kailash.nodes.ai.embedding_generator import EmbeddingGenerator
from kailash.nodes.ai.llm_agent import LLMAgent
from kailash.nodes.data.sources import DocumentSourceNode, QuerySourceNode
from kailash.nodes.data.retrieval import RelevanceScorerNode
from kailash.nodes.transform.chunkers import HierarchicalChunkerNode
from kailash.nodes.transform.formatters import (
    ChunkTextExtractorNode, QueryTextWrapperNode, ContextFormatterNode
)

# Create hierarchical RAG workflow
workflow = Workflow("hierarchical_rag", name="Hierarchical RAG Workflow")

# Data sources (autonomous - no external files needed)
doc_source = DocumentSourceNode()
query_source = QuerySourceNode()

# Document processing pipeline
chunker = HierarchicalChunkerNode()
chunk_text_extractor = ChunkTextExtractorNode()
query_text_wrapper = QueryTextWrapperNode()

# AI processing with Ollama
chunk_embedder = EmbeddingGenerator(
    provider="ollama", model="nomic-embed-text", operation="embed_batch"
)
query_embedder = EmbeddingGenerator(
    provider="ollama", model="nomic-embed-text", operation="embed_batch"
)

# Retrieval and response generation
relevance_scorer = RelevanceScorerNode()
context_formatter = ContextFormatterNode()
llm_agent = LLMAgent(provider="ollama", model="llama3.2", temperature=0.7)

# Add all nodes to workflow
for name, node in {
    "doc_source": doc_source, "query_source": query_source,
    "chunker": chunker, "chunk_text_extractor": chunk_text_extractor,
    "query_text_wrapper": query_text_wrapper, "chunk_embedder": chunk_embedder,
    "query_embedder": query_embedder, "relevance_scorer": relevance_scorer,
    "context_formatter": context_formatter, "llm_agent": llm_agent
}.items():
    workflow.add_node(name, node)

# Connect the RAG pipeline
workflow.connect("doc_source", "chunker", {"documents": "documents"})
workflow.connect("chunker", "chunk_text_extractor", {"chunks": "chunks"})
workflow.connect("chunk_text_extractor", "chunk_embedder", {"input_texts": "input_texts"})
workflow.connect("query_source", "query_text_wrapper", {"query": "query"})
workflow.connect("query_text_wrapper", "query_embedder", {"input_texts": "input_texts"})
workflow.connect("chunker", "relevance_scorer", {"chunks": "chunks"})
workflow.connect("query_embedder", "relevance_scorer", {"embeddings": "query_embedding"})
workflow.connect("chunk_embedder", "relevance_scorer", {"embeddings": "chunk_embeddings"})
workflow.connect("relevance_scorer", "context_formatter", {"relevant_chunks": "relevant_chunks"})
workflow.connect("query_source", "context_formatter", {"query": "query"})
workflow.connect("context_formatter", "llm_agent", {"messages": "messages"})

# Execute the RAG workflow
from kailash.runtime.local import LocalRuntime
runtime = LocalRuntime()
results, run_id = runtime.execute(workflow)

print("RAG Response:", results["llm_agent"]["response"])

Workflow API Wrapper - Expose Workflows as REST APIs

Transform any Kailash workflow into a production-ready REST API in just 3 lines of code:

from kailash.api.workflow_api import WorkflowAPI

# Take any workflow and expose it as an API
api = WorkflowAPI(workflow)
api.run(port=8000)  # That's it! Your workflow is now a REST API

Features

Automatic REST Endpoints:
- POST /execute - Execute workflow with inputs
- GET /workflow/info - Get workflow metadata
- GET /health - Health check endpoint
- Automatic OpenAPI docs at /docs

Multiple Execution Modes:

# Synchronous execution (wait for results)
curl -X POST http://localhost:8000/execute \
  -d '{"inputs": {...}, "mode": "sync"}'

# Asynchronous execution (get execution ID)
curl -X POST http://localhost:8000/execute \
  -d '{"inputs": {...}, "mode": "async"}'

# Check async status
curl http://localhost:8000/status/{execution_id}

Specialized APIs for specific domains:

from kailash.api.workflow_api import create_workflow_api

# Create a RAG-specific API with custom endpoints
api = create_workflow_api(rag_workflow, api_type="rag")
# Adds /documents and /query endpoints

Production Ready:

# Development
api.run(reload=True, log_level="debug")

# Production with SSL
api.run(
    host="0.0.0.0",
    port=443,
    ssl_keyfile="key.pem",
    ssl_certfile="cert.pem",
    workers=4
)

See the API demo example for complete usage patterns.

📚 Documentation

Resource	Description
📖 User Guide	Comprehensive guide for using the SDK
📋 API Reference	Detailed API documentation
🌐 API Integration Guide	Complete API integration documentation
🎓 Examples	Working examples and tutorials
🤝 Contributing	Contribution guidelines

🛠️ Features

📦 Pre-built Nodes

The SDK includes a rich set of pre-built nodes for common operations:

Data Operations

CSVReader - Read CSV files
JSONReader - Read JSON files
DocumentSourceNode - Sample document provider
QuerySourceNode - Sample query provider
RelevanceScorerNode - Multi-method similarity
SQLDatabaseNode - Query databases
CSVWriter - Write CSV files
JSONWriter - Write JSON files

Transform Nodes

PythonCodeNode - Custom Python logic
DataTransformer - Transform data
HierarchicalChunkerNode - Document chunking
ChunkTextExtractorNode - Extract chunk text
QueryTextWrapperNode - Wrap queries for processing
ContextFormatterNode - Format LLM context
Filter - Filter records
Aggregator - Aggregate data

Logic Nodes

Switch - Conditional routing
Merge - Combine multiple inputs
WorkflowNode - Wrap workflows as reusable nodes

AI/ML Nodes

LLMAgent - Multi-provider LLM with memory & tools
EmbeddingGenerator - Vector embeddings with caching
MCPClient/MCPServer - Model Context Protocol
TextClassifier - Text classification
SentimentAnalyzer - Sentiment analysis
NamedEntityRecognizer - NER extraction

API Integration Nodes

HTTPRequestNode - HTTP requests
RESTAPINode - REST API client
GraphQLClientNode - GraphQL queries
OAuth2AuthNode - OAuth 2.0 authentication
RateLimitedAPINode - Rate-limited API calls

Other Integration Nodes

KafkaConsumerNode - Kafka streaming
WebSocketNode - WebSocket connections
EmailNode - Send emails

SharePoint Integration

SharePointGraphReader - Read SharePoint files
SharePointGraphWriter - Upload to SharePoint

Real-time Monitoring

RealTimeDashboard - Live workflow monitoring
WorkflowPerformanceReporter - Comprehensive reports
SimpleDashboardAPI - REST API for metrics
DashboardAPIServer - WebSocket streaming server

🔧 Core Capabilities

Workflow Management

from kailash.workflow import Workflow
from kailash.nodes.logic import Switch
from kailash.nodes.transform import DataTransformer

# Create complex workflows with branching logic
workflow = Workflow("data_pipeline", name="data_pipeline")

# Add conditional branching with Switch node
switch = Switch()
workflow.add_node("route", switch)

# Different paths based on validation
processor_a = DataTransformer(transformations=["lambda x: x"])
error_handler = DataTransformer(transformations=["lambda x: {'error': str(x)}"])
workflow.add_node("process_valid", processor_a)
workflow.add_node("handle_errors", error_handler)

# Connect with switch routing
workflow.connect("route", "process_valid")
workflow.connect("route", "handle_errors")

Hierarchical Workflow Composition

from kailash.workflow import Workflow
from kailash.nodes.logic import WorkflowNode
from kailash.runtime.local import LocalRuntime

# Create a reusable data processing workflow
inner_workflow = Workflow("data_processor", name="Data Processor")
# ... add nodes to inner workflow ...

# Wrap the workflow as a node
processor_node = WorkflowNode(
    workflow=inner_workflow,
    name="data_processor"
)

# Use in a larger workflow
main_workflow = Workflow("main", name="Main Pipeline")
main_workflow.add_node("process", processor_node)
main_workflow.add_node("analyze", analyzer_node)

# Connect workflows
main_workflow.connect("process", "analyze")

# Execute - parameters automatically mapped to inner workflow
runtime = LocalRuntime()
results, _ = runtime.execute(main_workflow)

Immutable State Management

from kailash.workflow import Workflow
from kailash.workflow.state import WorkflowStateWrapper
from pydantic import BaseModel

# Define state model
class MyStateModel(BaseModel):
    counter: int = 0
    status: str = "pending"
    nested: dict = {}

# Create workflow
workflow = Workflow("state_workflow", name="state_workflow")

# Create and wrap state object
state = MyStateModel()
state_wrapper = workflow.create_state_wrapper(state)

# Single path-based update
updated_wrapper = state_wrapper.update_in(
    ["counter"],
    42
)

# Batch update multiple fields atomically
updated_wrapper = state_wrapper.batch_update([
    (["counter"], 10),
    (["status"], "processing")
])

# Access the updated state
print(f"Updated counter: {updated_wrapper._state.counter}")
print(f"Updated status: {updated_wrapper._state.status}")

Task Tracking

from kailash.tracking import TaskManager

# Initialize task manager
task_manager = TaskManager()

# Create a sample workflow
from kailash.workflow import Workflow
workflow = Workflow("sample_workflow", name="Sample Workflow")

# Run workflow with tracking
from kailash.runtime.local import LocalRuntime
runtime = LocalRuntime()
results, run_id = runtime.execute(workflow)

# Query execution history
# Note: list_runs() may fail with timezone comparison errors in some cases
try:
    # List all runs
    all_runs = task_manager.list_runs()

    # Filter by status
    completed_runs = task_manager.list_runs(status="completed")
    failed_runs = task_manager.list_runs(status="failed")

    # Filter by workflow name
    workflow_runs = task_manager.list_runs(workflow_name="sample_workflow")

    # Process run information
    for run in completed_runs[:5]:  # First 5 runs
        print(f"Run {run.run_id[:8]}: {run.workflow_name} - {run.status}")

except Exception as e:
    print(f"Error listing runs: {e}")
    # Fallback: Access run details directly if available
    if hasattr(task_manager, 'storage'):
        run = task_manager.get_run(run_id)

Local Testing

from kailash.runtime.local import LocalRuntime
from kailash.workflow import Workflow

# Create a test workflow
workflow = Workflow("test_workflow", name="test_workflow")

# Create test runtime with debugging enabled
runtime = LocalRuntime(debug=True)

# Execute with test data
results, run_id = runtime.execute(workflow)

# Validate results
assert isinstance(results, dict)

Performance Monitoring & Real-time Dashboards

from kailash.visualization.performance import PerformanceVisualizer
from kailash.visualization.dashboard import RealTimeDashboard, DashboardConfig
from kailash.visualization.reports import WorkflowPerformanceReporter, ReportFormat
from kailash.tracking import TaskManager
from kailash.runtime.local import LocalRuntime
from kailash.workflow import Workflow
from kailash.nodes.transform import DataTransformer

# Create a workflow to monitor
workflow = Workflow("monitored_workflow", name="monitored_workflow")
node = DataTransformer(transformations=["lambda x: x"])
workflow.add_node("transform", node)

# Run workflow with task tracking
# Note: Pass task_manager to execute() to enable performance tracking
task_manager = TaskManager()
runtime = LocalRuntime()
results, run_id = runtime.execute(workflow, task_manager=task_manager)

# Static performance analysis
from pathlib import Path
perf_viz = PerformanceVisualizer(task_manager)
outputs = perf_viz.create_run_performance_summary(run_id, output_dir=Path("performance_report"))

# Real-time monitoring dashboard
config = DashboardConfig(
    update_interval=1.0,
    max_history_points=100,
    auto_refresh=True,
    theme="light"
)

dashboard = RealTimeDashboard(task_manager, config)
dashboard.start_monitoring(run_id)

# Add real-time callbacks
def on_metrics_update(metrics):
    print(f"Tasks: {metrics.completed_tasks} completed, {metrics.active_tasks} active")

dashboard.add_metrics_callback(on_metrics_update)

# Generate live HTML dashboard
dashboard.generate_live_report("live_dashboard.html", include_charts=True)
dashboard.stop_monitoring()

# Comprehensive performance reports
reporter = WorkflowPerformanceReporter(task_manager)
report_path = reporter.generate_report(
    run_id,
    output_path="workflow_report.html",
    format=ReportFormat.HTML
)

Real-time Dashboard Features:

⚡ Live Metrics Streaming: Real-time task progress and resource monitoring
📊 Interactive Charts: CPU, memory, and throughput visualizations with Chart.js
🔌 API Endpoints: REST and WebSocket APIs for custom integrations
📈 Performance Reports: Multi-format reports (HTML, Markdown, JSON) with insights
🎯 Bottleneck Detection: Automatic identification of performance issues
📱 Responsive Design: Mobile-friendly dashboards with auto-refresh

Performance Metrics Collected:

Execution Timeline: Gantt charts showing node execution order and duration
Resource Usage: Real-time CPU and memory consumption
I/O Analysis: Read/write operations and data transfer volumes
Performance Heatmaps: Identify bottlenecks across workflow runs
Throughput Metrics: Tasks per minute and completion rates
Error Tracking: Failed task analysis and error patterns

API Integration

from kailash.nodes.api import (
    HTTPRequestNode as RESTAPINode,
    # OAuth2AuthNode,
    # RateLimitedAPINode,
    # RateLimitConfig
)

# OAuth 2.0 authentication
# # auth_node = OAuth2AuthNode(
#     client_id="your_client_id",
#     client_secret="your_client_secret",
#     token_url="https://api.example.com/oauth/token"
# )

# Rate-limited API client
rate_config = None  # RateLimitConfig(
#     max_requests=100,
#     time_window=60.0,
#     strategy="token_bucket"
# )

api_client = RESTAPINode(
    base_url="https://api.example.com"
    # auth_node=auth_node
)

# rate_limited_client = RateLimitedAPINode(
#     wrapped_node=api_client,
#     rate_limit_config=rate_config
# )

Export Formats

from kailash.utils.export import WorkflowExporter, ExportConfig
from kailash.workflow import Workflow
from kailash.nodes.transform import DataTransformer

# Create a workflow to export
workflow = Workflow("export_example", name="export_example")
node = DataTransformer(transformations=["lambda x: x"])
workflow.add_node("transform", node)

exporter = WorkflowExporter()

# Export to different formats
workflow.save("workflow.yaml", format="yaml")  # Kailash YAML format
workflow.save("workflow.json", format="json")  # JSON representation

# Export with custom configuration
config = ExportConfig(
    include_metadata=True,
    container_tag="latest"
)
workflow.save("deployment.yaml")

🎨 Visualization

from kailash.workflow import Workflow
from kailash.workflow.visualization import WorkflowVisualizer
from kailash.nodes.transform import DataTransformer

# Create a workflow to visualize
workflow = Workflow("viz_example", name="viz_example")
node = DataTransformer(transformations=["lambda x: x"])
workflow.add_node("transform", node)

# Generate Mermaid diagram (recommended for documentation)
mermaid_code = workflow.to_mermaid()
print(mermaid_code)

# Save as Mermaid markdown file
with open("workflow.md", "w") as f:
    f.write(workflow.to_mermaid_markdown(title="My Workflow"))

# Or use matplotlib visualization
visualizer = WorkflowVisualizer(workflow)
visualizer.visualize()
visualizer.save("workflow.png", dpi=300)  # Save as PNG

Hierarchical RAG (Retrieval-Augmented Generation)

from kailash.workflow import Workflow
from kailash.nodes.data.sources import DocumentSourceNode, QuerySourceNode
from kailash.nodes.data.retrieval import RelevanceScorerNode
from kailash.nodes.transform.chunkers import HierarchicalChunkerNode
from kailash.nodes.transform.formatters import (
    ChunkTextExtractorNode,
    QueryTextWrapperNode,
    ContextFormatterNode,
)
from kailash.nodes.ai.llm_agent import LLMAgent
from kailash.nodes.ai.embedding_generator import EmbeddingGenerator

# Create hierarchical RAG workflow
workflow = Workflow(
    workflow_id="hierarchical_rag_example",
    name="Hierarchical RAG Workflow",
    description="Complete RAG pipeline with embedding-based retrieval",
    version="1.0.0"
)

# Create data source nodes
doc_source = DocumentSourceNode()
query_source = QuerySourceNode()

# Create document processing pipeline
chunker = HierarchicalChunkerNode()
chunk_text_extractor = ChunkTextExtractorNode()
query_text_wrapper = QueryTextWrapperNode()

# Create embedding generators
chunk_embedder = EmbeddingGenerator(
    provider="ollama",
    model="nomic-embed-text",
    operation="embed_batch"
)

query_embedder = EmbeddingGenerator(
    provider="ollama",
    model="nomic-embed-text",
    operation="embed_batch"
)

# Create retrieval and formatting nodes
relevance_scorer = RelevanceScorerNode(similarity_method="cosine")
context_formatter = ContextFormatterNode()

# Create LLM agent for final answer generation
llm_agent = LLMAgent(
    provider="ollama",
    model="llama3.2",
    temperature=0.7,
    max_tokens=500
)

# Add all nodes to workflow
for node_id, node in [
    ("doc_source", doc_source),
    ("chunker", chunker),
    ("query_source", query_source),
    ("chunk_text_extractor", chunk_text_extractor),
    ("query_text_wrapper", query_text_wrapper),
    ("chunk_embedder", chunk_embedder),
    ("query_embedder", query_embedder),
    ("relevance_scorer", relevance_scorer),
    ("context_formatter", context_formatter),
    ("llm_agent", llm_agent)
]:
    workflow.add_node(node_id, node)

# Connect the workflow pipeline
# Document processing: docs → chunks → text → embeddings
workflow.connect("doc_source", "chunker", {"documents": "documents"})
workflow.connect("chunker", "chunk_text_extractor", {"chunks": "chunks"})
workflow.connect("chunk_text_extractor", "chunk_embedder", {"input_texts": "input_texts"})

# Query processing: query → text wrapper → embeddings
workflow.connect("query_source", "query_text_wrapper", {"query": "query"})
workflow.connect("query_text_wrapper", "query_embedder", {"input_texts": "input_texts"})

# Relevance scoring: chunks + embeddings → scored chunks
workflow.connect("chunker", "relevance_scorer", {"chunks": "chunks"})
workflow.connect("query_embedder", "relevance_scorer", {"embeddings": "query_embedding"})
workflow.connect("chunk_embedder", "relevance_scorer", {"embeddings": "chunk_embeddings"})

# Context formatting: relevant chunks + query → formatted context
workflow.connect("relevance_scorer", "context_formatter", {"relevant_chunks": "relevant_chunks"})
workflow.connect("query_source", "context_formatter", {"query": "query"})

# Final answer generation: formatted context → LLM response
workflow.connect("context_formatter", "llm_agent", {"messages": "messages"})

# Execute workflow
results, run_id = workflow.run()

# Access results
print("🎯 Top Relevant Chunks:")
for chunk in results["relevance_scorer"]["relevant_chunks"]:
    print(f"  - {chunk['document_title']}: {chunk['relevance_score']:.3f}")

print("\n🤖 Final Answer:")
print(results["llm_agent"]["response"]["content"])

This example demonstrates:

Document chunking with hierarchical structure
Vector embeddings using Ollama's nomic-embed-text model
Semantic similarity scoring with cosine similarity
Context formatting for LLM input
Answer generation using Ollama's llama3.2 model

💻 CLI Commands

The SDK includes a comprehensive CLI for workflow management:

# Project initialization
kailash init my-project --template data-pipeline

# Workflow operations
kailash validate workflow.yaml
kailash run workflow.yaml --inputs data.json
kailash export workflow.py --format kubernetes

# Task management
kailash tasks list --status running
kailash tasks show run-123
kailash tasks cancel run-123

# Development tools
kailash test workflow.yaml --data test_data.json
kailash debug workflow.yaml --breakpoint node-id

🏗️ Architecture

The SDK follows a clean, modular architecture:

kailash/
├── nodes/           # Node implementations and base classes
│   ├── base.py      # Abstract Node class
│   ├── data/        # Data I/O nodes
│   ├── transform/   # Transformation nodes
│   ├── logic/       # Business logic nodes
│   └── ai/          # AI/ML nodes
├── workflow/        # Workflow management
│   ├── graph.py     # DAG representation
│   └── visualization.py  # Visualization tools
├── visualization/   # Performance visualization
│   └── performance.py    # Performance metrics charts
├── runtime/         # Execution engines
│   ├── local.py     # Local execution
│   └── docker.py    # Docker execution (planned)
├── tracking/        # Monitoring and tracking
│   ├── manager.py   # Task management
│   └── metrics_collector.py  # Performance metrics
│   └── storage/     # Storage backends
├── cli/             # Command-line interface
└── utils/           # Utilities and helpers

🤖 Unified AI Provider Architecture

The SDK features a unified provider architecture for AI capabilities:

from kailash.nodes.ai import LLMAgent, EmbeddingGenerator

# Multi-provider LLM support
agent = LLMAgent()
result = agent.run(
    provider="ollama",  # or "openai", "anthropic", "mock"
    model="llama3.1:8b-instruct-q8_0",
    messages=[{"role": "user", "content": "Explain quantum computing"}],
    generation_config={"temperature": 0.7, "max_tokens": 500}
)

# Vector embeddings with the same providers
embedder = EmbeddingGenerator()
embedding = embedder.run(
    provider="ollama",  # Same providers support embeddings
    model="snowflake-arctic-embed2",
    operation="embed_text",
    input_text="Quantum computing uses quantum mechanics principles"
)

# Check available providers and capabilities
from kailash.nodes.ai.ai_providers import get_available_providers
providers = get_available_providers()
# Returns: {"ollama": {"available": True, "chat": True, "embeddings": True}, ...}

Supported AI Providers:

Ollama: Local LLMs with both chat and embeddings (llama3.1, mistral, etc.)
OpenAI: GPT models and text-embedding-3 series
Anthropic: Claude models (chat only)
Cohere: Embedding models (embed-english-v3.0)
HuggingFace: Sentence transformers and local models
Mock: Testing provider with consistent outputs

🧪 Testing

The SDK is thoroughly tested with comprehensive test suites:

# Run all tests
uv run pytest

# Run with coverage
uv run pytest --cov=kailash --cov-report=html

# Run specific test categories
uv run pytest tests/unit/
uv run pytest tests/integration/
uv run pytest tests/e2e/

🤝 Contributing

We welcome contributions! Please see our Contributing Guide for details.

Development Setup

# Clone the repository
git clone https://github.com/integrum/kailash-python-sdk.git
cd kailash-python-sdk

# Install uv if you haven't already
curl -LsSf https://astral.sh/uv/install.sh | sh

# Sync dependencies (creates venv automatically and installs everything)
uv sync

# Run commands using uv (no need to activate venv)
uv run pytest
uv run kailash --help

# Or activate the venv if you prefer
source .venv/bin/activate  # On Windows: .venv\Scripts\activate

# Install development dependencies
uv add --dev pre-commit detect-secrets doc8

# Install Trivy (macOS with Homebrew)
brew install trivy

# Set up pre-commit hooks
pre-commit install
pre-commit install --hook-type pre-push

# Run initial setup (formats code and fixes issues)
pre-commit run --all-files

Code Quality & Pre-commit Hooks

We use automated pre-commit hooks to ensure code quality:

Hooks Include:

Black: Code formatting
isort: Import sorting
Ruff: Fast Python linting
pytest: Unit tests
Trivy: Security vulnerability scanning
detect-secrets: Secret detection
doc8: Documentation linting
mypy: Type checking

Manual Quality Checks:

# Format code
black src/ tests/
isort src/ tests/

# Linting and fixes
ruff check src/ tests/ --fix

# Type checking
mypy src/

# Run all pre-commit hooks manually
pre-commit run --all-files

# Run specific hooks
pre-commit run black
pre-commit run pytest-check

📈 Project Status

✅ Completed

Core node system with 15+ node types
Workflow builder with DAG validation
Local & async execution engines
Task tracking with metrics
Multiple storage backends
Export functionality (YAML/JSON)
CLI interface
Immutable state management
API integration with rate limiting
OAuth 2.0 authentication
SharePoint Graph API integration
Real-time performance metrics collection
Performance visualization dashboards
Real-time monitoring dashboard with WebSocket streaming
Comprehensive performance reports (HTML, Markdown, JSON)
89% test coverage (571 tests)
15 test categories all passing
37 working examples

🚧 In Progress

Comprehensive API documentation
Security audit & hardening
Performance optimizations
Docker runtime finalization

📋 Planned

Cloud deployment templates
Visual workflow editor
Plugin system
Additional integrations

🎯 Test Suite Status

Total Tests: 571 passing (89%)
Test Categories: 15/15 at 100%
Integration Tests: 65 passing
Examples: 37/37 working
Code Coverage: 89%

⚠️ Known Issues

DateTime Comparison in list_runs(): The TaskManager.list_runs() method may encounter timezone comparison errors between timezone-aware and timezone-naive datetime objects. Workaround: Use try-catch blocks when calling list_runs() or access run details directly via get_run(run_id).
Performance Tracking: To enable performance metrics collection, you must pass the task_manager parameter to the runtime.execute() method: runtime.execute(workflow, task_manager=task_manager).

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

🙏 Acknowledgments

The Integrum team for the Kailash architecture
All contributors who have helped shape this SDK
The Python community for excellent tools and libraries

📞 Support

Made with ❤️ by the Integrum Team

Project details

These details have not been verified by PyPI

Project links

Development Status
- 3 - Alpha
Intended Audience
- Developers
Programming Language

Release history Release notifications | RSS feed

2.23.0

May 18, 2026

2.22.1

May 18, 2026

2.22.0

May 17, 2026

2.21.3

May 17, 2026

2.21.2

May 17, 2026

2.21.1

May 16, 2026

2.21.0

May 13, 2026

2.20.3

May 11, 2026

2.20.2

May 11, 2026

2.20.1

May 10, 2026

2.20.0

May 10, 2026

2.19.0

May 10, 2026

2.18.1

May 10, 2026

2.18.0

May 8, 2026

2.17.0

May 8, 2026

2.16.1

May 8, 2026

2.16.0

May 7, 2026

2.15.0

May 7, 2026

2.14.1

May 7, 2026

2.14.0

May 7, 2026

2.13.5

May 6, 2026

2.13.4

May 3, 2026

2.13.3

May 1, 2026

2.13.2

May 1, 2026

2.13.1

Apr 30, 2026

2.13.0

Apr 30, 2026

2.12.0

Apr 28, 2026

2.11.3

Apr 27, 2026

2.11.2

Apr 26, 2026

2.11.1

Apr 26, 2026

2.11.0

Apr 25, 2026

2.10.0

Apr 25, 2026

2.9.2

Apr 24, 2026

2.9.1

Apr 24, 2026

2.9.0

Apr 24, 2026

2.8.12

Apr 21, 2026

2.8.11

Apr 20, 2026

2.8.10

Apr 19, 2026

2.8.9

Apr 19, 2026

2.8.8

Apr 19, 2026

2.8.7

Apr 19, 2026

2.8.6

Apr 14, 2026

2.8.5

Apr 13, 2026

2.8.4

Apr 12, 2026

2.8.3

Apr 12, 2026

2.8.2

Apr 12, 2026

2.6.0

Apr 6, 2026

2.5.1

Apr 5, 2026

2.5.0

Apr 4, 2026

2.4.1

Apr 3, 2026

2.4.0

Apr 1, 2026

2.3.4

Mar 31, 2026

2.3.3

Mar 31, 2026

2.3.2

Mar 31, 2026

2.3.1

Mar 30, 2026

2.3.0

Mar 30, 2026

2.2.1

Mar 29, 2026

2.2.0

Mar 27, 2026

2.1.0

Mar 24, 2026

2.0.1

Mar 23, 2026

2.0.0

Mar 21, 2026

1.0.0

Mar 18, 2026

0.13.0

Mar 17, 2026

0.12.5

Mar 9, 2026

0.12.4

Mar 7, 2026

0.12.3

Feb 23, 2026

0.12.2

Feb 23, 2026

0.12.1

Feb 22, 2026

0.12.0

Feb 22, 2026

0.11.0

Feb 9, 2026

0.10.17

Feb 7, 2026

0.10.16

Jan 31, 2026

0.10.15

Jan 15, 2026

0.10.14

Jan 13, 2026

0.10.13

Nov 19, 2025

0.10.12

Nov 17, 2025

0.10.11

Nov 12, 2025

0.10.10

Nov 7, 2025

0.10.9

Nov 7, 2025

0.10.8

Nov 7, 2025

0.10.7

Nov 2, 2025

0.10.6

Nov 2, 2025

0.10.5

Oct 31, 2025

0.10.4

Oct 30, 2025

0.10.3

Oct 29, 2025

0.10.2

Oct 27, 2025

0.10.1

Oct 27, 2025

0.10.0

Oct 26, 2025

0.9.31

Oct 24, 2025

0.9.30

Oct 24, 2025

0.9.29

Oct 24, 2025

0.9.28

Oct 23, 2025

0.9.27

Oct 22, 2025

0.9.26

Oct 21, 2025

0.9.25

Oct 15, 2025

0.9.24

Oct 15, 2025

0.9.23

Oct 13, 2025

0.9.22

Oct 11, 2025

0.9.21

Oct 8, 2025

0.9.20

Oct 6, 2025

0.9.19

Aug 21, 2025

0.9.18

Aug 15, 2025

0.9.17

Aug 14, 2025

0.9.16

Aug 13, 2025

0.9.15

Aug 10, 2025

0.9.14

Aug 10, 2025

0.9.13

Aug 5, 2025

0.9.12

Aug 5, 2025

0.9.11

Aug 4, 2025

0.9.10

Aug 2, 2025

0.9.9

Aug 1, 2025

0.9.8

Jul 31, 2025

0.9.7

Jul 31, 2025

0.9.6

Jul 31, 2025

0.9.5

Jul 31, 2025

0.9.4

Jul 31, 2025

0.9.3

Jul 30, 2025

0.9.2

Jul 29, 2025

0.9.1

Jul 29, 2025

0.9.0

Jul 27, 2025

0.8.7

Jul 25, 2025

0.8.6

Jul 22, 2025

0.8.5

Jul 20, 2025

0.8.4

Jul 19, 2025

0.8.3

Jul 18, 2025

0.8.2

Jul 18, 2025

0.8.1

Jul 17, 2025

0.8.0

Jul 17, 2025

0.7.0

Jul 10, 2025

0.6.6

Jul 8, 2025

0.6.5

Jul 8, 2025

0.6.4

Jul 6, 2025

0.6.3

Jul 3, 2025

0.6.2

Jul 3, 2025

0.6.1

Jun 26, 2025

0.6.0

Jun 24, 2025

0.5.0

Jun 19, 2025

0.4.2

Jun 18, 2025

0.4.1

Jun 16, 2025

0.4.0

Jun 15, 2025

0.3.2

Jun 11, 2025

0.3.1

Jun 11, 2025

0.3.0

Jun 10, 2025

0.2.2

Jun 10, 2025

0.2.1

Jun 8, 2025

0.2.0

Jun 8, 2025

0.1.5

Jun 4, 2025

0.1.4

Jun 3, 2025

This version

0.1.3

Jun 3, 2025

0.1.2

Jun 2, 2025

0.1.1

May 31, 2025

0.1.0 yanked

May 31, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

kailash-0.1.3.tar.gz (268.6 kB view details)

Uploaded Jun 3, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

kailash-0.1.3-py3-none-any.whl (290.5 kB view details)

Uploaded Jun 3, 2025 Python 3

File details

Details for the file kailash-0.1.3.tar.gz.

File metadata

Download URL: kailash-0.1.3.tar.gz
Upload date: Jun 3, 2025
Size: 268.6 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for kailash-0.1.3.tar.gz
Algorithm	Hash digest
SHA256	`320054ee5545d84d566c90a88d08655c891c537cbe4870601da6289bf26717b4`
MD5	`2e809af6beb2a4c7d365a158242503de`
BLAKE2b-256	`23d5d53dbcfb9620aaeb5b65401f8781c650287d401bd9a9b45a7886f9e4e92e`

See more details on using hashes here.

File details

Details for the file kailash-0.1.3-py3-none-any.whl.

File metadata

Download URL: kailash-0.1.3-py3-none-any.whl
Upload date: Jun 3, 2025
Size: 290.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for kailash-0.1.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`ac2d3ca751f2fb025b08150a012d65b895c50e8f657dfb3bcfda99be48853258`
MD5	`7cc0d2e03602406dea8e538845c483ae`
BLAKE2b-256	`28f698f84703e3df0ca04fa70b63d4288c08bef9838e21ed3acd5aa3b9d58fd0`

See more details on using hashes here.

kailash 0.1.3

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Kailash Python SDK

✨ Highlights

🎯 Who Is This For?

🚀 Quick Start

Installation

Your First Workflow

SharePoint Integration Example

Hierarchical RAG Example

Workflow API Wrapper - Expose Workflows as REST APIs

Features

📚 Documentation

🛠️ Features

📦 Pre-built Nodes

🔧 Core Capabilities

Workflow Management

Hierarchical Workflow Composition

Immutable State Management

Task Tracking

Local Testing

Performance Monitoring & Real-time Dashboards

API Integration

Export Formats

🎨 Visualization

Hierarchical RAG (Retrieval-Augmented Generation)

💻 CLI Commands

🏗️ Architecture

🤖 Unified AI Provider Architecture

🧪 Testing

🤝 Contributing

Development Setup

Code Quality & Pre-commit Hooks

📈 Project Status

✅ Completed

🚧 In Progress

📋 Planned

🎯 Test Suite Status

⚠️ Known Issues

📄 License

🙏 Acknowledgments

📞 Support

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes