The simplest way to use AI in Python with automatic cost tracking and optimization

These details have not been verified by PyPI

Project links

Project description

Cost Katana Python 🥷

AI that just works. With automatic cost tracking.

import cost_katana as ck

response = ck.ai('gpt-4', 'Hello, world!')
print(response.text)        # "Hello! How can I help you today?"
print(f"Cost: ${response.cost}")  # "Cost: $0.0012"

That's it. No setup. No configuration. No complexity.

Installation

pip install costkatana

Package Names:

Python: costkatana (PyPI)

JavaScript/Node: cost-katana (NPM)

CLI: cost-katana-cli (NPM) or included with Python package

Quick Start

Zero Configuration

import cost_katana as ck

# Just works with any AI model
ck.ai('gpt-4', 'Explain quantum computing')
ck.ai('claude-3-sonnet', 'Write a haiku')
ck.ai('gemini-pro', 'Solve this: 2x + 5 = 13')

Chat Conversations

import cost_katana as ck

chat = ck.chat('gpt-4')
chat.send('Hello!')
chat.send('What can you help me with?')
chat.send('Tell me a joke')

print(f"Total cost: ${chat.total_cost}")

Type-Safe Model Selection (Recommended) ✨

Use model constants instead of strings to prevent typos and get autocomplete:

import cost_katana as ck
from cost_katana import openai, anthropic, google

# Type-safe model selection (recommended)
response = ck.ai(openai.gpt_4, 'Hello, world!')
print(response.text)

# Compare models easily
models = [openai.gpt_4, anthropic.claude_3_5_sonnet_20241022, google.gemini_2_5_pro]
for model in models:
    response = ck.ai(model, 'Explain AI in one sentence')
    print(f"Cost: ${response.cost}")

Benefits:

✅ IDE autocomplete for all models
✅ No spelling mistakes
✅ Type checking in editors
✅ Self-documenting code

Available namespaces:

openai - GPT-4, GPT-3.5, DALL-E, Whisper, etc.
anthropic - Claude 3.5 Sonnet, Haiku, Opus, etc.
google - Gemini 2.5 Pro, Flash, etc.
aws_bedrock - Nova, Claude on Bedrock, etc.
xai - Grok models
deepseek - DeepSeek models
mistral - Mistral AI models
cohere - Command models
groq - Groq models
meta - Llama models

Migration from string names:

# Old way (still works with warning)
response = ck.ai('gpt-4', 'Hello')

# New way (recommended)
from cost_katana import openai
response = ck.ai(openai.gpt_4, 'Hello')

📚 More Examples

Looking for more comprehensive examples? Check out our complete examples repository

🔗 github.com/Hypothesize-Tech/costkatana-examples

What's included:

✅ 44 feature sections covering every Cost Katana capability
✅ Python SDK examples in Section 8 and throughout
✅ HTTP REST API examples (.http files)
✅ TypeScript/Node.js examples
✅ Framework integrations (Express, Next.js, Fastify, NestJS, FastAPI)
✅ Real-world use cases with best practices
✅ Production-ready code with full error handling

Popular examples:

Python SDK Examples - Complete Python guides
Cost Tracking - Track costs across all providers
Webhooks - Real-time notifications
Workflows - Multi-step AI orchestration
Semantic Caching - 30-40% cost reduction
FastAPI Integration - Framework examples

Compare Models

import cost_katana as ck

models = ['gpt-4', 'claude-3-sonnet', 'gemini-pro']
prompt = 'Explain relativity in one sentence'

for model in models:
    response = ck.ai(model, prompt)
    print(f"{model}: ${response.cost:.4f}")

Features

💰 Cost Tracking

Every response includes cost information:

response = ck.ai('gpt-4', 'Write a story')
print(f"Cost: ${response.cost}")
print(f"Tokens: {response.tokens}")
print(f"Model: {response.model}")
print(f"Provider: {response.provider}")

💾 Smart Caching

Save money by caching repeated requests:

# First call - costs money
r1 = ck.ai('gpt-4', 'What is 2+2?', cache=True)
print(r1.cached)  # False

# Second call - free from cache
r2 = ck.ai('gpt-4', 'What is 2+2?', cache=True)
print(r2.cached)  # True - saved money!

⚡ Cortex Optimization

Reduce costs by 70-95%:

response = ck.ai('gpt-4', 'Write a comprehensive guide to Python', 
                 cortex=True)

print(response.optimized)  # True
print(f"Saved: ${response.saved_amount}")

🔄 Auto-Failover

Never fail - automatically switch providers:

# If OpenAI is down, automatically uses Claude or Gemini
response = ck.ai('gpt-4', 'Hello')
print(response.provider)  # Might be 'anthropic' if OpenAI failed

📊 Analytics Dashboard

All usage syncs to your dashboard at costkatana.com:

response = ck.ai('gpt-4', 'Hello')
# Automatically tracked in your dashboard
# View at: https://costkatana.com/dashboard

Configuration

Environment Variables

# Option 1: Cost Katana (Recommended - all features)
export COST_KATANA_API_KEY="dak_your_key_here"

# Option 2: Direct provider keys (limited features)
export OPENAI_API_KEY="sk-..."
export ANTHROPIC_API_KEY="sk-ant-..."

Manual Configuration

import cost_katana as ck

ck.configure(
    api_key='dak_your_key',
    cortex=True,     # 70-95% cost savings
    cache=True,      # Smart caching
    firewall=True    # Security
)

Advanced Options

response = ck.ai('gpt-4', 'Your prompt',
    temperature=0.7,        # Creativity level (0-2)
    max_tokens=500,         # Response length limit
    system_message='You are helpful',  # System prompt
    cache=True,             # Enable caching
    cortex=True            # Enable optimization
)

Multi-Provider Support

Works with all major AI providers:

# OpenAI
ck.ai('gpt-4', 'Hello')
ck.ai('gpt-3.5-turbo', 'Hello')

# Anthropic
ck.ai('claude-3-sonnet', 'Hello')
ck.ai('claude-3-haiku', 'Hello')

# Google
ck.ai('gemini-pro', 'Hello')
ck.ai('gemini-flash', 'Hello')

# AWS Bedrock
ck.ai('nova-pro', 'Hello')
ck.ai('nova-lite', 'Hello')

# And many more...

Real-World Examples

Customer Support Bot

import cost_katana as ck

support = ck.chat('gpt-3.5-turbo',
    system_message='You are a helpful customer support agent.')

def handle_customer_query(query: str):
    response = support.send(query)
    print(f"Cost so far: ${support.total_cost}")
    return response

Content Generation

import cost_katana as ck

def generate_blog_post(topic: str):
    # Use Cortex for long-form content (40-75% savings)
    post = ck.ai('gpt-4', f'Write a blog post about {topic}',
                 cortex=True, max_tokens=2000)
    
    return {
        'content': post.text,
        'cost': post.cost,
        'word_count': len(post.text.split())
    }

Code Assistant

import cost_katana as ck

def review_code(code: str):
    review = ck.ai('claude-3-sonnet',
        f'Review this code and suggest improvements:\n\n{code}',
        cache=True)  # Cache for repeated reviews
    
    return review.text

Translation Service

import cost_katana as ck

def translate(text: str, target_language: str):
    # Use cheaper model for translations
    translated = ck.ai('gpt-3.5-turbo',
        f'Translate to {target_language}: {text}',
        cache=True)
    
    return translated.text

Framework Integration

FastAPI

from fastapi import FastAPI
import cost_katana as ck

app = FastAPI()

@app.post('/api/chat')
async def chat(request: dict):
    response = ck.ai('gpt-4', request['prompt'])
    return {
        'text': response.text,
        'cost': response.cost
    }

Flask

from flask import Flask, request, jsonify
import cost_katana as ck

app = Flask(__name__)

@app.route('/api/chat', methods=['POST'])
def chat():
    prompt = request.json['prompt']
    response = ck.ai('gpt-4', prompt)
    return jsonify({
        'text': response.text,
        'cost': response.cost
    })

Django

from django.http import JsonResponse
import cost_katana as ck

def chat_view(request):
    prompt = request.POST.get('prompt')
    response = ck.ai('gpt-4', prompt)
    return JsonResponse({
        'text': response.text,
        'cost': response.cost
    })

Command Line Interface

The Python package includes a CLI:

# After installing the package
pip install costkatana

# Use the CLI
costkatana chat
costkatana ask "What is Python?"

Or install the dedicated CLI:

npm install -g cost-katana-cli
cost-katana chat

Error Handling

import cost_katana as ck
from cost_katana.exceptions import CostKatanaError

try:
    response = ck.ai('gpt-4', 'Hello')
    print(response.text)
except CostKatanaError as e:
    if 'API key' in str(e):
        print('Please set your API key')
    elif 'rate limit' in str(e):
        print('Rate limit exceeded')
    elif 'model' in str(e):
        print('Model not found')
    else:
        print(f'Error: {e}')

Cost Optimization Tips

1. Use Appropriate Models

# For simple tasks, use cheaper models
ck.ai('gpt-3.5-turbo', 'Simple question')  # $0.0001

# For complex tasks, use powerful models
ck.ai('gpt-4', 'Complex analysis')  # $0.01

2. Enable Caching

# Cache repeated queries
ck.ai('gpt-4', 'Common question', cache=True)

3. Use Cortex for Long Content

# 70-95% savings on long-form content
ck.ai('gpt-4', 'Write a book chapter', cortex=True)

4. Batch Similar Requests

session = ck.chat('gpt-3.5-turbo')
# Reuse session for related queries
session.send('Query 1')
session.send('Query 2')

Monitoring & Analytics

Track Usage

import cost_katana as ck

chat = ck.chat('gpt-4')
chat.send('Hello')
chat.send('How are you?')

print(f'Messages: {len(chat.history)}')
print(f'Total cost: ${chat.total_cost}')
print(f'Total tokens: {chat.total_tokens}')

Dashboard Features

Visit costkatana.com/dashboard to see:

Real-time cost tracking
Usage by model and provider
Daily/weekly/monthly spending
Token usage analytics
Optimization recommendations
Team usage breakdown
Budget alerts

Migration Guide

From OpenAI SDK

# Before (OpenAI SDK)
from openai import OpenAI
client = OpenAI(api_key='sk-...')
completion = client.chat.completions.create(
    model='gpt-4',
    messages=[{'role': 'user', 'content': 'Hello'}]
)
print(completion.choices[0].message.content)

# After (Cost Katana)
import cost_katana as ck
response = ck.ai('gpt-4', 'Hello')
print(response.text)
print(f"Cost: ${response.cost}")  # Bonus: cost tracking!

From Anthropic SDK

# Before (Anthropic SDK)
import anthropic
client = anthropic.Anthropic(api_key='sk-ant-...')
message = client.messages.create(
    model='claude-3-sonnet-20241022',
    messages=[{'role': 'user', 'content': 'Hello'}]
)

# After (Cost Katana)
import cost_katana as ck
response = ck.ai('claude-3-sonnet', 'Hello')

From Google AI SDK

# Before (Google AI SDK)
import google.generativeai as genai
genai.configure(api_key='...')
model = genai.GenerativeModel('gemini-pro')
response = model.generate_content('Hello')

# After (Cost Katana)
import cost_katana as ck
response = ck.ai('gemini-pro', 'Hello')

Package Naming

Important: Different package names for different languages to avoid conflicts:

Language	Package Manager	Install Command	Import
Python	PyPI	`pip install costkatana`	`import cost_katana`
JavaScript/Node	NPM	`npm install cost-katana`	`import { ai } from 'cost-katana'`
CLI (NPM)	NPM	`npm install -g cost-katana-cli`	`cost-katana chat`
CLI (Python)	PyPI	`pip install costkatana`	`costkatana chat`

Troubleshooting

No API Keys Found

# Set Cost Katana key (recommended)
export COST_KATANA_API_KEY="dak_your_key"

# Or set provider keys directly
export OPENAI_API_KEY="sk-..."

Model Not Available

# Check available models
try:
    response = ck.ai('model-name', 'test')
except Exception as e:
    print(f'Error: {e}')
    # Error message includes available models

Rate Limits

# Automatic retry with backoff
response = ck.ai('gpt-4', 'Hello', retry=True)

Support

Dashboard: costkatana.com
Documentation: docs.costkatana.com
GitHub: github.com/Hypothesize-Tech/cost-katana-python
Email: support@costkatana.com
Discord: discord.gg/Wcwzw8wM

License

MIT © Cost Katana

Start saving on AI costs today!

pip install costkatana

import cost_katana as ck
response = ck.ai('gpt-4', 'Hello, world!')

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

2.5.7

Apr 29, 2026

2.5.5

Apr 29, 2026

2.5.4

Apr 21, 2026

2.5.3

Mar 29, 2026

2.5.2

Mar 27, 2026

2.5.1

Mar 25, 2026

2.5.0

Mar 25, 2026

2.4.0

Feb 21, 2026

2.2.6

Jan 31, 2026

2.2.5

Jan 17, 2026

2.2.4

Dec 2, 2025

2.2.3

Nov 26, 2025

2.2.2

Nov 24, 2025

2.2.1

Nov 19, 2025

This version

2.2.0

Nov 19, 2025

2.1.0

Nov 17, 2025

2.0.8

Nov 15, 2025

2.0.7

Oct 23, 2025

2.0.6

Oct 23, 2025

2.0.5

Oct 23, 2025

2.0.4

Oct 12, 2025

2.0.2

Oct 9, 2025

2.0.1

Oct 2, 2025

2.0.0

Sep 11, 2025

1.0.3

Sep 8, 2025

1.0.2

Sep 8, 2025

1.0.1

Aug 4, 2025

1.0.0

Aug 4, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cost_katana-2.2.0.tar.gz (39.3 kB view details)

Uploaded Nov 19, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

cost_katana-2.2.0-py3-none-any.whl (33.1 kB view details)

Uploaded Nov 19, 2025 Python 3

File details

Details for the file cost_katana-2.2.0.tar.gz.

File metadata

Download URL: cost_katana-2.2.0.tar.gz
Upload date: Nov 19, 2025
Size: 39.3 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.0

File hashes

Hashes for cost_katana-2.2.0.tar.gz
Algorithm	Hash digest
SHA256	`ce1d08c827c9c0f272a7df150ef99ea4b4e1be1705ffb8eb0019acd16e121c50`
MD5	`2d143a285bdac19bbed84b65364227b5`
BLAKE2b-256	`4b3df75ef05334ab9cdf73c1d12cf3402263eafcd13bcac58fd0c9db4ad4f58a`

See more details on using hashes here.

File details

Details for the file cost_katana-2.2.0-py3-none-any.whl.

File metadata

Download URL: cost_katana-2.2.0-py3-none-any.whl
Upload date: Nov 19, 2025
Size: 33.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.0

File hashes

Hashes for cost_katana-2.2.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`ee65bbb7cf386222ca4a29202ee65ec48afd1b8c6a3ad4a79cb0e97c25d69f94`
MD5	`308d6357d61a2a55e6d284e4a24ea4e7`
BLAKE2b-256	`789e6c300355db0b67c57a0e866bb406e9d448b9431b1fc5a98a6571a042dcaf`

See more details on using hashes here.

cost-katana 2.2.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Cost Katana Python 🥷

Installation

Quick Start

Zero Configuration

Chat Conversations

Type-Safe Model Selection (Recommended) ✨

📚 More Examples

Compare Models

Features

💰 Cost Tracking

💾 Smart Caching

⚡ Cortex Optimization

🔄 Auto-Failover

📊 Analytics Dashboard

Configuration

Environment Variables

Manual Configuration

Advanced Options

Multi-Provider Support

Real-World Examples

Customer Support Bot

Content Generation

Code Assistant

Translation Service

Framework Integration

FastAPI

Flask

Django

Command Line Interface

Error Handling

Cost Optimization Tips

1. Use Appropriate Models

2. Enable Caching

3. Use Cortex for Long Content

4. Batch Similar Requests

Monitoring & Analytics

Track Usage

Dashboard Features

Migration Guide

From OpenAI SDK

From Anthropic SDK

From Google AI SDK

Package Naming

Troubleshooting

No API Keys Found

Model Not Available

Rate Limits

Support

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes