Reduce LLM costs by 90% - AI recommendations with NO API keys needed!
Project description
LLMOptimize - AI Cost Optimization Made Simple
Reduce your LLM API costs by 90% with zero configuration. No API keys required for reports!
What Makes LLMOptimize Different?

No API Keys Needed!
Unlike other tools, you don't need OpenAI, Anthropic, or any other API keys to get cost reports and smart recommendations. Our server does the AI analysis using our keys!

Smart, Context-Aware Recommendations
- Track embeddings? Get embedding alternatives (not chat models!)
- Track chat? Get chat alternatives
- Track reasoning models? Get reasoning alternatives
- ML-powered suggestions that learn from your patterns
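The idea behind context-aware matching can be illustrated with a small sketch. This is purely illustrative and not LLMOptimize's actual implementation; the category labels and alternative lists below are assumptions made up for the example:

```python
# Illustrative sketch: recommend cheaper alternatives only from the
# same model category, so embedding usage never yields chat models.
# Categories and alternatives here are example data, not real package internals.

MODEL_CATEGORIES = {
    "gpt-4": "chat",
    "gpt-4o-mini": "chat",
    "o1": "reasoning",
    "o3-mini": "reasoning",
    "text-embedding-3-large": "embedding",
    "text-embedding-3-small": "embedding",
}

# Cheaper alternatives, keyed by category (example data only).
CHEAPER_BY_CATEGORY = {
    "chat": ["gpt-4o-mini"],
    "reasoning": ["o3-mini"],
    "embedding": ["text-embedding-3-small"],
}

def recommend(model: str) -> list:
    """Suggest cheaper models drawn from the same category as `model`."""
    category = MODEL_CATEGORIES.get(model)
    if category is None:
        return []  # unknown model: no recommendation
    return [m for m in CHEAPER_BY_CATEGORY[category] if m != model]
```

Under this scheme, tracking `text-embedding-3-large` produces embedding suggestions only, never chat models.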
Beautiful Interactive Reports
Gorgeous terminal UI with animations, color-coded insights, and actionable recommendations.

Quick Demo
import llmoptimize
import openai

# Use OpenAI normally - LLMOptimize tracks automatically!
client = openai.OpenAI()
response = client.chat.completions.create(
    model="gpt-4",
    messages=[{"role": "user", "content": "Hello!"}]
)

# See your beautiful cost report (NO API KEY NEEDED!)
llmoptimize.report()
Output:
╔══════════════════════════════════════════════════════╗
║                                                      ║
║         L L M O P T I M I Z E   R E P O R T          ║
║                                                      ║
║     Recommendations from OUR AI (No Key Needed!)     ║
║                                                      ║
╚══════════════════════════════════════════════════════╝
YOUR USAGE SUMMARY

Total API Calls Tracked: 15
Total Cost: $0.1047
Potential Savings: $0.0938

That's 90% you could save!

MODEL USAGE

gpt-4: 10 calls
text-embedding-3-large: 5 calls

SMART RECOMMENDATIONS
(From OUR AI - no API key needed!)
╭────────────────────────────────────────────────╮
│ #1 Recommendation                              │
├────────────────────────────────────────────────┤
│ gpt-4o-mini                                    │
│ Save 99%                                       │
│                                                │
│ Used gpt-4 10x - gpt-4o-mini is 99% cheaper    │
╰────────────────────────────────────────────────╯
╭────────────────────────────────────────────────╮
│ #2 Recommendation                              │
├────────────────────────────────────────────────┤
│ text-embedding-3-small                         │
│ Save 85%                                       │
│                                                │
│ Used text-embedding-3-large 5x - smaller model │
│ is 85% cheaper with 95%+ quality               │
╰────────────────────────────────────────────────╯
All powered by OUR server intelligence!

Installation
pip install llmoptimize
That's it! No configuration needed. Just import and use.
Features

Auto-Tracking
- Automatically tracks OpenAI, Anthropic, Groq, and LangChain
- Zero configuration required
- Works with existing code - no changes needed
- Supports: Chat, Embeddings, Images, Audio

Smart Recommendations
- ML-powered model suggestions
- Context-aware (embeddings → embeddings, chat → chat)
- Learns from your usage patterns
- Based on 50+ AI models across all providers

Beautiful Dashboards
- Interactive animated terminal UI
- Real-time cost tracking
- Usage analytics by model
- Weekly usage timelines

Privacy-First Design
- No prompts sent to our server
- Only metadata tracked (model, tokens)
- Anonymous session IDs
- GDPR compliant

No API Keys Required
- Reports work without ANY API keys
- AI recommendations from OUR server
- You keep your keys, we provide the intelligence

Supported Providers & Models
OpenAI
- ✅ Chat: GPT-4, GPT-4o, GPT-4o-mini, GPT-3.5-turbo, o1, o1-mini, o3-mini
- ✅ Embeddings: text-embedding-3-large, text-embedding-3-small, ada-002
- ✅ Images: DALL-E 3, DALL-E 2
- ✅ Audio: Whisper

Anthropic
- ✅ Claude 3.5: Sonnet, Haiku
- ✅ Claude 3: Opus, Sonnet, Haiku

Google
- ✅ Gemini 2.0: Pro, Flash, Flash-Lite
- ✅ Gemini 1.5: Pro, Flash, Flash-8B

Groq
- ✅ Llama 3.3 70B, Llama 3.1 70B/8B
- ✅ Mixtral 8x7B
- ✅ Gemma 2 9B

Mistral
- ✅ Large, Medium, Small, Nemo, Codestral
- ✅ Open Mixtral, Open Mistral

Cohere
- ✅ Command R+, Command R, Command

Total: 50+ models tracked with accurate pricing!
Usage Examples
Example 1: Chat Completion
import llmoptimize
import openai

client = openai.OpenAI()

# Make expensive call
response = client.chat.completions.create(
    model="gpt-4",  # $0.03 per 1K input tokens
    messages=[{"role": "user", "content": "Explain quantum computing"}],
    max_tokens=500
)

# See recommendations
llmoptimize.report()
# Suggests: gpt-4o-mini (99% cheaper!) or gpt-3.5-turbo (95% cheaper)
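The "99% cheaper" figure follows directly from per-token pricing. Here is a rough sanity check; gpt-4's input price matches the comment above, while the other prices are assumptions added for illustration (USD per 1M tokens):

```python
# Rough per-call cost comparison. gpt-4's input price matches the
# comment above ($0.03/1K tokens = $30/1M); the remaining prices
# are illustrative assumptions, all in USD per 1M tokens.
PRICES = {
    "gpt-4":       {"input": 30.00, "output": 60.00},
    "gpt-4o-mini": {"input": 0.15,  "output": 0.60},
}

def call_cost(model, prompt_tokens, completion_tokens):
    """Cost of one call: tokens times per-million-token price."""
    p = PRICES[model]
    return (prompt_tokens * p["input"] + completion_tokens * p["output"]) / 1_000_000

# A call with 1,000 prompt tokens and 500 completion tokens:
gpt4 = call_cost("gpt-4", 1000, 500)        # $0.06
mini = call_cost("gpt-4o-mini", 1000, 500)  # $0.00045
savings = 1 - mini / gpt4                   # ~0.9925, i.e. ~99%
```

At these assumed prices, switching this one call saves roughly 99% of its cost.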
Example 2: Embeddings
import llmoptimize
import openai

client = openai.OpenAI()

# Expensive embeddings
embeddings = client.embeddings.create(
    model="text-embedding-3-large",  # $0.13 per 1M tokens
    input=["Document 1", "Document 2", "Document 3"]
)

llmoptimize.report()
# Suggests: text-embedding-3-small (85% cheaper with 95%+ quality)
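The 85% figure can be sanity-checked from list prices. text-embedding-3-large's price is taken from the comment above; text-embedding-3-small's price here is an assumption for illustration:

```python
# Price per 1M tokens, USD. The large model's price matches the
# comment above; the small model's price is an assumption.
LARGE_PRICE = 0.13
SMALL_PRICE = 0.02

savings = 1 - SMALL_PRICE / LARGE_PRICE  # ~0.846, i.e. roughly 85%
```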
Example 3: Anthropic Claude
import llmoptimize
import anthropic

client = anthropic.Anthropic()

# Expensive Claude Opus
response = client.messages.create(
    model="claude-3-opus-20240229",  # $15 per 1M input tokens
    max_tokens=1000,
    messages=[{"role": "user", "content": "Write a poem"}]
)

llmoptimize.report()
# Suggests: claude-3-5-haiku (95% cheaper) or claude-3-haiku (98% cheaper)
Example 4: Mixed Usage
import llmoptimize
import openai
client = openai.OpenAI()
# Multiple different calls
client.chat.completions.create(model="gpt-4", messages=[...])
client.chat.completions.create(model="gpt-3.5-turbo", messages=[...])
client.embeddings.create(model="text-embedding-3-large", input=[...])
llmoptimize.report()
# Gets CONTEXT-AWARE recommendations for each model type!
Example 5: Simple Text Report
import llmoptimize
# For scripts/logs - simple text output
llmoptimize.report(interactive=False)
Output:
LLMOptimize Report
────────────────────────────────────────
Calls: 15
Cost: $0.1047
Savings: $0.0938 (90%)

Models:
  gpt-4: 10
  text-embedding-3-large: 5

Recommendations (from OUR AI):
  1. gpt-4o-mini - 99%
  2. text-embedding-3-small - 85%

No API key needed!
Example 6: Manual Tracking
import llmoptimize
# For custom APIs or non-supported providers
llmoptimize.track(
    model="custom-model-v1",
    prompt_tokens=1000,
    completion_tokens=500,
    provider="custom"
)
llmoptimize.report()
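For a sense of what metadata-only tracking involves, here is a minimal local stand-in. It is purely illustrative, shares nothing with the real package's internals, and (like the design described in the Privacy section) records only model names, token counts, and provider, never prompt text:

```python
from dataclasses import dataclass, field

@dataclass
class Usage:
    """Metadata for one call: no prompts, no responses."""
    model: str
    prompt_tokens: int
    completion_tokens: int
    provider: str

@dataclass
class LocalTracker:
    """Illustrative stand-in that accumulates call metadata locally."""
    calls: list = field(default_factory=list)

    def track(self, model, prompt_tokens, completion_tokens, provider):
        self.calls.append(Usage(model, prompt_tokens, completion_tokens, provider))

    def total_tokens(self):
        return sum(c.prompt_tokens + c.completion_tokens for c in self.calls)

tracker = LocalTracker()
tracker.track("custom-model-v1", 1000, 500, "custom")
```

The same four fields mirror the arguments of the `llmoptimize.track()` call shown above.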
Advanced Features
1. LangChain Integration
import llmoptimize
from langchain_openai import ChatOpenAI
# LangChain auto-tracked!
llm = ChatOpenAI(model="gpt-4")
result = llm.invoke("Hello!")
llmoptimize.report()
# Automatically tracks LangChain calls
2. Agent Workflow Tracking
import llmoptimize

# Multi-step agent workflows are tracked automatically:
# loop detection, context-window monitoring, and a
# step-by-step cost breakdown.
llmoptimize.report()
# Shows per-step costs and optimization opportunities
3. Real-Time Monitoring
import llmoptimize

# In a long-running production script, print a plain-text
# report periodically.
call_count = 0
while True:
    # Your AI calls...
    call_count += 1

    # Periodic cost check every 100 calls
    if call_count % 100 == 0:
        llmoptimize.report(interactive=False)
Real Results
Case Study 1: SaaS Startup
"We reduced our OpenAI bill from $4,200/month to $380/month using LLMOptimize recommendations. The ROI was immediate."
– AI-powered SaaS, 50K users

Changes made:
- GPT-4 → GPT-4o-mini for 80% of tasks: 99% savings
- text-embedding-3-large → text-embedding-3-small: 85% savings
- GPT-4 → Claude-3-Haiku for simple Q&A: 98% savings
Case Study 2: AI Research Team
"The interactive dashboard makes it so easy to spot optimization opportunities. Saved us 92% on GPT-4 costs without quality loss."
– University AI Lab

Changes made:
- GPT-4 → GPT-3.5-turbo for data classification: 95% savings
- Claude-3-Opus → Claude-3-Sonnet for analysis: 80% savings
- Eliminated redundant embedding calls: 100% savings
Case Study 3: E-commerce Platform
"We were using GPT-4 for EVERYTHING. LLMOptimize showed us where GPT-3.5-turbo or even Groq's free tier would work. $2,800/month → $290/month."
– E-commerce AI chatbot, 100K conversations/month
How It Works

┌──────────────────────────────────────────────┐
│ 1. YOU: Import llmoptimize                   │
│    Auto-patches OpenAI/Anthropic/Groq SDKs   │
└──────────────────────────────────────────────┘
                       ↓
┌──────────────────────────────────────────────┐
│ 2. YOU: Make API calls normally              │
│    client.chat.completions.create(...)       │
│    client.embeddings.create(...)             │
└──────────────────────────────────────────────┘
                       ↓
┌──────────────────────────────────────────────┐
│ 3. LLMOptimize: Tracks metadata              │
│    Model name, token counts                  │
│    Sends to OUR server (not your API!)       │
│    NO prompts sent (privacy-first!)          │
└──────────────────────────────────────────────┘
                       ↓
┌──────────────────────────────────────────────┐
│ 4. OUR Server: Analyzes with OUR AI          │
│    Uses ML + 50+ model pricing data          │
│    Context-aware recommendations             │
│    Learning from usage patterns              │
└──────────────────────────────────────────────┘
                       ↓
┌──────────────────────────────────────────────┐
│ 5. YOU: See beautiful report                 │
│    llmoptimize.report()                      │
│    NO API KEY NEEDED!                        │
│    Smart, actionable recommendations         │
└──────────────────────────────────────────────┘
Privacy & Security

What We Track:
- ✅ Model name (e.g., "gpt-4")
- ✅ Token counts (input/output)
- ✅ Anonymous session ID
- ✅ Provider (OpenAI/Anthropic/etc.)

What We DON'T Track:
- ❌ Your prompts or AI responses
- ❌ Personal information
- ❌ API keys
- ❌ IP addresses
- ❌ User identifiers

Data Storage:
- Anonymous sessions only
- No PII collected
- GDPR compliant
- Can run self-hosted if needed
Why LLMOptimize?

| Feature | LLMOptimize | Competitors |
|---|---|---|
| No API Keys for Reports | ✅ Our AI, Our Keys | ❌ Need your keys |
| Context-Aware Recommendations | ✅ Embeddings → Embeddings | ❌ Generic suggestions |
| Auto-Tracking | ✅ Zero config | ⚠️ Manual setup |
| Beautiful UI | ✅ Animated terminal | ❌ Plain text |
| 50+ Models | ✅ All providers | ⚠️ OpenAI only |
| Privacy-First | ✅ No prompts sent | ⚠️ Varies |
| ML Learning | ✅ Improves over time | ❌ Static rules |
| Free Tier | ✅ Unlimited tracking | ⚠️ Limited |
Configuration
Environment Variables (Optional)
# Custom server URL (for self-hosted)
export LLMOPTIMIZE_SERVER_URL=https://your-server.com
# Session ID (auto-generated if not set)
export LLMOPTIMIZE_SESSION_ID=your-session-id
Programmatic Config
import llmoptimize
# Use custom server
llmoptimize.SERVER_URL = "https://your-server.com"
# Custom session
llmoptimize.SESSION_ID = "my-session"
Installation Options
Basic (Default)
pip install llmoptimize
With All Provider SDKs
pip install llmoptimize[full]
# Includes: anthropic, groq, langchain
Development Tools
pip install llmoptimize[dev]
# Includes: pytest, black, mypy
Contributing
This is a proprietary project. For bugs and feature requests, please open an issue on GitHub.

License
Proprietary - All Rights Reserved

Links
- Homepage: https://aioptimize.up.railway.app
- GitHub: https://github.com/hackrudra1234/llmoptimize
- PyPI: https://pypi.org/project/llmoptimize
- Documentation: Coming soon!
Support
- Issues: GitHub Issues
- Email: hackrudra@gmail.com
- Twitter: Coming soon!
Get Started Now!
# Install
pip install llmoptimize
# Use
import llmoptimize
# ... your AI code ...
llmoptimize.report()
Save 90% on AI costs. Beautiful reports. Zero configuration. No API keys needed!

Made with ❤️ by the LLMOptimize Team
Star us on GitHub if this saves you money! ⭐
Project details

Download files

Source Distribution

Built Distribution
File details
Details for the file llmoptimize-3.2.0.tar.gz.
File metadata
- Download URL: llmoptimize-3.2.0.tar.gz
- Upload date:
- Size: 11.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 558ab4c7548deb7b71e69c64691282993aaf3e16fc0c20ebeebc0cb52389a06a |
| MD5 | 66a9bd05397658351ce6333d74968482 |
| BLAKE2b-256 | 422e92ab1ce7231422ac8c16f2754275e30dcab9faf9f751a016717821f18a45 |
File details
Details for the file llmoptimize-3.2.0-py3-none-any.whl.
File metadata
- Download URL: llmoptimize-3.2.0-py3-none-any.whl
- Upload date:
- Size: 10.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 418db9b20c1f9fd4021c4abaad3b678d97014b9830dcfa2bebd8b5a05258ca3e |
| MD5 | 8296e6a04766a418a296995bb733978c |
| BLAKE2b-256 | 489a35b60b97af21fdbb498a3335de4cf07dc8e11a694d28c41171801eb5367c |