Skip to main content

Python SDK for AI Observability - Monitor LLM usage across 21+ models with smart routing, cost tracking, and budget management

Project description

AI Observability Python SDK

PyPI version Python 3.7+ License: MIT

Monitor, track, and optimize your LLM usage across 21+ AI models with one SDK.

✨ Features

  • 🚀 Smart Routing - Route prompts by cost, speed, or quality
  • 📊 Cost Tracking - Track usage and costs across all models
  • 💰 Budget Management - Set budgets and get alerts
  • 🔍 Model Comparison - Compare 21+ models side-by-side
  • 📚 RAG Knowledge Base - Search your documents
  • 🖼️ Image Generation - Generate images from text
  • Async Support - High-throughput async operations

✅ Confirmed Working Models

Provider Model Latency Cost
Groq llama-3.1-8b-instant 58ms ~$0.00003
Groq llama-3.3-70b-versatile 386ms ~$0.00003
Groq gpt-oss-120b 308ms ~$0.00014
Google gemma-3-27b-it 773ms FREE
Google gemma-3-1b-it 694ms FREE
Google gemini-2.5-flash 1640ms ~$0.000002

📦 Installation

pip install freelanceflow
🚀 Quick Start
python
from aiobservability import AIObservability

# Initialize client
client = AIObservability(api_key="your_api_key")

# Route a prompt (automatically picks best model)
response = client.route("What is machine learning?", preference="speed")
print(f"Response: {response['response']}")
print(f"Cost: ${response['cost']}")
print(f"Latency: {response['latencyMs']}ms")

📊 Usage Examples

Track LLM Usage

python
from aiobservability import LLMUsage

usage = LLMUsage(
    tenant_id="my-company",
    user_id="user-123",
    provider="groq",
    model="llama-3.1-8b-instant",
    prompt="What is AI?",
    completion="AI is artificial intelligence...",
    prompt_tokens=10,
    completion_tokens=50,
    duration_ms=150
)
client.track(usage)
Get Usage History
python

# Get history (uses default tenant)
history = client.get_usage_history(limit=10)
print(f"Found {len(history)} records")

# Or specify a different tenant
history = client.get_usage_history(tenant_id="other-tenant", limit=10)

# Compare Models

python
# Compare models side-by-side
comparison = client.compare_models("What is Python?")
for model in comparison['results']:
    print(f"{model['model']}: ${model['cost']} - {model['latencyMs']}ms")
Clean Up
python

# Close the client connection
client.close()

📖 API Reference
Full documentation: https://ai-api.usefreelanceflow.com/docs

🔑 Authentication
Get your API key from the AI Observability Dashboard.

Demo key for testing: ai_demo_key_12345

🧪 Testing Your Installation
After installing, verify it works:

bash
python -c "from aiobservability import AIObservability; print('SDK works!')"
🤝 Contributing
Contributions welcome! Please see CONTRIBUTING.md.

📄 License
MIT License - see LICENSE file for details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

freelanceflow-1.0.1.tar.gz (10.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

freelanceflow-1.0.1-py3-none-any.whl (8.9 kB view details)

Uploaded Python 3

File details

Details for the file freelanceflow-1.0.1.tar.gz.

File metadata

  • Download URL: freelanceflow-1.0.1.tar.gz
  • Upload date:
  • Size: 10.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.9

File hashes

Hashes for freelanceflow-1.0.1.tar.gz
Algorithm Hash digest
SHA256 890dfc2c45157bfe71848014e8fd24fc97d64b3be35daca364af4a37d91a0e73
MD5 7d10184172c656dac2427ca5c611a426
BLAKE2b-256 968fc30c986d6351bf61342ecec4316bd8297973c4169cc637fe8a09dd8e1ae6

See more details on using hashes here.

File details

Details for the file freelanceflow-1.0.1-py3-none-any.whl.

File metadata

  • Download URL: freelanceflow-1.0.1-py3-none-any.whl
  • Upload date:
  • Size: 8.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.9

File hashes

Hashes for freelanceflow-1.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 ee488832bab21f5f6be419d7366de9d497cc341dbbf30b4e73373dc2794d7c6a
MD5 a5f2a55c8c6b47a08320c74330c1b0df
BLAKE2b-256 be46ad364c1ab006ee171a92c87bf79d51c4d7ec551910ca803cb25e3ad78462

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page