Skip to main content

Drop-in Tessera integration for CrewAI. One line of config routes your CrewAI agents' LLM calls through Tessera's auto-route + auto-cache + auto-compress + auto-batch proxy. Free 60M tokens/mo. Production: 20% of measured savings.

Project description

tessera-crewai

PyPI Python License

Drop-in Tessera integration for CrewAI. One line of config routes every LLM call your crew makes through Tessera's auto-route + auto-cache + auto-compress + auto-batch proxy.

Free 60M tokens/month. Production: 20% of measured savings. No card up front.

Install

pip install tessera-crewai

Requires Python 3.10+. CrewAI is a peer dependency. Install it in your environment alongside this package.

Usage

The most common pattern uses one of the bundled factory functions to construct a pre-wired CrewAI LLM instance:

from crewai import Agent, Crew, Task
from tessera_crewai import tessera_openai_llm

llm = tessera_openai_llm(
    model="gpt-4o",
    openai_api_key="sk-...",   # your OpenAI key
    tessera_api_key="tk_...",  # get a free one at tesseraai.io/dev
)

researcher = Agent(
    role="Senior Researcher",
    goal="Uncover cutting-edge developments in AI",
    backstory="You are a seasoned researcher...",
    llm=llm,
)

# Rest of your CrewAI code runs unchanged. Crew, Task, kickoff()
# all route through Tessera and benefit from auto-optimization.

For Anthropic models:

from tessera_crewai import tessera_anthropic_llm

llm = tessera_anthropic_llm(
    model="claude-sonnet-4-6",
    anthropic_api_key="sk-ant-...",
    tessera_api_key="tk_...",
)

For explicit LLM construction (rare; useful when you need fine-grained LLM kwargs):

from crewai import LLM
from tessera_crewai import tessera_openai_config

llm = LLM(
    model="openai/gpt-4o",
    api_key="sk-...",
    **tessera_openai_config(api_key="tk_..."),
)

What Tessera does for your CrewAI workloads

  • Auto-route: calls to expensive models are evaluated for a cheaper alternative that preserves quality on canary samples.
  • Auto-cache: exact-match + semantic cache for repeat queries. CrewAI's tool-use loops often hit identical sub-prompts; cache returns are free.
  • Auto-compress: per-role heuristic compression on system prompts and verbose tool descriptions (system + user toggles independent). Preserves code fences and JSON shapes. 5–15% on prompt tokens.
  • Auto-batch: async crews with batch-tolerant SLAs get arbitraged onto provider batch APIs for ~50% cost reduction.

All gated by per-workload quality canaries; toggle any mechanic on/off from the Tessera dashboard. Free Sandbox tier gives you observe-only mechanics; Production tier unlocks the full stack.

Supported providers (v0.1)

Provider Status Config function
OpenAI ✅ verified tessera_openai_config, tessera_openai_llm
Anthropic ✅ verified tessera_anthropic_config, tessera_anthropic_llm
Mistral / Groq / Cohere 🚧 queued for v0.2 n/a

v0.1 covers ~85% of customer traffic per our outreach research. Open an issue if you need a provider on the queue surfaced sooner.

Companion packages

Companion to tessera-sdk (vanilla provider SDKs), tessera-langchain (LangChain integration), tessera-vercel-ai (Vercel AI SDK integration), tessera-llamaindex (LlamaIndex integration), tessera-mastra (Mastra Agent framework integration), tessera-pydantic-ai (Pydantic AI integration), and tessera-autogen (AutoGen 0.4+ multi-agent integration). Same proxy, same mechanic stack, CrewAI-shaped API.

License

Apache 2.0. See LICENSE.


About Tessera

Tessera is the substrate layer for LLM cost optimization, also called the Optimize Layer in our product surface. A thin proxy that sits in your application's request-path, applies a conservative cascade of optimization mechanics, and measures every saved dollar against an audit-immutable baseline. We bill 20% of verified savings, prepaid. Zero savings = zero fee. No per-token gateway fee, no subscription, no minimum monthly commitment; the category we operate in is "success-fee LLM optimizer," distinct from per-token AI gateways and observability dashboards.

Where observability tools tell you what you spent and AI gateways re-shape the request without measuring the cost delta, Tessera is the layer that does both, and only takes a cut when the measured savings are positive. The verified-savings ledger at ledger.tesseraai.io shows every original-vs-actual cost pair, snapshot-pinned to a pricing_catalog version captured at request time. Mid-contract price changes don't retroactively alter past savings. This is the FinOps-friendly model for AI inference: every line of the bill traces to a code-enforced rule.

Apache-2.0. Operated by Fintechagency OÜ (Tallinn, Estonia, registry code 16638667). Issues: github.com/tessera-llm/tessera-crewai/issues.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tessera_crewai-0.1.1.tar.gz (11.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

tessera_crewai-0.1.1-py3-none-any.whl (11.0 kB view details)

Uploaded Python 3

File details

Details for the file tessera_crewai-0.1.1.tar.gz.

File metadata

  • Download URL: tessera_crewai-0.1.1.tar.gz
  • Upload date:
  • Size: 11.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.10

File hashes

Hashes for tessera_crewai-0.1.1.tar.gz
Algorithm Hash digest
SHA256 f81b7c373387710c02dce86a6812c8f3563fd0fd0bafd40f757c3e7eb4fbeb97
MD5 3f79c652e459b4f6a9d9c49ec83a1746
BLAKE2b-256 a7225ce2d1b0b61daa37fd7dc449cd690213eedb7e357543f4fd9d6249d435a1

See more details on using hashes here.

File details

Details for the file tessera_crewai-0.1.1-py3-none-any.whl.

File metadata

  • Download URL: tessera_crewai-0.1.1-py3-none-any.whl
  • Upload date:
  • Size: 11.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.10

File hashes

Hashes for tessera_crewai-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 fe4490cfab9e8f37bf5647295f0a1b9abd0ee315fcccb83fa8af840889d41bce
MD5 2cb06fe376f6cdad56d96de12d29a43f
BLAKE2b-256 1aafa6b8991e799612272ca2dbbdbc2a1c3f3e2bbafed37347803ca42bc52421

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page