Production-grade AI agent framework with RAG, memory, tools, and multi-model support

These details have not been verified by PyPI

Project description

Definable

Build LLM agents that work in production.

A Python framework for building agent applications with tools, RAG, persistent memory, guardrails, skills, file readers, messaging platform integrations, and the Model Context Protocol. Switch providers without rewriting agent code.

Install

pip install definable

Quick Start

from definable.agents import Agent
from definable.models.openai import OpenAIChat

agent = Agent(
    model=OpenAIChat(id="gpt-4o-mini"),
    instructions="You are a helpful assistant.",
)

output = agent.run("What is the capital of Japan?")
print(output.content)  # "The capital of Japan is Tokyo."

Add Tools

from definable.tools.decorator import tool

@tool
def get_weather(city: str) -> str:
    """Get current weather for a city."""
    return f"Sunny, 72°F in {city}"

agent = Agent(
    model=OpenAIChat(id="gpt-4o-mini"),
    tools=[get_weather],
    instructions="Help users check the weather.",
)

output = agent.run("What's the weather in Tokyo?")

The agent calls tools automatically. No manual function routing.

Structured Output

from pydantic import BaseModel

class WeatherReport(BaseModel):
    city: str
    temperature: float
    conditions: str

agent = Agent(
    model=OpenAIChat(id="gpt-4o-mini"),
    tools=[get_weather],
    instructions="Report weather data.",
)

output = agent.run("Weather in Tokyo?", output_schema=WeatherReport)
report = output.parsed  # WeatherReport(city="Tokyo", temperature=72.0, ...)

Pass any Pydantic model to output_schema and get validated, typed results back.

Streaming

agent = Agent(
    model=OpenAIChat(id="gpt-4o-mini"),
    instructions="You are a helpful assistant.",
)

for event in agent.run_stream("Write a haiku about Python."):
    if event.content:
        print(event.content, end="", flush=True)

run_stream() yields events as they arrive — content chunks, tool calls, and completion signals.

Multi-Turn Conversations

output1 = agent.run("My name is Alice.")
output2 = agent.run("What's my name?", messages=output1.messages)
print(output2.content)  # "Your name is Alice."

Pass messages from a previous run to continue the conversation.

Persistent Memory

from definable.memory import CognitiveMemory, SQLiteMemoryStore

memory = CognitiveMemory(store=SQLiteMemoryStore("memory.db"))

agent = Agent(
    model=OpenAIChat(id="gpt-4o-mini"),
    memory=memory,
    instructions="You are a personal assistant.",
)

agent.run("My name is Alice and I prefer dark mode.", user_id="alice")
# Later, even in a new session...
agent.run("What's my name?", user_id="alice")  # Recalls "Alice"

Memory is automatic: the agent stores interactions and recalls relevant context on each turn. Eight store backends available (SQLite, PostgreSQL, Redis, Qdrant, Chroma, Pinecone, MongoDB, in-memory).

Knowledge Base (RAG)

from definable.knowledge import Knowledge, InMemoryVectorDB, Document
from definable.knowledge.embedders.openai import OpenAIEmbedder
from definable.agents import AgentConfig, KnowledgeConfig

kb = Knowledge(
    vector_db=InMemoryVectorDB(dimensions=1536),
    embedder=OpenAIEmbedder(),
)
kb.add(Document(content="Company vacation policy: 20 days PTO per year."))

agent = Agent(
    model=OpenAIChat(id="gpt-4o-mini"),
    instructions="You are an HR assistant.",
    config=AgentConfig(knowledge=KnowledgeConfig(knowledge=kb, top_k=3)),
)

output = agent.run("How many vacation days do I get?")

The agent retrieves relevant documents before responding. Supports embedders (OpenAI, Voyage), vector DBs (in-memory, PostgreSQL), rerankers (Cohere), and chunkers.

Guardrails

from definable.guardrails import Guardrails, max_tokens, pii_filter, tool_blocklist

agent = Agent(
    model=OpenAIChat(id="gpt-4o-mini"),
    instructions="You are a support agent.",
    tools=[get_weather],
    guardrails=Guardrails(
        input=[max_tokens(500)],
        output=[pii_filter()],
        tool=[tool_blocklist(["dangerous_tool"])],
    ),
)

output = agent.run("What's the weather?")

Guardrails check, modify, or block content at input, output, and tool-call checkpoints. Built-ins include token limits, PII redaction, topic blocking, and regex filters. Compose rules with ALL, ANY, NOT, and when().

Skills

from definable.skills import Calculator, WebSearch, DateTime

agent = Agent(
    model=OpenAIChat(id="gpt-4o-mini"),
    skills=[Calculator(), WebSearch(), DateTime()],
    instructions="You are a helpful assistant.",
)

output = agent.run("What is 15% of 230?")

Skills bundle domain expertise (instructions) with tools. Built-in skills include Calculator, WebSearch, DateTime, HTTPRequests, JSONOperations, TextProcessing, Shell, and FileOperations. Create custom skills by subclassing Skill.

File Readers

from definable.media import File

agent = Agent(
    model=OpenAIChat(id="gpt-4o-mini"),
    readers=True,
    instructions="Summarize the uploaded document.",
)

output = agent.run("Summarize this.", files=[File(filepath="report.pdf")])

Pass readers=True to enable automatic parsing. Supports PDF, DOCX, PPTX, XLSX, ODS, RTF, HTML, images, and audio. AI-powered OCR available via Mistral, OpenAI, Anthropic, and Google providers.

Deploy It

from definable.triggers import Webhook, Cron
from definable.auth import APIKeyAuth

agent = Agent(
    model=OpenAIChat(id="gpt-4o-mini"),
    instructions="You are a support agent.",
)

agent.on(Webhook(path="/support", method="POST"))
agent.on(Cron(schedule="0 9 * * *", instruction="Send the daily summary."))
agent.auth = APIKeyAuth(keys=["sk-my-secret-key"])
agent.serve(host="0.0.0.0", port=8000, dev=True)

agent.serve() starts an HTTP server with registered webhooks, cron triggers, and interfaces in a single process. Add dev=True for hot-reload during development.

Connect to Platforms

from definable.interfaces.telegram import TelegramInterface, TelegramConfig

telegram = TelegramInterface(
    config=TelegramConfig(bot_token="BOT_TOKEN"),
)

agent = Agent(
    model=OpenAIChat(id="gpt-4o-mini"),
    instructions="You are a Telegram bot.",
)

agent.serve(telegram)

One agent, multiple platforms. Discord and Signal interfaces also available.

MCP

from definable.mcp import MCPConfig, MCPServerConfig, MCPToolkit

config = MCPConfig(
    servers=[
        MCPServerConfig(
            name="filesystem",
            command="npx",
            args=["-y", "@modelcontextprotocol/server-filesystem", "/tmp"],
        )
    ]
)

async with MCPToolkit(config) as toolkit:
    agent = Agent(model=OpenAIChat(id="gpt-4o-mini"), toolkits=[toolkit])
    await agent.arun("List files in /tmp")

Connect to any MCP server. Use the same tools as Claude Desktop.

Replay & Compare

from definable.agents import MockModel

# Inspect a past run
output = agent.run("Explain quantum computing.")
replay = agent.replay(run_output=output)
print(replay.steps)       # Each model call and tool invocation
print(replay.tokens)      # Token usage breakdown

# Re-run with a different model and compare
new_output = agent.replay(run_output=output, model=OpenAIChat(id="gpt-4o"))
comparison = agent.compare(output, new_output)
print(comparison.cost_diff)   # Cost difference between runs
print(comparison.token_diff)  # Token usage difference

Replay lets you inspect past runs, re-execute them with different models or instructions, and compare results side by side.

Testing

from definable.agents import Agent, MockModel

agent = Agent(
    model=MockModel(responses=["The capital of France is Paris."]),
    instructions="You are a geography expert.",
)

output = agent.run("What is the capital of France?")
assert "Paris" in output.content

MockModel returns canned responses — no API keys needed. Use it in unit tests to verify agent behavior deterministically.

Features

Category	Details
Models	OpenAI, DeepSeek, Moonshot, xAI, any OpenAI-compatible provider
Agents	Multi-turn conversations, structured output, configurable retries, max iterations
Tools	`@tool` decorator with automatic parameter extraction from type hints and docstrings
Toolkits	Composable tool groups, `KnowledgeToolkit` for explicit RAG search
Skills	Domain expertise + tools in one package; 8 built-in skills, custom `Skill` subclass
Knowledge / RAG	Embedders, vector DBs, rerankers (Cohere), chunkers, automatic retrieval
Memory	`CognitiveMemory` with multi-tier recall, distillation, topic prediction
Memory Stores	SQLite, PostgreSQL, Redis, Qdrant, Chroma, Pinecone, MongoDB, in-memory
Readers	PDF, DOCX, PPTX, XLSX, ODS, RTF, HTML, images, audio
Reader Providers	Mistral OCR, OpenAI, Anthropic, Google (AI-powered document parsing)
Guardrails	Input/output/tool checkpoints, PII redaction, token limits, topic blocking, regex filters
Guardrails Composition	`ALL`, `ANY`, `NOT`, `when()` combinators for complex policy rules
Interfaces	Telegram, Discord, Signal, session management, identity resolution
Runtime	`agent.serve()`, webhooks, cron triggers, event triggers, `dev=True` hot-reload
Auth	`APIKeyAuth`, `JWTAuth`, `AllowlistAuth`, `CompositeAuth`, pluggable `AuthProvider` protocol
Streaming	Real-time response and tool call streaming
Replay	Inspect past runs, re-execute with overrides, `agent.compare()` for side-by-side diffs
Middleware	Request/response transforms, logging, retry, metrics
Tracing	JSONL trace export for debugging and analysis
Compression	Automatic context window management for long conversations
Testing	`MockModel`, `AgentTestCase`, `create_test_agent` utilities
MCP	Model Context Protocol client for external tool servers
Types	Full Pydantic models, mypy verified

Supported Models

from definable.models.openai import OpenAIChat      # GPT-4o, GPT-4o-mini, o1, o3, ...
from definable.models.deepseek import DeepSeekChat   # deepseek-chat, deepseek-reasoner
from definable.models.moonshot import MoonshotChat   # moonshot-v1-8k, moonshot-v1-128k
from definable.models.xai import xAIChat             # grok-2-latest

Any OpenAI-compatible API works with OpenAIChat(base_url=..., api_key=...).

Optional Extras

Install only what you need:

pip install definable[readers]          # PDF, DOCX, PPTX, XLSX, ODS, RTF parsers
pip install definable[serve]            # FastAPI + Uvicorn for agent.serve()
pip install definable[cron]             # Cron trigger support
pip install definable[jwt]              # JWT authentication
pip install definable[runtime]          # serve + cron combined
pip install definable[discord]          # Discord interface
pip install definable[signal]           # Signal interface
pip install definable[interfaces]       # All interface dependencies
pip install definable[postgres-memory]  # PostgreSQL memory store
pip install definable[redis-memory]     # Redis memory store
pip install definable[qdrant-memory]    # Qdrant memory store
pip install definable[chroma-memory]    # Chroma memory store
pip install definable[mongodb-memory]   # MongoDB memory store
pip install definable[pinecone-memory]  # Pinecone memory store
pip install definable[mistral-ocr]      # Mistral AI document parsing
pip install definable[mistral-ocr-images]  # Mistral OCR with image support

Documentation

Full documentation: definable.ai/docs

Project Structure

definable/definable/
├── agents/        # Agent orchestration, config, middleware, tracing, testing
├── auth/          # APIKeyAuth, JWTAuth, AllowlistAuth, CompositeAuth
├── compression/   # Context window compression
├── guardrails/    # Input/output/tool policy, PII, token limits, composable rules
├── interfaces/    # Telegram, Discord, Signal integrations
├── knowledge/     # RAG: embedders, vector DBs, rerankers, chunkers
├── mcp/           # Model Context Protocol client
├── media.py       # Image, Audio, Video, File types
├── memory/        # CognitiveMemory + 8 store backends
├── models/        # OpenAI, DeepSeek, Moonshot, xAI providers
├── readers/       # File parsers + AI reader providers
├── reasoning/     # Reasoning capabilities
├── replay/        # Run inspection, re-execution, comparison
├── run/           # RunOutput, RunEvent types
├── runtime/       # AgentRuntime, AgentServer, dev mode
├── skills/        # Built-in + custom skills, skill registry
├── tokens.py      # Token counting utilities
├── tools/         # @tool decorator, tool wrappers
├── triggers/      # Webhook, Cron, EventTrigger
├── utils/         # Logging, supervisor, shared utilities
└── vectordbs/     # Vector database interfaces

Contributing

Contributions welcome! To get started:

Fork the repo and clone it locally
Install for development: pip install -e .
Make your changes — follow existing code patterns
Add tests in definable/tests_e2e/ for new features
Run ruff check and ruff format for linting
Run mypy for type checking
Open a pull request

See definable/examples/ for usage patterns.

License

MIT

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.8.0

Mar 24, 2026

0.7.0

Mar 9, 2026

0.6.0

Feb 27, 2026

0.5.0

Feb 26, 2026

0.4.0

Feb 25, 2026

0.3.2

Feb 25, 2026

0.3.1

Feb 23, 2026

0.3.0

Feb 20, 2026

This version

0.2.8

Feb 17, 2026

0.2.7

Feb 16, 2026

0.2.6

Feb 11, 2026

0.2.5

Aug 4, 2025

0.2.4

Aug 4, 2025

0.2.3

Aug 4, 2025

0.2.2

Aug 4, 2025

0.2.1

Aug 4, 2025

0.2.0

Jul 30, 2025

0.1.9

Jul 29, 2025

0.1.8

Jul 27, 2025

0.1.7

Jul 27, 2025

0.1.6

Jul 27, 2025

0.1.5

Jul 19, 2025

0.1.4

Jul 19, 2025

0.1.3

Jul 19, 2025

0.1.2

Jul 19, 2025

0.1.1

Jul 19, 2025

0.1.0

Jul 19, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

definable-0.2.8.tar.gz (493.2 kB view details)

Uploaded Feb 17, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

definable-0.2.8-py3-none-any.whl (671.2 kB view details)

Uploaded Feb 17, 2026 Python 3

File details

Details for the file definable-0.2.8.tar.gz.

File metadata

Download URL: definable-0.2.8.tar.gz
Upload date: Feb 17, 2026
Size: 493.2 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for definable-0.2.8.tar.gz
Algorithm	Hash digest
SHA256	`a1b16d8e43ff507afaa2a3f4facddc6b00c552d153046c7d5b9921864b73ce3d`
MD5	`08fffaefca8209a409232f71f7965010`
BLAKE2b-256	`9e7ca9ebeca1a4bc7fdb57129bb71e879dac2c9302b994152616e180c54a9f4f`

See more details on using hashes here.

Provenance

The following attestation bundles were made for definable-0.2.8.tar.gz:

Publisher: publish.yml on definableai/definable.ai

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: definable-0.2.8.tar.gz
- Subject digest: a1b16d8e43ff507afaa2a3f4facddc6b00c552d153046c7d5b9921864b73ce3d
- Sigstore transparency entry: 956403876
- Sigstore integration time: Feb 17, 2026
Source repository:
- Permalink: definableai/definable.ai@e3ddaf6c61a436fc1e9a8627eb11f632aa0ff307
- Branch / Tag: refs/tags/v0.2.8
- Owner: https://github.com/definableai
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@e3ddaf6c61a436fc1e9a8627eb11f632aa0ff307
- Trigger Event: release

File details

Details for the file definable-0.2.8-py3-none-any.whl.

File metadata

Download URL: definable-0.2.8-py3-none-any.whl
Upload date: Feb 17, 2026
Size: 671.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for definable-0.2.8-py3-none-any.whl
Algorithm	Hash digest
SHA256	`8fe1ec5fc2d5384f64850158b90b6568a8fef7a1da0b7d834ea167840219c56d`
MD5	`eaa9b0b1e4d1aa3d10cd3f7c1b49faf3`
BLAKE2b-256	`b91e6768d1016cb2aaaec5a2855e59659eb473166b2ece60e523ef96034d1007`

See more details on using hashes here.

Provenance

The following attestation bundles were made for definable-0.2.8-py3-none-any.whl:

Publisher: publish.yml on definableai/definable.ai

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: definable-0.2.8-py3-none-any.whl
- Subject digest: 8fe1ec5fc2d5384f64850158b90b6568a8fef7a1da0b7d834ea167840219c56d
- Sigstore transparency entry: 956403894
- Sigstore integration time: Feb 17, 2026
Source repository:
- Permalink: definableai/definable.ai@e3ddaf6c61a436fc1e9a8627eb11f632aa0ff307
- Branch / Tag: refs/tags/v0.2.8
- Owner: https://github.com/definableai
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@e3ddaf6c61a436fc1e9a8627eb11f632aa0ff307
- Trigger Event: release

definable 0.2.8

Navigation

Verified details

Maintainers

Unverified details

Meta

Classifiers

Project description

Definable

Install

Quick Start

Add Tools

Structured Output

Streaming

Multi-Turn Conversations

Persistent Memory

Knowledge Base (RAG)

Guardrails

Skills

File Readers

Deploy It

Connect to Platforms

MCP

Replay & Compare

Testing

Features

Supported Models

Optional Extras

Documentation

Project Structure

Contributing

License

Project details

Verified details

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance