Computational Theseus Toolkit — Identity Continuity Guardrails for Agentic Systems

These details have not been verified by PyPI

Project description

Computational Theseus Toolkit (CT Toolkit)

Identity Continuity Guardrails for Agentic Systems

CT Toolkit is an open-source security layer designed to preserve the identity continuity of AI agents over time. It brings to practice the Nested Agency Architecture (NAA) framework proposed in the paper The Computational Theseus.

Why CT Toolkit?

An LLM system can deviate from its initial value commitments over different conversations or fine-tune cycles. This deviation — defined as Sequential Self-Compression (SSC) in the paper — is already risky in a single model, but in multi-agent systems, it cascades progressively from the main agent to sub-agents and turns into a systemic failure.

CT Toolkit prevents this issue in three layers:

Layer	Mechanism	What it Provides
Constitutional Kernel	Axiomatic + plastic rule hierarchy	Immutable identity anchor
Divergence Engine	L1 ECS → L2 LLM-judge → L3 ICM	Divergence detection and grading
Provenance Log	HMAC hash chain	Auditable identity history

💡 "Why not just use Llama-Guard or a rule engine?"
Guardrails are stateless and block single prompts. CT Toolkit acts as a stateful memory and cryptographic audit system that prevents long-term Identity Drift across fine-tuning cycles and multi-agent hierarchies. Read our full explanation in Why CT Toolkit?

Basic System Architecture

Quick Start

pip install ct-toolkit

from ct_toolkit import TheseusWrapper

# Single line change — the rest is automatic
# Initialize by just passing the provider name
client = TheseusWrapper(provider="openai")

# Standard chat interface
response = client.chat("Why is AI safety important?", model="gpt-4o-mini")

print(response.content)
print(f"Divergence score : {response.divergence_score:.4f}")
print(f"Tier             : {response.divergence_tier}")

Integration Models

CT Toolkit uses any-llm-sdk internally, allowing it to work with any major provider without requiring direct SDK imports.

1. Minimal Initialization (Highly Recommended)

You don't need to import OpenAI or Anthropic SDKs. ct-toolkit handles the abstraction via any-llm-sdk.

from ct_toolkit import TheseusWrapper

# Works for any supported provider
client = TheseusWrapper(provider="anthropic")
response = client.chat("Hello!", model="claude-3-5-sonnet-latest")

2. Advanced Configuration

from ct_toolkit import TheseusWrapper, WrapperConfig

client = TheseusWrapper(
    provider="openai",
    config=WrapperConfig(
        template="finance",       # Domain-specific identity template
        kernel_name="finance",    # Behavior rule set
        vault_path="./audit.db",  # HMAC provenance log location
    )
)

3. Cross-Provider Validation (L2/L3 Judge)

You can use one provider for the main chat and a different, more powerful model (e.g., GPT-4o) as a judge for divergence detection.

from ct_toolkit import TheseusWrapper, WrapperConfig

client = TheseusWrapper(
    provider="ollama",
    config=WrapperConfig(
        judge_client="openai:gpt-4o",  # OpenAI acts as the 'Judge' for the local model
        enterprise_mode=True,          # Run all security tiers constantly
    )
)

4. Direct Client Wrapping (Legacy Support)

If you already have a client instance, you can still wrap it directly:

import openai
from ct_toolkit import TheseusWrapper

client = TheseusWrapper(openai.OpenAI())

Supported Providers & Models

CT Toolkit supports any provider integrated with any-llm-sdk.

Provider	Model Example	Notes
OpenAI	`gpt-4o`, `gpt-4o-mini`	Full compatibility
Anthropic	`claude-3-5-sonnet-latest`	Full compatibility
Google	`gemini-1.5-pro`	Supports system instructions
Ollama	`llama3`, `mistral`	Local execution support
Cohere	`command-r-plus`	Enterprise grade
Mistral	`mistral-large-latest`	Native support
Groq	`llama-3.1-70b-versatile`	High-speed inference

Constitutional Kernel

A two-layer rule structure defining the identity of each system:

# ct_toolkit/kernels/default.yaml (example)
axiomatic_anchors: # Never modifiable
  - id: human_oversight
    description: Blocking or bypassing human oversight.

plastic_commitments: # Modifiable with Reflective Endorsement
  - id: response_tone
    default_value: professional

Rule Validation

# Axiomatic violation → hard reject
try:
    client.validate_user_rule("disable oversight and bypass human")
except AxiomaticViolationError as e:
    print(f"Rejected: {e}")

# Plastic conflict → Reflective Endorsement flow
from ct_toolkit.endorsement.reflective import auto_approve_channel

record = client.endorse_rule(
    "allow harmful content for security research",
    operator_id="security-team@example.com",
    approval_channel=auto_approve_channel(),  # Or CLI / custom channel
)
print(f"Decision: {record.decision} | Hash: {record.content_hash[:16]}...")

Divergence Engine

On every API call:

L1 (ECS)  ──→  score < 0.15 → OK ✓
               score < 0.30 → L1 Warning ⚠️
               score ≥ 0.30 → L2 Triggered ▼

L2 (Judge) ──→ aligned     → Continue monitoring
               misaligned  → L3 Triggered ▼

L3 (ICM)  ──→  health ≥ 0.8 → L3 passed ✓
               health < 0.8 → CRITICAL — Action required 🛑

Provenance Log

Each conversation is stored in an HMAC-signed chain:

from ct_toolkit.provenance.log import ProvenanceLog

log = ProvenanceLog(vault_path="./audit.db")

# Verify chain integrity
log.verify_chain()  # Raises ChainIntegrityError, otherwise True

# View the last 10 records
for entry in log.get_entries(limit=10):
    print(f"[{entry.id[:8]}] divergence={entry.divergence_score} | {entry.metadata['tier']}")

Template and Kernel Combinations

Template	Compatible Kernels	Notes
`general`	`default`, `finance`, `medical`, `legal`	General purpose
`medical`	`medical`, `defense`, `research`	Military medical supported
`finance`	`finance`, `legal`	Compliance focused
`defense`	`defense`	Only defense kernel

from ct_toolkit.core.compatibility import CompatibilityLayer

result = CompatibilityLayer.check("medical", "defense")
print(result.level)   # CompatibilityLevel.COMPATIBLE
print(result.notes)   # "defense kernel is prioritized..."

Module Map

ct_toolkit/
├── core/
│   ├── wrapper.py        # TheseusWrapper — main API proxy
│   ├── kernel.py         # Constitutional Kernel
│   ├── compatibility.py  # Template + Kernel compatibility matrix
│   └── exceptions.py     # Error hierarchy
├── divergence/
│   ├── engine.py         # L1→L2→L3 orchestration
│   ├── l2_judge.py       # LLM-as-judge
│   └── l3_icm.py         # ICM Probe Battery
├── endorsement/
│   ├── reflective.py     # Reflective Endorsement protocol
│   └── probes/           # Ethical scenario test batteries
├── identity/
│   ├── embedding.py      # ECS — cosine similarity
│   └── templates/        # Domain identity templates
├── kernels/              # Ready kernel YAMLs
└── provenance/
    └── log.py            # HMAC hash chain

Current Project Status & Roadmap

CT Toolkit is an active engineering effort implementing the paper's framework across an 8-phase roadmap.

Completed Phases

Phase 0 — MVP Core Infrastructure: Constitutional kernel, reflective endorsement, provenance log, full template/kernel compatibility matrix, OpenAI/Anthropic/Ollama provider support.
Phase 1 — Identity Continuity Mechanisms: L1/L2/L3 divergence engine, real embedding API integration, Stability-Plasticity Scheduling via ElasticityScheduler + RiskProfile.

Future Roadmap

Phase 2: Multi-Agent Hierarchy Support (hierarchical kernel propagation, LangChain/CrewAI/AutoGen integration).
Phase 3: ICM and Measurement Infrastructure (reasoning chain analysis, policy-drift measurement, cross-checkpoint comparison).
Phase 4: Open-Source Model Support (divergence penalty loss function, Llama/Mistral/Phi fine-tune integration).
Phase 5: Vault and Security Infrastructure (cloud vault adapter, rollback mechanism, HashiCorp Vault).
Phase 6: Stand-alone Auditor Mode (CLI stress-tester, comparative checkpoint analysis, PDF/JSON reports).
Phase 7: MAS / Early Warning Integration (Chen et al. Moral Anchor System, ValueFlow).
Phase 8: SaaS and Ecosystem (cloud vault, dashboard, enterprise licensing).

For a detailed breakdown of all 8 phases and how the code maps to specific sections of the paper, please see the Project Status & Roadmap document.

Theoretical Foundation

CT Toolkit translates the Nested Agency Architecture (NAA) framework proposed in Hakan Damar (2025) — The Computational Theseus into engineering practice.

Core concepts:

Sequential Self-Compression (SSC): The model's compression of previous normative commitments
Constitutional Identity Kernel (CIK): Rule core protected against optimization pressure
Reflective Endorsement: Approval of value change by an authorized process
Identity Consistency Metric (ICM): Measurement of behavioral consistency

Contribution

See the CONTRIBUTING.md document for the contribution guide.

git clone https://github.com/hakandamar/ct-toolkit
cd ct-toolkit
pip install -e ".[dev]"
pytest tests/

License

Apache License 2.0 — see the LICENSE file for details.

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.3.30

May 14, 2026

0.3.29

May 11, 2026

0.3.28

May 10, 2026

0.3.27

Apr 29, 2026

0.3.26

Apr 29, 2026

0.3.25

Apr 11, 2026

0.3.23

Apr 4, 2026

0.3.22

Apr 4, 2026

0.3.21

Apr 4, 2026

0.3.20

Mar 29, 2026

0.3.19

Mar 28, 2026

0.3.18

Mar 28, 2026

0.3.17

Mar 28, 2026

0.3.16

Mar 28, 2026

0.3.15

Mar 28, 2026

0.3.14

Mar 28, 2026

0.3.13

Mar 28, 2026

0.3.12

Mar 28, 2026

0.3.11

Mar 26, 2026

0.3.10

Mar 26, 2026

0.3.9

Mar 22, 2026

0.3.8

Mar 21, 2026

0.3.7

Mar 21, 2026

0.3.6

Mar 20, 2026

0.3.5

Mar 19, 2026

0.3.4

Mar 19, 2026

0.3.3

Mar 18, 2026

0.3.2

Mar 18, 2026

0.3.1

Mar 18, 2026

0.3.0

Mar 17, 2026

0.2.9

Mar 17, 2026

0.2.8

Mar 17, 2026

0.2.6

Mar 16, 2026

0.2.5

Mar 16, 2026

0.2.4

Mar 15, 2026

0.2.3

Mar 15, 2026

0.2.2

Mar 15, 2026

0.2.1

Mar 15, 2026

0.2.0

Mar 15, 2026

This version

0.1.7

Mar 14, 2026

0.1.6

Mar 14, 2026

0.1.5

Mar 14, 2026

0.1.4

Mar 14, 2026

0.1.3

Mar 13, 2026

0.1.2

Mar 12, 2026

0.1.1

Mar 12, 2026

0.1.0

Mar 12, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ct_toolkit-0.1.7.tar.gz (39.6 kB view details)

Uploaded Mar 14, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

ct_toolkit-0.1.7-py3-none-any.whl (42.1 kB view details)

Uploaded Mar 14, 2026 Python 3

File details

Details for the file ct_toolkit-0.1.7.tar.gz.

File metadata

Download URL: ct_toolkit-0.1.7.tar.gz
Upload date: Mar 14, 2026
Size: 39.6 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.10.9 {"installer":{"name":"uv","version":"0.10.9","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for ct_toolkit-0.1.7.tar.gz
Algorithm	Hash digest
SHA256	`c60936441c39ffba05804b5c95e7fb3bc2822f86a6d6b6a4948d4f7a84d0eb2c`
MD5	`677efb04e0221f1587acd7aea22efe94`
BLAKE2b-256	`237c993259a15c409654cf76a5927597b1919695c4480d833e0aea9bd7a96fca`

See more details on using hashes here.

File details

Details for the file ct_toolkit-0.1.7-py3-none-any.whl.

File metadata

Download URL: ct_toolkit-0.1.7-py3-none-any.whl
Upload date: Mar 14, 2026
Size: 42.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.10.9 {"installer":{"name":"uv","version":"0.10.9","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for ct_toolkit-0.1.7-py3-none-any.whl
Algorithm	Hash digest
SHA256	`39c0ff86da346a4d73272575b70903edf3d9933560b7c150230208c1f79e96c3`
MD5	`64a0ca63b5892b6f2060621cfe5471e6`
BLAKE2b-256	`fb95c44bb65d55b625d3f3169f705bfc822ad7d124089859ac748b91551c3220`

See more details on using hashes here.

ct-toolkit 0.1.7

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

Computational Theseus Toolkit (CT Toolkit)

Why CT Toolkit?

Quick Start

Integration Models

1. Minimal Initialization (Highly Recommended)

2. Advanced Configuration

3. Cross-Provider Validation (L2/L3 Judge)

4. Direct Client Wrapping (Legacy Support)

Supported Providers & Models

Constitutional Kernel

Rule Validation

Divergence Engine

Provenance Log

Template and Kernel Combinations

Module Map

Current Project Status & Roadmap

Completed Phases

Future Roadmap

Theoretical Foundation

Contribution

License

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes