Skip to main content

Prism-inspired multi-agent orchestration framework built on LangGraph. Security-first, provider-agnostic, composable.

Project description

prismal

PyPI version Python 3.13+ License: MIT

Prismal AI Agent Framework — the core engine powering multi-agent orchestration, security guardrails, RAG, MCP integration, and observability.

This package is the agent framework layer extracted from the larger monorepo as a standalone, publishable PyPI package. It provides everything needed to build and run AI agents without the web server, dashboard, or CLI. It was published as lightagent-agents through v2.x and rebranded to prismal in v3.0.0 (distribution name plus the lightagent.*prismal.* import namespace). End-user backward compatibility is provided by the deprecated lightagent-agents distribution, which now depends on prismal. The sibling lightagent app package historically shared this import namespace and is rebranded/coordinated in tandem.


Features

  • 26 specialized AI agents built on LangGraph — coder, researcher, planner, critic, data_analyst, rag_agent, codeact_agent, cua_agent, and more
  • SUPERVISOR state machine — central supervisor routes each turn to the right specialist, then back to END
  • Security-first (5-layer defense)InputSanitizerGuardrailsEngine (+ NeMo Guardrails L3) → ActionInterceptorAuditLogger (hash-chained) + SecurePromptBuilder + PermissionManager
  • Provider-agnostic — Anthropic Claude, OpenAI GPT, Google Gemini, Ollama via LiteLLM (isolated in prismal/providers/)
  • 7 RAG engines — standard + CRAG, HyDE, RAG-Fusion (RRF), Hybrid (BM25 + semantic), Self-RAG, Parent-Child hierarchical, Multi-Vector, and Adaptive facade
  • 7 agent reasoning patterns — Tree of Thoughts, Debate, Constitutional AI, LATS (MCTS), LLM-Compiler (parallel DAG), Mixture of Agents, Swarm/Handoff
  • 5 domain subgraph pipelines — Customer Service, Document Generation, Data ETL, Code Review, Debate/Consensus — on top of the existing dev/ml/financial pipelines
  • Multimodal layer (implemented, opt-in) — Vision / Audio / Video agents, modality router, multimodal fusion, multimodal subgraph, multimodal RAG engine with cross-modal embeddings, and MediaValidator security gate — gated by settings.multimodal_enabled (default False); see specs/multimodal-agents/
  • Extension surface (implemented, opt-in)prismal.langgraph re-export, @prismal_node decorator (security/OTel/audit/retry middleware), PrismalStateGraphBuilder fluent API, plugin discovery via importlib.metadata entry points, LangChainRunnableAdapter, and formal Protocols for ports (checkpoint/audit/embeddings/tools) — see docs/extension.md and specs/extension-surface/
  • MCP client with capability routingModel Context Protocol with auto-discovery and per-agent capability-based tool filtering (config/mcp_servers.yaml)
  • Process isolationSandboxExecutor with docker/podman/nsjail/bwrap/firejail backends
  • Human-in-the-Loophitl_gate() with LangGraph interrupt() support
  • Composable primitivesreflection_loop() (generate → critique → refine) and make_parallel_dispatcher() (fan-out via Send())
  • Cron engine — APScheduler + timezone-aware DateTimeService (single time source of truth)
  • Long-term memory — PII-sanitized cross-session store (SQLite + ChromaDB; optional MongoDB)
  • Observability — Langfuse traces, OpenTelemetry spans, structlog
  • Deterministic intent routing — regex-based match_intent() ahead of LLM supervision
  • Tool provider injection (implemented)ToolProviderPort hexagonal port: the host composes MCP/Skills/stub providers and injects them (set_tool_provider or per-session via graph config); the agent core no longer imports prismal.mcp/prismal.skills — see docs/tool-providers.md and specs/tool-provider-injection/
  • 120-tool global cap enforced by the official CompositeToolProvider (legacy constants kept in tool_registry.py)
  • Graph visualizationto_mermaid() / visualize() / save_graph_image() (from prismal.langgraph) render any compiled graph or SubgraphDefinition; SubgraphDefinition.to_mermaid() and visualize_supervisor_graph() are one-line shortcuts (see examples/visualize_graphs.py)

Installation

pip install prismal
# or with uv:
uv pip install prismal

Optional extras

pip install "prismal[postgres]"          # PostgreSQL checkpointing
pip install "prismal[mongodb]"           # MongoDB long-term memory
pip install "prismal[ollama]"            # Local LLMs via Ollama
pip install "prismal[local-embeddings]"  # HuggingFace embeddings
pip install "prismal[ml]"                # ML/AutoML pipeline
pip install "prismal[ml-dl]"             # ML + PyTorch Lightning
pip install "prismal[finance]"           # yfinance + pandas-ta
pip install "prismal[analytics]"         # matplotlib + plotly
pip install "prismal[datetime]"          # tzdata + NTP
pip install "prismal[maintenance]"       # pip-audit
pip install "prismal[multimodal]"         # Pillow + ffmpeg-python + imagehash (Fase F)
pip install "prismal[multimodal-local]"   # faster-whisper (local STT)
pip install "prismal[multimodal-premium]" # elevenlabs TTS
pip install "prismal[multimodal-embed]"   # open-clip-torch (CLIP cross-modal embeddings)
pip install "prismal[all]"                # Everything above

Quick Start

from prismal.agents.graph import get_async_compiled_graph
from prismal.agents.state import create_initial_state
from prismal.core.config import get_settings

async def main():
    settings = get_settings()
    graph = await get_async_compiled_graph()   # async contexts MUST use this

    state = create_initial_state(
        session_id="my-session",
        user_message="Analyse the sales data in data/sales.csv",
    )

    result = await graph.ainvoke(
        state,
        config={"configurable": {"thread_id": "my-session"}},
    )
    print(result["messages"][-1].content)

A synchronous get_compiled_graph() entry point is also available for non-async callers.


Advanced architectures

The package ships 19 composable architectures under specs/advanced-architectures/ (Fases A/B/C, ≥82% coverage per module, 0 bandit issues). Every component follows a callable-injection pattern — business logic accepts generate_fn, evaluate_fn, reward_fn, plan_fn, tool_executor, … so tests run without LLM backends. Defaults wire ProviderRegistry().get_llm() lazily.

RAG engines (prismal/rag/)

Engine Module Purpose
HyDE hyde.py Generates a hypothetical answer and searches by its embedding (recall boost on abstract queries)
RAG-Fusion fusion.py N query variants + reciprocal_rank_fusion() (RRF, k=60) over parallel searches
Hybrid Search hybrid.py BM25 (rank-bm25) + semantic linear fusion with configurable alpha
Self-RAG self_rag.py LLM decides whether to retrieve (RETRIEVE/NO_RETRIEVE) and self-assesses support (SUPPORTED/PARTIALLY_SUPPORTED/UNSUPPORTED) + utility score
Parent-Child hierarchical.py Indexes small child chunks (~100 tok) for precision but returns parent context (~500 tok) to the LLM
Multi-Vector multi_vector.py Indexes each chunk plus a summary and N hypothetical questions per chunk
Adaptive RAG adaptive.py Facade that classifies queries (FACTUAL_SIMPLE / ABSTRACT / AMBIGUOUS / MULTI_HOP / TECHNICAL / CONVERSATIONAL) and routes to the engine above

Agent reasoning patterns (prismal/agents/patterns/)

Pattern Module Purpose
Tree of Thoughts tree_of_thoughts.py Explores a tree of candidate thoughts with BFS / DFS / beam search
Debate debate.py N-agent multi-round debate with moderator / majority-vote / weighted synthesis and Jaccard agreement score
Constitutional AI constitutional.py Principle-driven self-critique + revision loop with audit log (3 default principles: no_harmful_content, factual_accuracy, no_pii_exposure)
LATS lats.py Monte Carlo Tree Search (UCB1) over the action space — real backtracking when a branch fails
LLM-Compiler llm_compiler.py Compiles a DAG of tasks, validates with Kahn topological sort, executes independent tasks in parallel waves
Mixture of Agents mixture_of_agents.py Parallel proposers across multiple providers + aggregator synthesis layers
Swarm/Handoff swarm.py Decentralised agent-to-agent handoff with HandoffRecord audit trail and allow-list validation

Domain subgraph pipelines (prismal/agents/subgraphs/)

Pipeline Directory Flow
Customer Service customer_service/ classifier → faq_retrieval → escalation_gate → response | ticket_creator
Document Generation document_generation/ planner → researcher → writer → editor → formatter (markdown/plain/html)
Data ETL data_etl/ extractor → validator → (conditional gate) → transformer → loader → auditor
Code Review code_review/ linter → security_scanner → logic_reviewer → suggester → report_generator
Debate/Consensus debate_consensus/ proponent → opponent → moderator → consensus

Each subgraph exports both build_<name>_subgraph() (returns a SubgraphDefinition) and an idempotent register_<name>() mirroring the existing register_ml_pipeline. Wiring into the top-level supervisor is opt-in operational work — the primitives are ready to register.

MCP capability routing

config/mcp_servers.yaml declares each server's capabilities: list[str]. MCPClientManager.get_all_langchain_tools(capabilities=…) and get_tools_for_agent(agent, required_capabilities=…) filter the tool pool per agent. Servers tagged general are always included; omitting capabilities from a YAML entry defaults to ["general"] for backward compatibility. The capability set is extended in Fase F to include vision, audio, and video.

See specs/advanced-architectures/SPEC.md for the full interface contracts of Fases A/B/C/D/E.

Multimodal layer (Fase F — implemented, opt-in)

The multimodal expansion described in specs/multimodal-agents/ adds voice, image, and video to the existing text-only stack without modifying any existing agent. It is opt-in: gated by settings.multimodal_enabled (default False) and registered via register_multimodal_pipeline(registry) when the operator is ready.

Provider wrappers (prismal/providers/)

Wrapper Module Backends
STT stt.py OpenAI Whisper API, local (openai-whisper / faster-whisper)
TTS tts.py pyttsx3 (offline default), OpenAI, ElevenLabs — automatic cascade fallback
Vision LLM vision.py Any LiteLLM model with vision (Claude, GPT-4o, Gemini)
Multimodal LLM multimodal.py Gemini 2.x, GPT-4o, Sonnet 4.6 (native multimodal)
Cross-modal embeddings cross_modal_embeddings.py CLIP / open_clip_torch (opt-in extra)

Modal agents (prismal/agents/multimodal/)

Agent Module Purpose
VisionAgent vision_agent.py General-purpose image analysis: description, object detection, optional OCR
AudioAgent audio_agent.py Voice-to-voice pipeline: STT → LLM reasoning → optional TTS
VideoAgent video_agent.py FFmpeg frame extraction (via SandboxExecutor) + audio transcript + fusion summary
ModalityRouter modality_router.py Heuristic classifier (MIME + regex) with optional LLM fallback
MultimodalFusion multimodal_fusion.py Combines outputs from modal agents using moa, moderator, or concat strategies (reuses mixture_of_agents.py)

Multimodal subgraph (prismal/agents/subgraphs/multimodal_pipeline/)

router_node → [vision_node | audio_node | video_node | text passthrough] → fusion_node → output_formatter_node

Exports build_multimodal_subgraph() (returns SubgraphDefinition) and an idempotent register_multimodal_pipeline() matching the existing register_ml_pipeline pattern.

Multimodal RAG (prismal/rag/)

MultimodalRAGEngine indexes text + image captions + audio/video transcripts and exposes search(query, modalities=[...]) with metadata-based modality filtering. Without the [multimodal-embed] extra it falls back to textual captions; with it, vectors come from CLIP-style cross-modal embeddings. New loaders: loaders/image_loader.py, loaders/audio_loader.py, loaders/video_loader.py.

Security (prismal/security/)

MediaValidator enforces magic-byte verification + size/duration limits before any media reaches an agent. InputSanitizer.sanitize_media() strips EXIF; AuditLogger.log_media() records SHA-256 + modality (never content); ActionInterceptor.check_media_op() gates filesystem media operations; FFmpeg always runs inside SandboxExecutor.

See specs/multimodal-agents/SPEC.md for the full interface contracts of Fase F.

Extension surface (Fase X — implemented, opt-in)

The extension surface (user guide: docs/extension.md; contracts: specs/extension-surface/) exposes LangGraph as a first-class build target for users and third-party plugins, so you can write new patterns without forking prismal. All public symbols import from prismal.agents.extension. Five components:

prismal.langgraph — official re-export

from prismal.langgraph import StateGraph, START, END, Send, interrupt, add_messages, AgentState, VERSION

graph = StateGraph(AgentState)
graph.add_node("my_node", my_node)
graph.add_edge(START, "my_node")
graph.add_edge("my_node", END)
compiled = graph.compile()

Importing from prismal.langgraph (rather than langgraph.* directly) guarantees the LangGraph version prismal was tested against, exposed as VERSION.

@prismal_node decorator

from prismal.agents.extension import prismal_node

@prismal_node(name="my_classifier", capabilities=["general"], security="standard", audit=True)
async def my_classifier(state):
    last = state["messages"][-1].content
    label = await classify(last)
    return {"metadata": {"my_classifier": {"label": label}}}

Wraps any async (state) → state_update with a middleware chain: InputSanitizer + SecurePromptBuilder + ActionInterceptor → OTel span → structured logger bind → retry/backoff → timeout → user function → audit log → error mapping. Side effect: registers the node's capabilities in tool_registry.DEFAULT_CAPABILITY_MAP.

PrismalStateGraphBuilder — fluent API

from prismal.agents.extension import PrismalStateGraphBuilder

builder = PrismalStateGraphBuilder("my_pipeline")
builder.add_node("classify", classify_fn)        # auto-wraps with @prismal_node if missing
builder.add_node("respond", respond_fn)
builder.add_edge("classify", "respond")
builder.set_entry_point("classify")
subgraph = builder.compile()                      # returns SubgraphDefinition

Plugin discovery via entry points

# prismal-x-healthcare/pyproject.toml
[project.entry-points."prismal.subgraphs"]
healthcare_triage = "prismal_x_healthcare:register_healthcare_pipeline"

After pip install prismal-x-healthcare, discover_plugins() auto-registers the subgraph. Allowlist/denylist via settings.plugins_allowlist / plugins_denylist. CLI: python -m prismal.plugins list | info <name> | doctor. Each plugin loads in isolation — individual failures do not abort startup.

LangChainRunnableAdapter — bridge for existing LangChain code

from prismal.agents.extension import LangChainRunnableAdapter

adapter = LangChainRunnableAdapter(my_agent_executor)
node = adapter.as_node(name="legacy_research", capabilities=["research"])

Automatically maps state["messages"] ↔ the Runnable's input/output. Supports Runnable, RunnableSequence, RunnableLambda, AgentExecutor.

Formal ports (hexagonal)

prismal/agents/extension/ports.py declares CheckpointPort, AuditPort, EmbeddingsPort, ToolPort, ToolProviderPort as Protocols. Existing implementations (AsyncSqliteSaver, AuditLogger, ChromaDB embeddings, BaseTool, the Fase Y tool providers) conform structurally; users substitute their own (Redis checkpointer, Splunk audit, custom tool source, etc.) without modifying the core.

See specs/extension-surface/SPEC.md for the full interface contracts of Fase X.

Tool provider injection (Fase Y — implemented)

Tool resolution is a hexagonal port (user guide: docs/tool-providers.md; contracts: specs/tool-provider-injection/): the agent core asks an injected ToolProviderPort for tools and never imports prismal.mcp / prismal.skills (enforced by an architecture test). The host composes the providers and injects them at startup:

from prismal.agents.extension import build_default_tool_provider
from prismal.agents.tool_registry import set_tool_provider

async def on_startup() -> None:                       # FastAPI lifespan or equivalent
    set_tool_provider(await build_default_tool_provider())   # MCP + Skills + stubs
  • Providers (prismal.agents.extension): McpToolProvider, SkillToolProvider, StubToolProvider, CompositeToolProvider (merge with MCP→Skills→stubs priority, name dedupe, 60/120 tool caps, fixed-tool-agent exemption — exact parity with the historical registry) and FakeToolProvider for tests.
  • Variante A (global)set_tool_provider() once per process; nodes keep calling get_tools_for_agent(name) unchanged.
  • Variante B (multi-tenant)get_async_compiled_graph(tool_provider=provider) with tool_provider_mode="context" binds a per-session provider; nodes resolve via get_tools_for_agent_ctx(name, config). No shared global state.
  • No provider? The registry degrades to static stubs with a warning (tool_provider_strict=True raises ToolProviderNotConfigured instead).
  • Legacy shimsinit_mcp() / get_mcp_tools() / get_skill_tools() still work, emit DeprecationWarning, and will be removed in the next minor.

Runnable examples: examples/tool_provider_host.py, examples/tool_provider_custom.py.


Roadmap — features por desarrollar

Ya implementadas: extension surface (Fase X), advanced architectures (Fase A/B/C), multimodal (Fase F), y la remediación de dependencias (18/18 alertas en estado terminal).

Lo que falta, ordenado de implementación rápida y necesaria → compleja y menos necesaria. Cada feature tiene su contrato SDD en specs/. Estado: spec ready = listo para construir (PLAN/ARCHITECTURE/SPEC/TASKS); PRD seed = solo PRD, requiere expandir antes de construir.

  1. Terminar Tool Provider Injection (Fase Y)rápido · necesario · en cursospecs/tool-provider-injection/. El código de Y1–Y5 ya aterrizó; falta cerrar Y6–Y8 (settings/observabilidad, docs/ejemplos, tests de paridad) y marcar el spec IMPLEMENTED.
  2. Vector Store Port (Fase Z)moderado · necesario · spec readyspecs/vector-store-port/. Quita el lock-in de ChromaDB tras un VectorStorePort con adaptadores (Chroma default + LanceDB, sqlite-vec, Qdrant, pgvector). Reduce superficie de seguridad y abre backends embebidos.
  3. Runtime Composition Root (Fase R)moderado · necesario · spec readyspecs/composition-root/. build_runtime() compone e inyecta todos los puertos (tools, vector store, embeddings, checkpoint, audit) en una llamada; desbloquea prismal-server / prismal-dashboard. Depende de Fase Y + Z.
  4. Cost & Budget Governancerápido-moderado · útil · PRD seedspecs/cost-budget-governance/. Presupuesto por run/sesión/tenant + circuit-breakers de coste/tokens/llamadas en react_loop y los patrones caros (debate, ToT, LATS, MoA). Seguro barato contra gasto runaway.
  5. A2A / Agent Cards interop (Fase I)complejo · necesario (ecosistema) · spec readyspecs/a2a-interop/. Interop agente-a-agente bidireccional: exponer prismal como agente A2A (Agent Card en /.well-known/agent-card.json, JSON-RPC + SSE) y consumir agentes remotos como nodos/tools. Complementa MCP; cierra la brecha frente a MS Agent Framework / Google ADK.
  6. Agent Identity & Access Governancecomplejo · necesario (enterprise) · PRD seedspecs/agent-identity-governance/. Identidad por agente (W3C DID), credenciales acotadas, OAuth-on-behalf y un PolicyEngine. Es la base de confianza que consume A2A; production-blocker enterprise.
  7. Agent Evaluation & Reliability Harnessmoderado-complejo · útil (fiabilidad) · PRD seedspecs/agent-eval-harness/. Eval a nivel de sistema del grafo (trayectorias, tool-usage, groundedness RAG), regresión con gate en CI y suite adversaria. Cierra el "scaffold gap".
  8. Pulido (sin spec aún)variable · menos urgente — UI de observabilidad de primera parte (o integración profunda LangSmith/Langfuse) y type-safety por nodo (validación Pydantic del I/O de nodos; evolución de AgentState).

¿Framework o host? (dónde vive cada feature)

Regla: contrato/lógica → framework (prismal/); servir HTTP, autenticar, mostrar, persistir config → host (prismal-server / prismal-dashboard). Por eso A2A e Identity quedan partidos.

# Feature Framework (prismal/) Host (prismal-server / dashboard)
1 Tool Provider (Fase Y) ✅ ports/providers (agents/extension) compone e inyecta al arranque
2 Vector Store Port (Fase Z) rag/stores/ + VectorStorePort elige backend por config
3 Composition Root (Fase R) composition.py / build_runtime() lo llama en el lifespan
4 Cost & Budget Governance ✅ guard en react_loop + patrones cuotas por tenant
5 A2A / Agent Cards (Fase I) ✅ tipos · card · client · A2AToolProvider · handler endpoint HTTP (/a2a, /.well-known/agent-card.json) + auth
6 Agent Identity & Governance PolicyEngine + puerto de identidad (security/) IdP/OAuth + bóveda de credenciales + emisión/rotación de DID
7 Agent Eval Harness motor de eval (módulo) corre como herramienta dev/CI (o paquete aparte)
8 Pulido type-safety por nodo (AgentState) observabilidad UI

El framework define puertos y lógica; el host los compone y expone. Detalle en docs/competitive-analysis.md.

Análisis completo y comparativa con frameworks 2026 en docs/competitive-analysis.md.


Development

Python 3.13+ is required. uv is the recommended package manager.

# Install with dev tools
uv pip install -e ".[dev]"
# or with dev + extras:
uv pip install -e ".[dev,all]"

# Run the test suite (pytest-asyncio auto-mode, filterwarnings="error")
uv run pytest                                      # full suite
uv run pytest tests/unit                           # one tier
uv run pytest -m unit                              # by marker (unit|integration|security|slow|live_api)
uv run pytest tests/unit/security/test_sanitizer.py::TestSanitizer::test_strip_controls  # single test
uv run pytest -n auto                              # parallel (pytest-xdist)
uv run pytest --cov=prismal --cov-report=term-missing   # coverage (fail_under = 80)

# Lint + format (ruff, line-length=100, target py313)
uv run ruff check .
uv run ruff check --fix .
uv run ruff format .

# Strict type-check (mypy strict mode, namespace_packages=true)
uv run mypy prismal

# Security linting
uv run bandit -r prismal -c pyproject.toml

# Build the distribution
uv run python -m build

live_api tests call real LLM APIs and require provider keys; skip them locally with -m "not live_api". Integration tests under tests/integration/ expect running services (sandbox backends, databases).


Architecture

The core is a LangGraph StateGraph[AgentState] assembled in prismal/agents/graph.py following the SUPERVISOR pattern: a central supervisor_node routes each turn to one of 26 specialist agent nodes, each of which returns control to the supervisor; the supervisor routes to END when the task is complete. Checkpointing is handled by AsyncSqliteSaver (or PostgreSQL via the [postgres] extra).

prismal/                ← PEP 420 namespace package (NO __init__.py at root)
├── agents/                ← LangGraph state machine + 26 agent nodes
│   ├── graph.py           ← get_compiled_graph() / get_async_compiled_graph()
│   ├── supervisor.py      ← Central router
│   ├── state.py           ← AgentState (TypedDict; messages uses add_messages reducer)
│   ├── intent_router.py   ← Deterministic regex routing
│   ├── tool_registry.py   ← stable facade: delegates to the injected ToolProviderPort (Fase Y)
│   ├── patterns/
│   │   ├── reflection.py           ← reflection_loop()
│   │   ├── parallel.py             ← make_parallel_dispatcher() via Send()
│   │   ├── tree_of_thoughts.py     ← ToT with BFS/DFS/beam
│   │   ├── debate.py               ← N-agent multi-round debate + Jaccard
│   │   ├── constitutional.py       ← principle-driven self-revision + audit
│   │   ├── lats.py                 ← MCTS with UCB1
│   │   ├── llm_compiler.py         ← DAG compilation + Kahn validation + parallel waves
│   │   ├── mixture_of_agents.py    ← multi-provider proposers + aggregator
│   │   └── swarm.py                ← decentralised handoff with audit
│   ├── multimodal/                  ← (Fase F) vision / audio / video agents + router + fusion
│   │   ├── vision_agent.py
│   │   ├── audio_agent.py
│   │   ├── video_agent.py
│   │   ├── modality_router.py
│   │   └── multimodal_fusion.py
│   └── subgraphs/
│       ├── factory.py              ← SubgraphFactory
│       ├── registry.py             ← SubgraphRegistry
│       ├── gates.py                ← hitl_gate() with interrupt()
│       ├── dev_pipeline/           ← PO → Architect → Developer → Tests → QA → Reviewer
│       ├── ml_pipeline/            ← Ingester → EDA → Features → Trainer → Evaluator → Exporter
│       ├── financial/              ← Collector → Technical → Fundamental → Risk → Report
│       ├── customer_service/       ← classifier → faq_retrieval → gate → response | ticket
│       ├── document_generation/    ← planner → researcher → writer → editor → formatter
│       ├── data_etl/               ← extractor → validator → gate → transformer → loader → auditor
│       ├── code_review/            ← linter → security_scanner → logic_reviewer → suggester → report
│       ├── debate_consensus/       ← proponent → opponent → moderator → consensus
│       ├── multimodal_pipeline/    ← (Fase F) router → vision|audio|video → fusion → output_formatter
│       ├── analysis_orchestrator/
│       ├── engineering_orchestrator/
│       └── research_orchestrator/
├── core/                  ← Pydantic Settings, logging, exceptions, DB, user model
├── providers/             ← LiteLLM wrapper (ONLY location for provider-specific imports;
│                             Fase F adds stt/tts/vision/multimodal/cross_modal_embeddings)
├── memory/                ← Short-term history + long-term PII-sanitized store
├── mcp/                   ← MCP client, adapter, connection manager, capability routing
├── security/              ← 5-layer defense-in-depth (see below) + (Fase F) media_validator.py
├── rag/                   ← 7 retrieval engines:
│   ├── engine.py          ← standard RAGEngine
│   ├── crag.py            ← CRAG pipeline
│   ├── hyde.py            ← Hypothetical Document Embeddings
│   ├── fusion.py          ← RAG-Fusion (RRF)
│   ├── hybrid.py          ← BM25 + semantic hybrid search
│   ├── self_rag.py        ← Self-RAG (conditional retrieval + self-assessment)
│   ├── hierarchical.py    ← Parent-Child chunking
│   ├── multi_vector.py    ← chunk + summary + N hypothetical questions
│   ├── adaptive.py        ← facade routing by query type
│   ├── federated.py       ← federated search
│   ├── multimodal.py      ← (Fase F) MultimodalRAGEngine — text + image captions + audio/video transcripts
│   ├── loaders/           ← (Fase F) document/image/audio/video loaders
│   └── vector_store.py    ← ChromaDB vector store (extended with modality metadata in Fase F)
├── skills/                ← available/ (source) · active/ (gitignored) · custom/ (gitignored)
├── scheduler/             ← APScheduler CronExecutor, DateTimeService, Prefect flows
├── monitoring/            ← Langfuse, OpenTelemetry, structlog
├── data/                  ← DuckDB + Polars utilities
├── sandbox/               ← SandboxExecutor process isolation
├── utils/                 ← Shared utilities
└── events/                ← Event bus

Namespace package

prismal/ has no __init__.py — it is a PEP 420 implicit namespace package (renamed from lightagent/ in v3.0.0). Both prismal and the sibling lightagent app package contribute modules into the same prismal.* namespace. Do not add prismal/__init__.py; it would break the sibling package.

Security stack (5 layers)

Layer Component Purpose
L1 InputSanitizer Strip control chars, normalize unicode, enforce MAX_INPUT_LENGTH
L2 GuardrailsEngine Regex pattern matching + risk scoring
L3 nemo_rails.py NVIDIA NeMo Guardrails integration
L4 ActionInterceptor LangChain callback, pre-tool permission checks
L5 AuditLogger Append-only JSONL audit log with xxhash chaining
Support SecurePromptBuilder User-input isolation with canary tokens
Support PermissionManager TTL-based SQLite permission grants
Support filesystem_guard.py Path confinement via resolve().is_relative_to()

Critical rules

  1. Never concatenate user input into prompts — use SecurePromptBuilder. This applies to STT transcripts, OCR text, and image captions as well — they are user-controlled content.
  2. Never bypass GuardrailsEngine / ActionInterceptor.
  3. Always use get_async_compiled_graph() in async contexts (the sync variant wires a non-async SQLite saver).
  4. Never add provider-specific imports (anthropic, openai, google.generativeai, ollama, whisper, pyttsx3, elevenlabs, open_clip_torch, …) outside prismal/providers/.
  5. Always call ActionInterceptor.check() before tool calls that write files or execute code; call ActionInterceptor.check_media_op() before media filesystem operations (Fase F).
  6. Always validate incoming media with MediaValidator.validate() before passing to a multimodal agent (Fase F); FFmpeg always runs inside SandboxExecutor.
  7. Never add __init__.py to prismal/ — it must remain a PEP 420 namespace package.

See CLAUDE.md for the full working guide (commands, testing notes, architectural context for contributors and AI assistants).


Versioning

This package follows Semantic Versioning. Tag format for releases: prismal/vMAJOR.MINOR.PATCH

git tag prismal/v3.0.0
git push --tags

See CHANGELOG.md for release history.


Releasing (maintainers)

Run only after the full suite, linters and type/security checks are green.

# 0) Verify on the release branch
cd prismal && git switch main

# 1) Quality gates (must all pass)
uv pip install -e ".[dev,all]"
uv run pytest -m "not live_api"
uv run ruff check . && uv run mypy prismal && uv run bandit -r prismal -c pyproject.toml

# 2) Build + validate the prismal distribution
rm -rf dist/ && python -m build && twine check dist/*
twine upload --repository testpypi dist/*          # validate on TestPyPI first

# 3) Push history and publish prismal
git push origin main
twine upload dist/*                                 # publish to PyPI
git tag prismal/v3.0.0 && git push --tags           # tag format: prismal/vMAJOR.MINOR.PATCH

# 4) Publish the deprecated compatibility bridge (lightagent-agents -> prismal)
cd compat/lightagent-agents
rm -rf dist/ && python -m build && twine check dist/*
twine upload dist/*                                 # publishes lightagent-agents 2.9.0

Post-release follow-ups: configure DNS/site for prismal.dev, coordinate the sibling lightagent app package (the namespace rename breaks the shared PEP 420 namespace), and regenerate the branded binary assets (PDF/PPTX/HTML).


License

MIT © Ernesto Crespo

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

prismal_ai-3.0.1.tar.gz (1.3 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

prismal_ai-3.0.1-py3-none-any.whl (680.3 kB view details)

Uploaded Python 3

File details

Details for the file prismal_ai-3.0.1.tar.gz.

File metadata

  • Download URL: prismal_ai-3.0.1.tar.gz
  • Upload date:
  • Size: 1.3 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.13

File hashes

Hashes for prismal_ai-3.0.1.tar.gz
Algorithm Hash digest
SHA256 83f4556e566ded20c794364bd68aab93f84b89685dd97696bbf10b82873958f5
MD5 46675fe22f786917bba7376299c21767
BLAKE2b-256 16b3efe99866add6b09322187e0a4114bc895daae674f6dbd90e44ae9acf3328

See more details on using hashes here.

File details

Details for the file prismal_ai-3.0.1-py3-none-any.whl.

File metadata

  • Download URL: prismal_ai-3.0.1-py3-none-any.whl
  • Upload date:
  • Size: 680.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.13

File hashes

Hashes for prismal_ai-3.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 da4070f88d760e92baeb779a97007591bed518cd76ba1cc02510b8624b86da5b
MD5 0e615d5c9698ed2c697f8202296af977
BLAKE2b-256 ff2ec834fc39dd3ba8f448608894191cae12404847802ed715c4ad9b88f6f372

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page