15 projects
inspect-ai
Framework for large language model evaluations
inspect-evals
Collection of large language model evaluations
inspect-scout
Transcript Analysis for AI Agents
inspect-swe
Software engineering agents for Inspect AI.
inspect-harbor
Inspect AI interface to Harbor tasks
inspect-flow
Inspect Flow is a workflow stack built on Inspect AI that enables research organizations to run AI evaluations at scale
petri-bloom
Framework for generating behavioral evaluations of frontier AI models.
petri-dish
Run Petri alignment audits against real agent scaffolds (Claude Code, Codex CLI, Gemini CLI) via ACP.
inspect-k8s-sandbox
A Kubernetes Sandbox Environment for Inspect
inspect-petri
An auditing agent that enables automated monitoring and interaction with language models to detect potential alignment issues, reward hacking, and other concerning behaviors.
inspect-sandboxes
Collection of sandboxes for Inspect AI
inspect-viz
Data visualization for Inspect AI large language model evalutions.
condorai
AI Analysis
inspect-tool-support
Sandbox container tool code for inspect_ai
quarto
Python Interface to 'Quarto' Markdown Publishing System