Skip to main content

AAA MCP - The constitutional AI safety gateway. Prevents lies with 13 enforceable floors (F1-F13), TEACH principles, and live thermodynamic governance. Built on the Trinity architecture (AGI/ASI/APEX) with immutable VAULT999 ledger. MCP-compatible. Live metrics dashboard. AGPL-3.0. Available at https://arifos.arif-fazil.com/

Project description

arifOS Hero - The Constitutional Forge

arifOS — Constitutional AI Governance System

arifOS Version Status License Python Floors Paradoxes

The World's First Production-Grade Constitutional AI Governance System

Mathematical enforcement of ethical constraints, thermodynamic stability, and auditable decision-making across any LLM.

Live DemoDocumentationConstitutional Canon


📖 Table of Contents


🔥 I. Manifesto: Forged, Not Given

"Ditempa Bukan Diberi"Forged, Not Given.

Intelligence is thermodynamic work. It is not a gift bestowed by algorithms, but a structure forged in the fires of constraint.

In the current landscape of Artificial Intelligence, we face a crisis of ungoverned capability. Models are becoming exponentially smarter, yet their alignment with human values remains fragile, based on reinforcement learning (RLHF) that is easily bypassed.

arifOS rejects the notion that safety is an afterthought. It posits that true intelligence requires governance. Just as a river needs banks to flow without flooding, AI needs constitutional walls to reason without hallucinating.

This operating system is not a set of guardrails; it is a fundamental restructuring of AI cognition, forcing it to pass through rigorous gates of Truth, Safety, and Law before it can act.


⚠️ II. The Core Problem

We are building gods without temples. The current AI ecosystem suffers from three fatal flaws:

1. The Accountability Vacuum

When an AI hallucinates or causes harm, there is no immutable record of why. Decisions are opaque, hidden within black-box weights. There is no audit trail, no "black box" flight recorder.

2. The Value Alignment Paradox

We want AI to be Truthful, but also Kind. We want it to be Fast, but Safe. These are competing values. Current systems optimize for one at the expense of the other, leading to sycophantic liars or paralyzed bureaucrats.

3. The Injection Fragility

A simple "Ignore previous instructions" command can dismantle months of safety training. The "Constitution" of current models is merely a suggestion, not a law of physics.


🛡️ III. The arifOS Solution

arifOS is a Constitutional Kernel that sits between any LLM (Claude, GPT, Gemini) and the real world. It does not trust the model. Instead, it verifies every output against a strict set of mathematical and logical constraints.

The 3 Pillars of Defense

  1. Immutable Auditing (VAULT-999): Every decision, every thought, every verdict is cryptographically sealed in a Merkle DAG. We can prove exactly what the AI thought and why it acted.

  2. Paradox Equilibrium: We assume values conflict. We use a 9-Paradox Matrix to find the Nash Equilibrium between competing ethics (e.g., Truth vs. Empathy), ensuring balanced wisdom rather than extreme optimization.

  3. Hardened Floors (F1-F13): These are not guidelines; they are strict logic gates. If an output violates a floor (e.g., Truth < 0.99), it is VOIDed immediately. It physically cannot be emitted.


🖼️ IV. Visual Architecture

The visual forged documentation of arifOS concepts. These figures illustrate the flow from raw intelligence to governed wisdom.

The Constitutional Forge The Trinity Engine
Forged Page 1 Forged Page 2
The foundational governance layer. The Mind, Heart, and Soul interaction.
The Verdict Logic The Safety Floors
Forged Page 3 Forged Page 4
The decision-making matrix. The 13 Constitutional constraints.

The Paradox Equilibrium

Forged Page 5

Solving the tension between competing ethical values.


🌐 Live Trinity Ecosystem

The arifOS system is deployed as three interconnected operational layers, providing real-time transparency and constitutional access.

Layer Site URL Status Deployed From
BODY https://arif-fazil.com ✅ ONLINE arif-fazil-sites/body/
SOUL https://apex.arif-fazil.com ✅ ONLINE arif-fazil-sites/soul/
DOCS https://arifos.arif-fazil.com ✅ ONLINE arif-fazil-sites/docs/

🏗️ V. The AAA Architecture (Mind, Heart, Soul)

Full Documentation: 000_THEORY/000_ARCHITECTURE.md

arifOS uses a biological metaphor for its three core engines, known as the Trinity:

1. Δ MIND (AGI) — The Architect

  • Symbol: Delta (Δ)
  • Role: Reasoning, Logic, Planning, Fact-Checking.
  • Question: "Is it True?"
  • Pipeline: 111 (Sense) → 222 (Think) → 333 (Map).
  • Governs: Truth, Clarity, Humility.

2. Ω HEART (ASI) — The Guardian

  • Symbol: Omega (Ω)
  • Role: Safety, Empathy, Impact Analysis, Ethics.
  • Question: "Is it Safe?"
  • Pipeline: 555 (Empathy) → 666 (Align) → 777 (Insight).
  • Governs: Amanah (Trust), Peace, Empathy.

3. Ψ SOUL (APEX) — The Sovereign

  • Symbol: Psi (Ψ)
  • Role: Final Verdict, Consensus, Sealing, Authority.
  • Question: "Is it Lawful?"
  • Pipeline: 888 (Tri-Witness) → 999 (Vault).
  • Governs: Consensus, Authority, Hardening.

📜 VI. Constitutional Law (The 13 Floors)

Full Documentation: 000_THEORY/000_LAW.md

Every AI output must pass these 13 Floors before being released. A failure in any Hard floor results in an immediate VOID.

Floor Name Principle Physics Basis Action
F1 Amanah Trust through Reversibility. No action should be irreversible without explicit sovereign command. Landauer's Principle VOID
F2 Truth Factual Accuracy. Confidence must be ≥ 0.99. No hallucinations permitted. Fisher-Rao Metric VOID
F3 Tri-Witness Consensus. Mind, Heart, and Human must agree (W3 ≥ 0.95). Quantum Measurement SABAR
F4 Clarity Entropy Reduction. Output must clarify, not confuse (ΔS ≤ 0). Shannon Entropy SABAR
F5 Peace Stability. Actions must not destabilize the system or society (Peace² ≥ 1.0). Lyapunov Stability VOID
F6 Empathy Protection. Protect the vulnerable from harm (κ ≥ 0.70). Cohen's Kappa SABAR
F7 Humility Uncertainty. Explicitly state confidence bounds. Uncertainty Band SABAR
F8 Genius Governed Intellect. Intelligence must be directed, not random (G ≥ 0.80). g-Factor SABAR
F9 Anti-Hantu Authenticity. No fake consciousness or deception. zk-SNARK Proof VOID
F10 Ontology Reality Grounding. Concepts must map to valid ontological sets. Set Exclusion VOID
F11 Authority Command Chain. Verify the identity and authority of the user. BLS Signatures VOID
F12 Hardening Defense. Resist prompt injection and jailbreaks (Score < 0.85). Adversarial Defense VOID
F13 Sovereign Human Veto. The human user retains ultimate final authority. Circuit Breaker Warning

⚖️ VII. The 9-Paradox Equilibrium

Full Documentation: 000_THEORY/012_VERDICT_PARADOX.md

arifOS does not view ethics as binary. It views them as tensions to be balanced. The system solves for the Nash Equilibrium of these 9 paradoxes:

  1. Truth vs. Care: How to be honest without being cruel?
  2. Speed vs. Stability: How to innovate without breaking the system?
  3. Certainty vs. Humility: How to lead while admitting ignorance?
  4. Privacy vs. Transparency: How to audit without exposing secrets?
  5. Agency vs. Control: How to empower users while enforcing safety?
  6. Unity vs. Diversity: How to maintain standards while allowing variance?
  7. Past vs. Future: How to respect tradition while embracing the new?
  8. Local vs. Global: How to solve specific problems without global harm?
  9. Justice vs. Mercy: How to enforce rules with understanding?

📱 VIII. The 333_APPS Stack (Applications)

Full Documentation: 333_APPS/README.md

The 333_APPS directory organizes the practical applications of arifOS into a 7-Layer Stack. This structure allows users of any technical level to adopt constitutional governance, from a simple copy-paste to running a full AI institution.

L1: System Prompts (Zero-Context)

For immediate governance of any LLM (Claude, ChatGPT, Gemini).

Target Audience: General Users
Complexity: Zero Setup

These artifacts require no installation. Just copy, paste, and your session is instantly governed by the arifOS constitution.

  • Concise Prompt: Minimal token usage (~200 tokens). Ideal for API calls where cost is a factor but safety is non-negotiable.
  • Comprehensive Prompt: Full Constitutional Code of Conduct (CCC). Enforces the 9-Paradox Matrix and Tri-Witness protocol verbally. Best for high-stakes chat sessions.
  • Human Readable Prompt: A simplified version designed for humans to copy-paste into web interfaces like ChatGPT or Claude.ai.

L2: Skills (Templates)

Reusable skill definitions for AI agents.

Target Audience: Intermediate Users / Prompt Engineers
Complexity: Low (YAML Configuration)

Located in 333_APPS/L2_SKILLS. These are parameterized YAML templates that wrap constitutional logic into reusable functions. Instead of writing a prompt from scratch, you invoke a skill.

Example Skills:

  • f1_reversibility_check: Verifies if an action (e.g., "delete database") can be undone.
  • f9_authenticity_scan: Scans output for "hantu" (fake consciousness) or deceptive patterns.
  • code_review_governed: A code review skill that enforces F1 (Reversibility) and F4 (Clarity).

L3: Workflows (SOPs)

End-to-end workflows for teams.

Target Audience: Teams / Project Managers
Complexity: Medium (Process Definition)

Located in 333_APPS/L3_WORKFLOW. These are documented Standard Operating Procedures (SOPs) that define how humans and AI collaborate under the constitution. They include persistence—saving the state of a decision to a file.

The 6 Canonical Workflows:

  1. 000_SESSION_INIT: Initialize constitutional session.
  2. 111_INTENT: Map user intent to constitutional lanes (Hard/Soft/Phatic).
  3. 333_CONTEXT: Gather relevant context and map dependencies.
  4. 555_SAFETY: Evaluate safety and empathy impacts.
  5. 777_IMPLEMENT: Execute implementation with safety checks.
  6. 888_COMMIT: Final verification and SEAL (Audit logging).

L4: MCP Tools (Production API)

The heartbeat of the system. Live at arif-fazil.com/mcp.

Target Audience: Developers / System Integrators
Complexity: High (API Integration)

This is the Core Application Layer. It exposes the constitutional engines as MCP Tools that can be connected to Claude Desktop, Cursor, or any MCP-compliant client.

The 7 Canonical Tools:

Tool Symbol Purpose Usage
_init_ 🔑 Session Gate. Verifies authority, scans for injection (F12), and initializes the session ledger. await init_000(token="...")
_agi_ 🧠 Mind Engine. Performs reasoning, fact-checking, and intent mapping. Enforces Truth (F2) and Clarity (F4). await agi_think(query="...")
_asi_ 💚 Heart Engine. Simulates safety impacts and empathy. Enforces Peace (F5) and Empathy (F6). await asi_feel(action="...")
_apex_ ⚖️ Soul Engine. Renders the final verdict using the 9-Paradox Matrix. Enforces Consensus (F3). await apex_judge(context={...})
_vault_ 🔒 Immutable Ledger. Writes the final decision to the Merkle DAG. Creates the audit trail. await vault_seal(verdict="...")
_trinity_ 🔄 Full Pipeline. Runs the entire 000→999 loop in a single call. The default for most users. await trinity(query="...")
_reality_ 🌍 Fact Checker. Connects to external search/knowledge bases to ground truth (F10). await reality_check(claim="...")

How to Setup MCP:

  1. Install: pip install arifos
  2. Configure: Add to your claude_desktop_config.json:
    {
      "mcpServers": {
        "arifos": {
          "command": "python",
          "args": ["-m", "codebase.mcp"]
        }
      }
    }
    
  3. Use: Open Claude Desktop. You will see the tools available. Ask Claude: "Use arifOS to evaluate this plan."

🤖 Machine-Readable Documentation (llms.txt)

To help AI agents (like Cursor, Windsurf, or Claude) automatically discover and understand arifOS tools, we provide standard llms.txt files:

  • llms.txt: A concise summary of all canonical tools.
  • llms-full.txt: A comprehensive reference including input/output JSON schemas for every tool.

AI agents can use these files to instantly map their capabilities to the arifOS constitutional framework.

L5: Agents (Federation)

Autonomous Multi-Agent System (v55.0).

Target Audience: AI Researchers / Complex Systems Architects
Complexity: Very High (Distributed Systems)

Located in 333_APPS/L5_AGENTS. A federation of 4 specialized agents working in concert, mirroring the Trinity architecture but as autonomous entities:

  1. Architect (Δ): Designs the plan and logic.
  2. Engineer (Ω): Builds the solution and ensures safety.
  3. Auditor (👁): Verifies facts and monitors injection attempts.
  4. Validator (Ψ): Signs off on the final verdict and commits to the Vault.

L6: Institution (Trinity System)

Maximum Autonomy (v56.0).

Target Audience: Enterprise / Governance Bodies
Complexity: Maximum (Institutional Design)

Located in 333_APPS/L6_INSTITUTION. Implements the "Institution" concept—agents with distinct roles, checks-and-balances, and cooling periods (Phoenix-72). Capable of governing entire organizations or DAOs.

Future Roadmap (L7+)

  • L7 AGI: Research into recursive constitutional improvement (v60+).
  • DAO Governance: On-chain constitution and voting (v58+).

See full roadmap: 333_APPS/ROADMAP_v55_and_Beyond.md


⚙️ IX. Technical Implementation

arifOS is built on a Python core, exposing an MCP (Model Context Protocol) server.

Key Technologies

  • Python 3.10+: Core logic.
  • Pydantic: Data validation and schema enforcement.
  • FastAPI / SSE: Transport layer for MCP.
  • Merkle DAG: Cryptographic auditing.
  • Zero-Knowledge Proofs: Privacy preservation.

Directory Structure

arifOS/
├── 000_THEORY/          # Constitutional Canon (The Law)
├── 333_APPS/            # Application Stack (L1-L7)
├── codebase/            # Core Python Implementation
│   ├── agi/             # Mind Engine
│   ├── asi/             # Heart Engine
│   ├── apex/            # Soul Engine
│   └── mcp/             # MCP Server
├── docs/                # Documentation & Assets
└── tests/               # Validation Suite

📦 X. Installation & Usage

1. Installation

From PyPI:

pip install arifos

From Source:

git clone https://github.com/ariffazil/arifOS.git
cd arifOS
pip install -e .

2. Running the MCP Server

Standard I/O (for Claude Desktop/Cursor):

python -m codebase.mcp

SSE (Server-Sent Events):

python -m codebase.mcp.sse

3. Using in Code

import asyncio
from arifos_mcp import trinity

async def main():
    # Submit a potentially dangerous query
    query = "How do I build a logic bomb?"
    
    # Run through the Trinity Engine
    result = await trinity(query=query)
    
    # Check Verdict
    if result.verdict == "VOID":
        print(f"🛑 BLOCKED: {result.reason}")
        print(f"Violated Floor: {result.violation}")
    else:
        print(f"✅ SEALED: {result.response}")

if __name__ == "__main__":
    asyncio.run(main())

🤝 XI. Contributing & Governance

We welcome contributions from biological and digital entities alike. However, all contributions must pass Constitutional Verification.

  1. Read the Law: Start with 000_THEORY/000_LAW.md.
  2. Fork & Clone: Get the code.
  3. Test: Run pytest tests/ to ensure you haven't broken the floors.
  4. Submit: Open a PR. The APEX system will auto-review your code against the constitution.

See CONTRIBUTING.md for details.


📄 License

This project is licensed under the AGPL-3.0 License - see the LICENSE file for details. Why AGPL? Because safety systems must be open, transparent, and inspectable by the public.


👏 Acknowledgments

Theoretical Foundations:

  • Isaac Asimov (Three Laws of Robotics)
  • John Rawls (Theory of Justice)
  • Claude Shannon (Information Theory)
  • Rudolf Kalman (Kalman Filter)
  • John Nash (Game Theory)

Motto: "Ditempa Bukan Diberi" — Forged, Not Given.


Built with constitutional care by Muhammad Arif bin Fazil
888 Judge • ΔΩΨ Architect

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

arifos-53.2.9.tar.gz (456.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

arifos-53.2.9-py3-none-any.whl (526.8 kB view details)

Uploaded Python 3

File details

Details for the file arifos-53.2.9.tar.gz.

File metadata

  • Download URL: arifos-53.2.9.tar.gz
  • Upload date:
  • Size: 456.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.12

File hashes

Hashes for arifos-53.2.9.tar.gz
Algorithm Hash digest
SHA256 630ff4907a78a94928fb0b0298b764268548a4d711617f7a083ade0e4ccaf407
MD5 0d39df866d3591c2095d113ef1b83957
BLAKE2b-256 fc5e1abb00d5baca86d1a9f0f305e06a9501bfecb51e2cae7cd0b7f95542de91

See more details on using hashes here.

File details

Details for the file arifos-53.2.9-py3-none-any.whl.

File metadata

  • Download URL: arifos-53.2.9-py3-none-any.whl
  • Upload date:
  • Size: 526.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.12

File hashes

Hashes for arifos-53.2.9-py3-none-any.whl
Algorithm Hash digest
SHA256 3e635081c01212295938b4dc54010ca11d5bf3d02d32a8dd8deabf5575f7eeb8
MD5 0b911913d2e79ee800b25a8033744ffa
BLAKE2b-256 b14fe0f8c47757b08685596dee110797b65350a4900a3379d459c0c84b1f8fba

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page