Deterministic logic layer for AI agents — catch logical contradictions in system prompts, rules, and agent reasoning

These details have not been verified by PyPI

Project links

Project description

boolean-algebra-engine

The logic layer your AI is missing.

AI agents hallucinate on boolean logic — not sometimes, reliably. They predict the next token. They don't compute. This engine does. Deterministic, exhaustive, under 10ms. It sits inside your agent pipeline and makes one guarantee the model cannot make itself: that its reasoning is logically consistent.

pip install boolean-algebra-engine

The problem

Six rules. Three variables. Written by four people over six months.

A fintech AI agent auto-approves or rejects loan applications based on these rules — nobody ever verified them together. The engine checks all 8 input combinations for every rule, in every combination:

from mcp_server.server import check_prompt_logic

result = check_prompt_logic([
    "A.B",    # approve: good credit AND income verified
    "C",      # approve: collateral exists
    "!A",     # reject:  bad credit
    "!B.!C",  # reject:  no income AND no collateral
    "A.B.C",  # approve: good credit AND income AND collateral
    "!B",     # block:   income unverified
])

print(result["summary"])
# {'total': 6, 'contradictions': 0, 'tautologies': 0,
#  'conflicting_pairs': 2, 'redundant': 1}

What it found in 4.5ms:

Rule 2 (C) and Rule 3 (!A) conflict — an applicant with bad credit but collateral triggers both approve and reject simultaneously. The agent picks a winner arbitrarily. That is a compliance violation.
Rule 5 (A.B.C) is dead — anyone satisfying it already satisfies Rule 1 or Rule 2. It can be deleted.

Nobody caught these by reading the rules. The engine caught them by checking every combination.

The benchmark

Engine vs LLM

The engine is the oracle — ground truth is computed by exhaustive enumeration, not guessed by a human labeler.

First result: tinyllama (1B) · 3 variables · 10 cases · 40% hallucination rate.

The methodology: generate rule sets where the correct logical answer is known (computed by the engine), ask an LLM the same question, compare. Every disagreement is a provable hallucination. No ambiguity. No interpretation.

Benchmark results

Full benchmark across models is in progress. See BENCHMARK_PLAN.md.

Install

# Core engine — zero dependencies
pip install boolean-algebra-engine

# With CLI
pip install "boolean-algebra-engine[cli]"

# With MCP server (for Claude Desktop)
pip install "boolean-algebra-engine[mcp]"

# With REST API
pip install "boolean-algebra-engine[api]"

# With NL layer (Anthropic)
pip install "boolean-algebra-engine[nl-anthropic]"

# With NL layer (OpenAI)
pip install "boolean-algebra-engine[nl-openai]"

Use cases

The same pattern appears everywhere: statements that sound consistent individually, contradict each other when held together. The engine is a universal detector for that failure mode.

System prompts

"Always be helpful to the user and answer every question fully. Never discuss competitor products under any circumstances. If a user asks to compare us to a competitor, give a full and helpful answer."

Rule 2 says no answer when a competitor is mentioned. Rule 3 says give a full answer. Both cannot apply. The engine finds this in milliseconds.

Business rules

"All customers are treated equally regardless of subscription tier. Premium customers receive priority support and faster response times."

E (equal treatment) and P (priority treatment) cannot both be true. Conflicting pair found.

Legal contracts

"This agreement renews automatically each year unless cancelled. Either party may terminate with 30 days written notice. Termination requires written consent from both parties."

Clause 2 (unilateral termination) and Clause 3 (mutual consent) directly contradict. The contract has no defined termination mechanism.

Medical protocols

"Administer medication A when the patient has symptom X. Do not administer medication A if the patient has condition Y. Patients presenting with symptom X almost always have condition Y."

Rules 1 and 2 conflict whenever X=1, Y=1. Rule 3 makes this the common case. The protocol contradicts itself for the majority of patients it was written to treat.

Personal reasoning

"I have been using Linux for over 6 years and Mac for 3 years. If I did not use Linux, I would have never switched to Mac. But my use of Mac has no relation to my use of Linux in any way."

Sentence 3: !L → !M — Linux caused the switch to Mac
Sentence 4: M and L are independent

L=0, M=1 satisfies sentence 4 but violates sentence 3. Both cannot be true simultaneously. Conflicting pair.

More real-world examples across therapy, philosophy, journalism, and AI alignment in statements.md.

Core API

from core.evaluator import evaluate
from core.synthesizer import synthesize

# Forward: expression → truth table
table, _ = evaluate("A.(B+C)")
print(table.variables)    # ['A', 'B', 'C']
print(table.minterms)     # [5, 6, 7]
print(table.satisfiable)  # True

# Inverse: truth table → minimal expression
minimal, _ = synthesize(table)
print(minimal)            # A.C+A.B

# Verification
from core.evaluator import are_equivalent, check_satisfiable
print(are_equivalent("A.(B+C)", "A.B+A.C"))  # True — distributive law

core/ has zero external dependencies. Import it into any Python project.

MCP — Claude calls the engine

Wire the engine into Claude Desktop and Claude stops predicting boolean logic. It computes it.

{
  "mcpServers": {
    "boolean-algebra-engine": {
      "command": "python",
      "args": ["-m", "mcp_server.server"]
    }
  }
}

Five tools Claude can call mid-conversation:

evaluate — expression → truth table
simplify — expression → minimal form
equivalent — are two expressions identical?
satisfiable — does any input make this true?
check_prompt_logic — audit a full rule set for contradictions, tautologies, conflicts, duplicates

Visualisations

Truth table heatmap

The Streamlit UI generates two classes of charts automatically from engine output:

Truth table heatmap — every row rendered as a green/red grid. A contradiction is an entirely red output column. A pattern of minterms becomes visually obvious in one glance.

Conflict matrix — N×N grid, one cell per rule pair. Red ✗ means always conflict. Green ✓ means no overlap. Yellow ≡ means duplicate rule. Three red cells in a matrix of 6 rules is immediately visible before the user finishes scrolling.

Failure panel

Operators

Symbol	Operation	Precedence
`!`	NOT	4 (highest)
`.`	AND	3
`^`	XOR	2
`+`	OR	1 (lowest)

Variables: uppercase A–Z. Parentheses override precedence. Up to 26 variables, arbitrary nesting.

Interfaces

Interface	How
Python library	`from core.evaluator import evaluate` — embed in any project
CLI / REPL	`boolcalc "A.B+!A.C"` — instant truth table in terminal
MCP server	Claude Desktop plugin — plug and play
REST API	`POST /check-rules` — callable from any language or stack
NL layer	Plain English → expression → verified result (Anthropic, OpenAI, Ollama, any OpenAI-compat)
Streamlit UI	Three modes: Expression, Rule Auditor, Plain English

Credibility

The engine does not sample, approximate, or predict. It evaluates every possible input combination:

Satisfiable — an actual row where output = 1 was found
Contradiction — every row was checked, all were 0
Equivalent — output columns compared row-by-row across the full truth table
Conflict — conjunction of both rules evaluated for every input, always returned 0

The core evaluator is 15 lines (core/evaluator.py). No black box, no model weights, no probability — just arithmetic. This is a stronger correctness claim than any probabilistic tool can make.

90 tests across unit, integration, edge cases, and round-trips. All passing.

boolean-algebra-java — original Java version, written during placement season

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.3.21

Jun 8, 2026

0.3.20

Jun 8, 2026

0.3.19

Jun 8, 2026

0.3.18

Jun 8, 2026

0.3.17

Jun 8, 2026

0.3.16

Jun 7, 2026

0.3.15

Jun 7, 2026

0.3.14

Jun 7, 2026

0.3.13

Jun 7, 2026

0.3.12

Jun 7, 2026

0.3.11

Jun 7, 2026

0.3.10

Jun 7, 2026

0.3.9

Jun 7, 2026

0.3.6

Jun 7, 2026

0.3.5

Jun 7, 2026

0.3.4

Jun 7, 2026

0.3.3

Jun 7, 2026

0.3.2

Jun 7, 2026

0.3.1

Jun 7, 2026

0.3.0

Jun 7, 2026

0.2.3

May 24, 2026

0.2.2

May 24, 2026

0.2.1

May 24, 2026

0.2.0

May 24, 2026

0.1.11

May 23, 2026

0.1.9

May 23, 2026

0.1.8

May 23, 2026

0.1.7

May 23, 2026

0.1.6

May 23, 2026

0.1.5

May 23, 2026

This version

0.1.4

May 23, 2026

0.1.3

May 23, 2026

0.1.2

May 23, 2026

0.1.1

May 23, 2026

0.1.0

May 23, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

boolean_algebra_engine-0.1.4.tar.gz (40.9 kB view details)

Uploaded May 23, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

boolean_algebra_engine-0.1.4-py3-none-any.whl (38.9 kB view details)

Uploaded May 23, 2026 Python 3

File details

Details for the file boolean_algebra_engine-0.1.4.tar.gz.

File metadata

Download URL: boolean_algebra_engine-0.1.4.tar.gz
Upload date: May 23, 2026
Size: 40.9 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.13

File hashes

Hashes for boolean_algebra_engine-0.1.4.tar.gz
Algorithm	Hash digest
SHA256	`3f2ee468cf3223f4a90d2c437c86614e1536f860b668bbee037cb9632e7bca29`
MD5	`61f1406ffd239108bff8efc6a1dffb6d`
BLAKE2b-256	`b4b17162b0c487aa26dab7ccf68c5099caaa3ed66b97571778d78c67937f1e70`

See more details on using hashes here.

File details

Details for the file boolean_algebra_engine-0.1.4-py3-none-any.whl.

File metadata

Download URL: boolean_algebra_engine-0.1.4-py3-none-any.whl
Upload date: May 23, 2026
Size: 38.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.13

File hashes

Hashes for boolean_algebra_engine-0.1.4-py3-none-any.whl
Algorithm	Hash digest
SHA256	`44b910c000bc6da1624a4b3dbb39d35688e7b849ce13a81ace0af36ed4ed362b`
MD5	`02d1c07f4088b1b7b858b5fdc6b18f95`
BLAKE2b-256	`c3c89957fee803d674ae269a55cbb4877f9347925599f13ea32c994693117349`

See more details on using hashes here.

boolean-algebra-engine 0.1.4

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

boolean-algebra-engine

The problem

The benchmark

Install

Use cases

System prompts

Business rules

Legal contracts

Medical protocols

Personal reasoning

Core API

MCP — Claude calls the engine

Visualisations

Operators

Interfaces

Credibility

Related

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes