
Reality Check

A framework for rigorous, systematic analysis of claims, sources, predictions, and argument chains.

With so many hot takes, plausible theories, misinformation, and AI-generated content out there, sometimes you just need a realitycheck.

Overview

Reality Check helps you build and maintain a unified knowledge base with:

  • Claim Registry: Track claims with evidence levels, credence scores, and relationships
  • Source Analysis: Structured 3-stage methodology (descriptive → evaluative → dialectical)
  • Prediction Tracking: Monitor forecasts with falsification criteria and status updates
  • Argument Chains: Map logical dependencies and identify weak links
  • Semantic Search: Find related claims across your entire knowledge base

See realitycheck-data for a public example knowledge base built with Reality Check.
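The records in the overview can be pictured as simple structured objects. This is an illustrative sketch inferred from the CLI flags in this README, not the framework's actual schema (which lives in the LanceDB tables):

```python
from dataclasses import dataclass

@dataclass
class Claim:
    # Illustrative shape of a claim record; field names are inferred
    # from the `rc-db claim add` flags, not taken from the real schema.
    id: str              # e.g. "TECH-2026-001" (DOMAIN-YEAR-SEQ)
    text: str
    type: str            # claim type symbol, e.g. "[F]" for Fact
    domain: str          # domain code, e.g. "TECH"
    evidence_level: str  # "E1" (strongest) .. "E6" (unsupported)
    credence: float      # subjective probability in [0, 1]
    notes: str = ""

claim = Claim(
    id="TECH-2026-001",
    text="AI training costs double annually",
    type="[F]",
    domain="TECH",
    evidence_level="E2",
    credence=0.8,
)
```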

Status

v0.1.4 - Core functionality is complete, with an extended CLI, integrations for Claude Code, Codex, and Amp, and 231 passing tests.

Prerequisites

  • Python 3.11+
  • Claude Code (optional) - For plugin integration
  • OpenAI Codex (optional) - For skills integration
  • Amp (optional) - For skills integration

Installation

From PyPI (Recommended)

# Install with pip
pip install realitycheck

# Or with uv (faster)
uv pip install realitycheck  # installs to active venv or system Python

# Verify installation
rc-db --help

From Source (Development)

# Clone the framework
git clone https://github.com/lhl/realitycheck.git
cd realitycheck

# Install dependencies with uv
uv sync

# Verify installation
REALITYCHECK_EMBED_SKIP=1 uv run pytest -v

GPU Support (Optional)

The default install uses CPU-only PyTorch. For GPU-accelerated embeddings:

# NVIDIA CUDA 12.8
uv sync --extra-index-url https://download.pytorch.org/whl/cu128

# AMD ROCm 6.4
uv sync --extra-index-url https://download.pytorch.org/whl/rocm6.4

AMD TheRock nightly (e.g., gfx1151 / Strix Halo):

TheRock nightlies provide support for newer AMD GPUs not yet in stable ROCm. Replace gfx1151 with your GPU arch.

Note: TheRock support is experimental. Newer architectures (gfx1151/RDNA 3.5, gfx1200/RDNA 4) may require matching system ROCm kernel drivers. Memory allocation may work but kernel execution can fail if there's a version mismatch between pip ROCm userspace and system kernel module.

# 1. Install matching ROCm SDK (system-wide)
pip install --index-url https://rocm.nightlies.amd.com/v2/gfx1151/ "rocm[libraries]" -U

# 2. Create fresh venv with ROCm torch
rm -rf .venv && uv venv --python 3.12
VIRTUAL_ENV=$(pwd)/.venv uv pip install --index-url https://rocm.nightlies.amd.com/v2/gfx1151/ torch
VIRTUAL_ENV=$(pwd)/.venv uv pip install sentence-transformers lancedb pyarrow pyyaml tabulate

# 3. Set library path and verify
export LD_LIBRARY_PATH="$(pip show rocm-sdk-core | grep Location | cut -d' ' -f2)/_rocm_sdk_devel/lib:$LD_LIBRARY_PATH"
.venv/bin/python -c "import torch; print(torch.version.hip); print(torch.cuda.is_available())"

Or set UV_EXTRA_INDEX_URL in your shell profile for persistent configuration.

Note: If switching GPU backends, force reinstall torch:

rm -rf .venv && uv sync --extra-index-url <your-index-url>

Quick Start

1. Create Your Knowledge Base

# Create a new directory for your data
mkdir my-research && cd my-research

# Initialize a Reality Check project (creates structure + database)
rc-db init-project

# This creates:
#   .realitycheck.yaml    - Project config
#   data/realitycheck.lance/  - Database
#   analysis/sources/     - For analysis documents
#   tracking/             - For prediction tracking
#   inbox/                - For sources to process

2. Set Environment Variable

# Tell Reality Check where your database is
export REALITYCHECK_DATA="data/realitycheck.lance"

# Add to your shell profile for persistence (use an absolute path there):
echo "export REALITYCHECK_DATA=\"$PWD/data/realitycheck.lance\"" >> ~/.bashrc

3. Add Your First Claim

rc-db claim add \
  --text "AI training costs double annually" \
  --type "[F]" \
  --domain "TECH" \
  --evidence-level "E2" \
  --credence 0.8

# Output: Created claim: TECH-2026-001

4. Add a Source

rc-db source add \
  --id "epoch-2024-training" \
  --title "Training Compute Trends" \
  --type "REPORT" \
  --author "Epoch AI" \
  --year 2024 \
  --url "https://epochai.org/blog/training-compute-trends"

5. Search and Explore

# Semantic search
rc-db search "AI costs"

# List all claims
rc-db claim list --format text

# Check database stats
rc-db stats

Using the Framework as a Submodule

For easier access to scripts, add the framework as a git submodule:

cd my-research
git submodule add https://github.com/lhl/realitycheck.git .framework

# Now use shorter paths:
.framework/scripts/db.py claim list --format text
.framework/scripts/db.py search "AI"

CLI Reference

All commands should be run with REALITYCHECK_DATA set.

If REALITYCHECK_DATA is not set, commands run only when a default database exists at ./data/realitycheck.lance/; otherwise they exit with an error suggesting you set REALITYCHECK_DATA or create a project via rc-db init-project. The Claude Code plugin can also auto-resolve project config via .realitycheck.yaml.
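The lookup order described above can be sketched as follows. This is illustrative pseudologic, not the actual implementation; the function name is an assumption:

```python
import os
from pathlib import Path

def resolve_db_path() -> Path:
    """Illustrative sketch of the database lookup order:
    1. the REALITYCHECK_DATA environment variable, if set;
    2. a default database at ./data/realitycheck.lance/;
    otherwise fail with a hint, as the CLI does.
    """
    env = os.environ.get("REALITYCHECK_DATA")
    if env:
        return Path(env)
    default = Path("data/realitycheck.lance")
    if default.exists():
        return default
    raise SystemExit(
        "No database found. Set REALITYCHECK_DATA or run `rc-db init-project`."
    )
```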

# Database management
rc-db init                              # Initialize database tables
rc-db init-project [--path DIR]         # Create new project structure
rc-db stats                             # Show statistics
rc-db reset                             # Reset database (destructive!)

# Claim operations
rc-db claim add --text "..." --type "[F]" --domain "TECH" --evidence-level "E3"
rc-db claim add --id "TECH-2026-001" --text "..." ...  # With explicit ID
rc-db claim get <id>                    # Get single claim (JSON)
rc-db claim list [--domain D] [--type T] [--format json|text]
rc-db claim update <id> --credence 0.9 [--notes "..."]
rc-db claim delete <id>                 # Delete a claim

# Source operations
rc-db source add --id "..." --title "..." --type "PAPER" --author "..." --year 2024
rc-db source get <id>
rc-db source list [--type T] [--status S]

# Chain operations (argument chains)
rc-db chain add --id "..." --name "..." --thesis "..." --claims "ID1,ID2,ID3"
rc-db chain get <id>
rc-db chain list

# Prediction operations
rc-db prediction add --claim-id "..." --source-id "..." --status "[P→]"
rc-db prediction list [--status S]

# Search and relationships
rc-db search "query" [--domain D] [--limit N]
rc-db related <claim-id>                # Find related claims

# Import/Export
rc-db import <file.yaml> --type claims|sources|all
rc-validate                             # Check database integrity
rc-export yaml claims -o claims.yaml    # Export to YAML

Claude Code Plugin

Claude Code is Anthropic's AI coding assistant. Reality Check includes a plugin that adds slash commands for analysis workflows.

Install the Plugin

# From the realitycheck repo directory:
make install-plugin-claude

Note: Local plugin discovery from ~/.claude/plugins/local/ is currently broken. Use the --plugin-dir flag:

# Start Claude Code with the plugin loaded:
claude --plugin-dir /path/to/realitycheck/integrations/claude/plugin

# Or create a shell alias:
alias claude-rc='claude --plugin-dir /path/to/realitycheck/integrations/claude/plugin'

Plugin Commands

Commands use the /reality: prefix:

Command                          Description
/reality:check <url>             Flagship: full analysis workflow (fetch → analyze → register → validate)
/reality:synthesize <topic>      Cross-source synthesis across multiple analyses
/reality:analyze <source>        Manual 3-stage analysis without auto-registration
/reality:extract <source>        Quick claim extraction
/reality:search <query>          Semantic search across claims
/reality:validate                Check database integrity
/reality:export <format> <type>  Export to YAML/Markdown
/reality:stats                   Show database statistics

Alternative: Global Skills

If you prefer skills over plugins:

make install-skills-claude

This installs skills to ~/.claude/skills/ which are auto-activated based on context.

Example Session

> /reality:check https://arxiv.org/abs/2401.00001

Claude will:
1. Fetch the paper content
2. Run 3-stage analysis (descriptive → evaluative → dialectical)
3. Extract and classify claims
4. Register source and claims in your database
5. Validate data integrity
6. Report summary with claim IDs

See docs/PLUGIN.md for full documentation.

Codex Skills

Codex doesn’t support Claude-style plugins, but it does support “skills”.

Codex CLI reserves /... for built-in commands, so custom slash commands are not supported. Reality Check ships Codex skills you can invoke with $...:

  • $check ...
  • $realitycheck ... (including $realitycheck data <path> to target a DB for the current Codex session)

Embeddings are generated by default when registering sources/claims. Only set REALITYCHECK_EMBED_SKIP=1 (or use --no-embedding) when you explicitly want to defer embeddings.
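That default-on behavior might be checked roughly like this (a sketch; the real code and the CLI flag's plumbing may differ):

```python
import os

def embeddings_enabled(cli_no_embedding: bool = False) -> bool:
    """Embeddings are generated by default; they are deferred only when
    REALITYCHECK_EMBED_SKIP=1 is set or a --no-embedding flag is passed."""
    if cli_no_embedding:
        return False
    return os.environ.get("REALITYCHECK_EMBED_SKIP") != "1"
```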

Install:

make install-skills-codex

See integrations/codex/README.md for usage and examples.

Amp Skills

Amp is Sourcegraph's AI coding assistant. Reality Check includes skills that activate on natural language triggers.

Install Skills

make install-skills-amp

Usage

Skills activate automatically based on natural language:

"Analyze this article for claims: https://example.com/article"
"Search for claims about AI automation"
"Validate the database"
"Show database stats"

See integrations/amp/README.md for full documentation.

Taxonomy Reference

Claim Types

Type            Symbol  Definition
Fact            [F]     Empirically verified, consensus reality
Theory          [T]     Coherent framework with empirical support
Hypothesis      [H]     Testable proposition, awaiting evidence
Prediction      [P]     Future-oriented with specified conditions
Assumption      [A]     Underlying premise (stated or unstated)
Counterfactual  [C]     Alternative scenario for comparison
Speculation     [S]     Unfalsifiable or untestable claim
Contradiction   [X]     Identified logical inconsistency

Evidence Hierarchy

Level  Strength            Description
E1     Strong Empirical    Replicated studies, systematic reviews, meta-analyses
E2     Moderate Empirical  Single peer-reviewed study, official statistics
E3     Strong Theoretical  Expert consensus, working papers, preprints
E4     Weak Theoretical    Industry reports, credible journalism
E5     Opinion/Forecast    Personal observation, anecdote, expert opinion
E6     Unsupported         Pure speculation, unfalsifiable claims

Domain Codes

Domain         Code      Description
Technology     TECH      AI capabilities, tech trajectories
Labor          LABOR     Employment, automation, work
Economics      ECON      Value, pricing, distribution
Governance     GOV       Policy, regulation, institutions
Social         SOC       Social structures, culture, behavior
Resource       RESOURCE  Scarcity, abundance, allocation
Transition     TRANS     Transition dynamics, pathways
Geopolitics    GEO       International relations, competition
Institutional  INST      Organizations, coordination
Risk           RISK      Risk assessment, failure modes
Meta           META      Claims about the framework itself
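Claim IDs combine a domain code with a year and sequence number (e.g. TECH-2026-001, as produced by `rc-db claim add`). A sketch of a validator for that format, built from the taxonomy tables; the exact ID grammar the framework accepts is an assumption inferred from the examples in this README:

```python
import re

DOMAINS = {"TECH", "LABOR", "ECON", "GOV", "SOC", "RESOURCE",
           "TRANS", "GEO", "INST", "RISK", "META"}
EVIDENCE_LEVELS = {f"E{i}" for i in range(1, 7)}  # E1 .. E6

# DOMAIN-YEAR-SEQ, e.g. TECH-2026-001 (format inferred from the
# examples above; the framework may accept other shapes).
CLAIM_ID_RE = re.compile(r"^([A-Z]+)-(\d{4})-(\d{3})$")

def validate_claim_id(claim_id: str) -> bool:
    """True if the ID matches DOMAIN-YEAR-SEQ with a known domain code."""
    m = CLAIM_ID_RE.match(claim_id)
    return bool(m) and m.group(1) in DOMAINS
```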

Project Structure

realitycheck/                 # Framework repo (this)
├── scripts/                  # Python CLI tools
│   ├── db.py                 # Database operations + CLI
│   ├── validate.py           # Data integrity checks
│   ├── export.py             # YAML/Markdown export
│   ├── migrate.py            # Legacy YAML migration
│   ├── embed.py              # Embedding utilities (re-generate, status)
│   └── html_extract.py       # HTML → {title, published, text} extraction
├── plugin/                   # Claude Code plugin
│   ├── skills/               # Slash command skill definitions
│   └── scripts/              # Shell wrappers
├── integrations/             # Other tool integrations
│   └── codex/                # Codex skills + installers
├── methodology/              # Analysis templates
│   ├── evidence-hierarchy.md
│   ├── claim-taxonomy.md
│   └── templates/
├── tests/                    # pytest suite (137 tests)
└── docs/                     # Documentation

my-research/                  # Your data repo (separate)
├── .realitycheck.yaml        # Project config
├── data/realitycheck.lance/  # LanceDB database
├── analysis/sources/         # Analysis documents
├── tracking/                 # Prediction tracking
└── inbox/                    # Sources to process

Why a Unified Knowledge Base?

Reality Check recommends one knowledge base per user, not per topic:

  • Claims build on each other across domains (AI claims inform economics claims)
  • Shared evidence hierarchy enables consistent evaluation
  • Cross-domain synthesis becomes possible
  • Semantic search works across your entire knowledge base

Create separate databases only for organizational boundaries, privacy requirements, or team collaboration.

Example Knowledge Base

See realitycheck-data for a public example knowledge base built with Reality Check, tracking claims across technology, economics, labor, and governance domains.

Embedding Model

Reality Check uses all-MiniLM-L6-v2 for semantic search embeddings; it offers the best balance of speed, memory footprint, and quality for CPU inference:

Model                   Dim   Load Time  Throughput  Memory
all-MiniLM-L6-v2        384   2.9 s      7.8 q/s     1.2 GB
all-mpnet-base-v2       768   3.0 s      3.3 q/s     1.4 GB
granite-embedding-278m  768   6.0 s      3.4 q/s     2.5 GB
stella_en_400M_v5       1024  4.4 s      1.7 q/s     2.7 GB

The 384-dimension vectors are stored in LanceDB and used for similarity search across claims.
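Similarity over those vectors is the standard cosine measure; a minimal pure-Python sketch for intuition (the actual nearest-neighbour search is handled by LanceDB):

```python
import math

def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine similarity between two embedding vectors: the dot product
    divided by the product of the vector norms, giving a score in [-1, 1]
    where 1.0 means identical direction."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)
```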

Note: Embeddings default to CPU to avoid GPU driver crashes. To use GPU:

export REALITYCHECK_EMBED_DEVICE="cuda"  # or "mps" for Apple Silicon

Development

# Run tests (skip slow embedding tests)
REALITYCHECK_EMBED_SKIP=1 uv run pytest -v

# Run all tests including embeddings
uv run pytest -v

# Run with coverage
uv run pytest --cov=scripts --cov-report=term-missing

See CLAUDE.md for development workflow and contribution guidelines.

Documentation

License

Apache 2.0
