Python SDK + CLI for openinterp.org — Atlas search, SAE Traces, FabricationGuard hallucination detection, InterpScore. The operational layer for mechanistic interpretability.

These details have not been verified by PyPI

Project links

Project description

`openinterp`

Python SDK + CLI for openinterp.org

Search the feature Atlas, generate Traces from your own SAE, rank against the public InterpScore leaderboard.

Install

pip install openinterp              # lite: Atlas + CLI (no torch, ~2 MB total)
pip install "openinterp[full]"      # + torch/transformers/safetensors for trace generation

Requires Python ≥ 3.10.

Part of a 5-repo ecosystem

Repo	What's in it
`.github`	Org profile + shared CoC + SECURITY
`web`	Next.js site behind openinterp.org
`notebooks`	23 training + interpretability notebooks
`cli` (you are here)	`pip install openinterp` — Python SDK
`mechreward`	SAE features as dense RL reward

🚀 Quick start

Search the Atlas (offline, zero GPU)

$ openinterp atlas "overconfidence"

                    Atlas results: 'overconfidence'
┏━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━┳━━━━━━━┳━━━━━━━━━━━━
┃ ID      ┃ Name                    ┃ Model             ┃ AUROC ┃ Description
┡━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━╇━━━━━━━╇━━━━━━━━━━━━
│ f2503   │ overconfidence_pattern  │ Qwen/Qwen3.6-27B  │  0.54 │ Definitive…
│ f1847   │ urgency_assessment      │ Qwen/Qwen3.6-27B  │  0.68 │ Time-critic…
└─────────┴─────────────────────────┴───────────────────┴───────┴────────────

>>> from openinterp import search_features
>>> features = search_features("overconfidence", model="Qwen/Qwen3.6-27B")
>>> features[0].id
'f2503'

Generate a Trace from your own SAE

pip install "openinterp[full]"

openinterp trace \
    --model google/gemma-2-2b \
    --sae-repo YOUR_HF_USER/gemma2-2b-sae-first \
    --prompt "The capital of France is" \
    --layer 12 \
    --d-model 2304 --d-sae 16384 --k 64 \
    --out my_trace.json

This:

Loads the base model in bf16 with SDPA (no flash-attn)
Loads your SAE from HuggingFace (sae_lens safetensors format)
Generates tokens, captures residuals at layer 12
Applies the SAE, picks top-10 active features
Writes a Trace JSON matching openinterp.org/observatory/trace byte-for-byte

Python API

from openinterp import generate_trace

trace = generate_trace(
    model_id="google/gemma-2-2b",
    sae_repo="YOUR_HF_USER/gemma2-2b-sae-first",
    prompt="The capital of France is",
    layer=12,
    d_model=2304,
    d_sae=16384,
    k=64,
)

print(trace.model_dump_json(indent=2))   # Trace Theater schema

With feature labels from notebook 04

# After running 04_discover_features.ipynb (emits feature_catalog.json):
openinterp trace ... --catalog feature_catalog.json

Trace features inherit names from your catalog.

📦 What's in v0.1.0

Command	Status	What it does
`openinterp atlas <query>`	✅ live	Feature search with offline fallback to curated demo features
`openinterp trace ...`	✅ live (needs `[full]`)	Real SAE trace generation, sae_lens format, any HF model
`openinterp info`	✅ live	Version + optional-dep status

Planned v0.2.0 (Q2 2026)

openinterp upload-trace <trace.json> → shareable openinterp.org URL
openinterp score --sae-repo X → compute InterpScore (wraps notebook 18)
openinterp steer --sae-repo X --feature Y --alpha Z → intervention (wraps notebook 06)
openinterp circuit --sae-repo X --prompt Y → attribution graph JSON (wraps notebook 14/15)
openinterp publish <repo> → HuggingFace release with model card

Open an issue on the tracker if you'd take one of these.

🛠️ Development

git clone https://github.com/OpenInterpretability/cli openinterp-cli
cd openinterp-cli
python -m venv .venv && source .venv/bin/activate
pip install -e ".[dev,full]"          # dev = pytest + ruff + build; full = torch + transformers
pytest -xvs                            # 5 tests, ~1s

Package layout

openinterp-cli/
├── pyproject.toml              # name='openinterp', hatchling build
├── openinterp/
│   ├── __init__.py             # public exports + __version__
│   ├── models.py               # pydantic types: AtlasFeature, Trace, TraceFeature
│   ├── atlas.py                # search_features() — HF API + curated fallback
│   ├── trace.py                # generate_trace() — real transformers-based impl
│   └── cli.py                  # click-based CLI: atlas / trace / info
├── tests/
│   ├── test_atlas.py
│   └── test_trace.py
├── CHANGELOG.md
├── CONTRIBUTING.md
└── README.md

Contribution recipe — add a new command

Full rules: CONTRIBUTING.md.

Decide which notebook it wraps (score → 18, steer → 06, circuit → 14/15, publish → generic)
Add a function to the matching file (openinterp/score.py, etc.). Keep it small — actual compute lives in the notebook.
Expose it in __init__.py
Add a @main.command() in cli.py with click decorators
Add a smoke test in tests/test_<name>.py
Update CHANGELOG.md under [Unreleased]
PR title: Add openinterp <command>

Hard rules:

Python ≥ 3.10 syntax (PEP 604 unions OK)
dtype=torch.bfloat16, never torch_dtype= (transformers 5.x deprecated)
SDPA only, never flash-attn
New heavy deps (torch tier) → add to [full] extra, not base
Every new public function has type hints + docstring

🚢 Release process (maintainer)

# 1. Bump version in BOTH:
#    pyproject.toml          ([project] version = "X.Y.Z")
#    openinterp/__init__.py  (__version__ = "X.Y.Z")
# 2. Update CHANGELOG.md — move [Unreleased] → [X.Y.Z] — YYYY-MM-DD

source .venv/bin/activate
rm -rf dist/
python -m build
python -m twine check dist/*
python -m twine upload dist/*     # needs PyPI token in ~/.pypirc

git tag vX.Y.Z
git push --tags

CI

Every PR runs:

pytest -xvs across Python 3.10, 3.11, 3.12 (see .github/workflows/ci.yml)
ruff check . (warn-only for now)
python -m build + twine check

Green required to merge.

Community

💬 Discussions — API proposals, "which repo should this live in"
🟢 Good-first-issues
📦 PyPI release history
✉️ hi@openinterp.org

Standing on the shoulders of

Neuronpedia · the SAE encyclopedia
Gemma Scope · reference SAE suite
Gao et al. 2024 · TopK + AuxK recipe
SAELens · our safetensors format

License

Apache-2.0. Built by Caio Vicentino + OpenInterpretability. 2026.

openinterp.org · github.com/OpenInterpretability · hi@openinterp.org

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.2.2

May 2, 2026

0.2.1

May 1, 2026

This version

0.2.0

Apr 27, 2026

0.1.0

Apr 24, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

openinterp-0.2.0.tar.gz (22.6 kB view details)

Uploaded Apr 27, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

openinterp-0.2.0-py3-none-any.whl (23.0 kB view details)

Uploaded Apr 27, 2026 Python 3

File details

Details for the file openinterp-0.2.0.tar.gz.

File metadata

Download URL: openinterp-0.2.0.tar.gz
Upload date: Apr 27, 2026
Size: 22.6 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.9.6

File hashes

Hashes for openinterp-0.2.0.tar.gz
Algorithm	Hash digest
SHA256	`7233f477d5f698f09a7c032ca0f962e53ed66a7062b054ed2fabd3391aeb6c8d`
MD5	`9c16243a626bdf0918e8e3889bfb81fe`
BLAKE2b-256	`c2896de102d89b414506d2693cc6335143600707cc3e775dacb047be9a5f30e9`

See more details on using hashes here.

File details

Details for the file openinterp-0.2.0-py3-none-any.whl.

File metadata

Download URL: openinterp-0.2.0-py3-none-any.whl
Upload date: Apr 27, 2026
Size: 23.0 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.9.6

File hashes

Hashes for openinterp-0.2.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`d5aa20bac743d07774fa3c8db7b62c56049f2d8b4b89c6c5fd1173003016c2d8`
MD5	`7be3fc63c6c21f71c86dc6ad33f0f9aa`
BLAKE2b-256	`cf91799f50530330a40f80968b9762fde8ce5dff4cbc6c88a7e8cd3b274ef0a8`

See more details on using hashes here.

openinterp 0.2.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

openinterp

Python SDK + CLI for openinterp.org

Install

Part of a 5-repo ecosystem

🚀 Quick start

Search the Atlas (offline, zero GPU)

Generate a Trace from your own SAE

Python API

With feature labels from notebook 04

📦 What's in v0.1.0

Planned v0.2.0 (Q2 2026)

🛠️ Development

Package layout

Contribution recipe — add a new command

🚢 Release process (maintainer)

CI

Community

Standing on the shoulders of

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

`openinterp`