Door: minimal adaptive control (act/update) with online surrogate and optional N(t) readout

Project description

Door

Door is a small adaptive control library: you implement plant.step(action) -> reward, then run an incremental loop with Door.act() and Door.update(reward). Under the hood it keeps an online surrogate (PyTorch MLP by default, ridge fallback if PyTorch is missing), scores candidate actions, applies a few gradient refinement steps on the best candidate (similar in spirit to a hybrid inner refine), and widens exploration when reward variability spikes.

door.run is a batch alternative: one rollout with volatility widening and an optional N(t)-based stability summary (StabilizeResult.show()).

Install (local tree)

cd team-bioquant
pip install -e .

On PyPI: pip install door-stabilizer (import package is still door). Optional PyTorch: pip install "door-stabilizer[torch]" or locally pip install -e ".[torch]".

Minimal example

import numpy as np
from door import Door

class Plant:
    def step(self, a):
        return float(-np.sum(np.asarray(a) ** 2))

p = Plant()
ctrl = Door(dim=2, action_low=-1.0, action_high=1.0, seed=0)
for _ in range(100):
    u = ctrl.act()
    r = p.step(u)
    ctrl.update(r)

Benchmark vs CEM-restart (drifting quadratic)

On a simple drifting quadratic toy (2D action, random-walk target, 200 steps, 48 seeds), Door (default ridge surrogate; no extra deps) vs a sequential CEM-restart baseline with the same number of plant.step calls:

Metric (48 seeds)	Door	CEM-restart
Mean cumulative reward	≈ −39.55	≈ −46.32
Paired mean (Door − CEM)	≈ +6.77	—
Mean reward, last 50 steps	≈ −0.0013	≈ −0.10

Head-to-head cumulative reward per seed: Door wins on ~44% of seeds but wins on average — CEM-restart has heavier tails (more catastrophic runs). That is a reasonable MVP story: better average and terminal performance, not “wins every seed.”

Reproduce:

python benchmark_door_vs_cem_restart.py --seeds 48

Optional PyTorch MLP + 2 gradient refine steps on the surrogate (closer to hybrid HAT spirit; slower):

python benchmark_door_vs_cem_restart.py --seeds 48 --torch --refine 2

Trade-off (how to message it)

Door is tuned for strong average performance and stability across seeds, not for winning every random seed against a noisy baseline. That is a reasonable MVP bar.

Research HAT

The full HAT stack (distilled transformer, JEPA world, challenge simulator) in this repo is separate; it can be bridged later as an optional checkpoint path. The HAT name is kept as an alias: from door import Door, HAT.

Project details

Release history Release notifications | RSS feed

This version

0.4.1

Apr 14, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

door_stabilizer-0.4.1.tar.gz (20.6 kB view details)

Uploaded Apr 14, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

door_stabilizer-0.4.1-py3-none-any.whl (17.5 kB view details)

Uploaded Apr 14, 2026 Python 3

File details

Details for the file door_stabilizer-0.4.1.tar.gz.

File metadata

Download URL: door_stabilizer-0.4.1.tar.gz
Upload date: Apr 14, 2026
Size: 20.6 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.2

File hashes

Hashes for door_stabilizer-0.4.1.tar.gz
Algorithm	Hash digest
SHA256	`ef2869b8aa47e77b1f4ac91317e4037d45e56786e0440d4ca9ead6a9b176162f`
MD5	`4a8b1b6b27cfbaebc5773d0e76a6cb36`
BLAKE2b-256	`9a67d93acd3c23c02244dff98e9036c163f5f122c1b4042a840b342847cc9e4e`

See more details on using hashes here.

File details

Details for the file door_stabilizer-0.4.1-py3-none-any.whl.

File metadata

Download URL: door_stabilizer-0.4.1-py3-none-any.whl
Upload date: Apr 14, 2026
Size: 17.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.2

File hashes

Hashes for door_stabilizer-0.4.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`a6973aed1f27f69790ad55c01b20e56b5baf996ff1523a0f27e0522d167262c1`
MD5	`46f06e6f4afee22c96f6cfb0982dae81`
BLAKE2b-256	`020f0cd5f532e0e1a28f9626e972488626b633261c3106e74782f21836bd7b69`

See more details on using hashes here.

door-stabilizer 0.4.1

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

Door

Install (local tree)

Minimal example

Benchmark vs CEM-restart (drifting quadratic)

Trade-off (how to message it)

Research HAT

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes