Open the hood on neural networks.
Component-level model surgery, analysis, and composition. Extract attention heads, swap FFN layers between models, inject capability blades, and build hybrid architectures — all from a beautiful CLI or Python API.
People used to be "Gear Heads" — they could fine-tune actual car engines manually. Model Garage brings that same hands-on philosophy to neural networks. Open the hood, understand the parts, swap what you need.
## Quick Start

```shell
pip install model-garage

# Inspect a model's architecture
garage open gpt2

# Extract attention from layer 6
garage extract gpt2 --layer 6 --component self_attention

# Compare two models for compatible parts
garage compare gpt2 distilgpt2

# Analyze activations across layers
garage analyze gpt2 --prompt "The meaning of life is"
```

See CLI output examples.
Or use the Python API directly:

```python
from model_garage import ModelLoader, ModelRegistry

# Load and decompose
loader = ModelLoader()
model, tokenizer, info = loader.load("gpt2")

registry = ModelRegistry()
spec = registry.register("gpt2", model)

# See all parts
for name, part in spec.parts.items():
    print(f"{name}: {part.part_type.value} [{part.input_dim}→{part.output_dim}]")
```
## What Can You Do?

### Extract

Pull real `nn.Module` components from any supported transformer:

```python
from model_garage.extract.pytorch import PyTorchExtractor

extractor = PyTorchExtractor("microsoft/phi-2")
extractor.load_model()

# Extract attention from layer 12
attn = extractor.extract_component("self_attention", layer_idx=12)

# It's a real module — run it, analyze it, transplant it
output = attn.module(hidden_states)
```
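Under the hood, this kind of extraction rests on ordinary PyTorch module traversal. A minimal standalone sketch of the same idea, using only stock PyTorch (the toy `Block` model here is illustrative, not the library's internals):

```python
import torch
import torch.nn as nn

# A toy two-block "transformer" standing in for a real model.
class Block(nn.Module):
    def __init__(self, dim: int):
        super().__init__()
        self.self_attention = nn.MultiheadAttention(dim, num_heads=2, batch_first=True)
        self.ffn = nn.Linear(dim, dim)
    # forward omitted — we only need the named submodules here

model = nn.ModuleDict({"layers": nn.ModuleList([Block(8) for _ in range(2)])})

# Pull out layer 1's attention by its dotted path. This is a real,
# standalone nn.Module you can run, inspect, or transplant.
attn = model.get_submodule("layers.1.self_attention")

hidden = torch.randn(1, 4, 8)          # (batch, seq, dim)
out, _ = attn(hidden, hidden, hidden)  # self-attention: query = key = value
print(out.shape)                       # torch.Size([1, 4, 8])
```

`nn.Module.get_submodule` resolves the same dotted names you see in `model.named_modules()`, which is why extracted components keep working outside their parent model.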
### Inject

Insert custom processing between any two layers without modifying the model:

```python
from model_garage.inject.layer import LayerInjector

with LayerInjector(model) as injector:
    # Scale activations (reduce by 10%)
    injector.inject_scaling("model.layers.12", scale=0.9)

    # Or inject a custom module
    injector.inject_custom_layer("model.layers.12", my_adapter)

    # Model now uses your injection during forward pass
    output = model(input_ids)

# Injections automatically cleaned up
```
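The `my_adapter` above can be any `nn.Module` whose output matches the hidden-state shape. One common choice is a residual bottleneck adapter; this is a hypothetical sketch, not a class shipped by Model Garage:

```python
import torch
import torch.nn as nn

class BottleneckAdapter(nn.Module):
    """Residual bottleneck adapter: down-project, nonlinearity, up-project.

    The up-projection is zero-initialized, so the adapter starts as an
    exact identity and only changes behavior once trained."""
    def __init__(self, dim: int, bottleneck: int = 16):
        super().__init__()
        self.down = nn.Linear(dim, bottleneck)
        self.up = nn.Linear(bottleneck, dim)
        nn.init.zeros_(self.up.weight)
        nn.init.zeros_(self.up.bias)

    def forward(self, hidden_states: torch.Tensor) -> torch.Tensor:
        return hidden_states + self.up(torch.relu(self.down(hidden_states)))

my_adapter = BottleneckAdapter(dim=768)  # 768 = GPT-2's hidden size
x = torch.randn(1, 5, 768)
assert torch.allclose(my_adapter(x), x)  # identity until trained
```

Zero-initializing the up-projection is a standard trick: the injected module cannot perturb the model's behavior until its weights are deliberately updated.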
### Analyze

Capture and inspect hidden states at any layer:

```python
from model_garage.core.hooks import HookManager
from model_garage.core.tensor import TensorUtils

with HookManager(model) as hooks:
    hooks.register_capture_hook("model.layers.15", hook_name="layer_15")
    model(input_ids)
    data = hooks.get_captured("layer_15")
    stats = TensorUtils.stats(data["output"])
    print(f"Mean: {stats['mean']:.4f}, Sparsity: {stats['sparsity']:.2%}")
```
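This capture pattern can be reproduced with PyTorch's built-in forward hooks, which is roughly the mechanism such a hook manager wraps. A minimal sketch in plain PyTorch (the toy model and the exact stats computed are illustrative assumptions):

```python
import torch
import torch.nn as nn

model = nn.Sequential(nn.Linear(4, 8), nn.ReLU(), nn.Linear(8, 2))
captured = {}

def capture(module, inputs, output):
    # Forward hooks receive (module, inputs, output) after each forward pass.
    captured["layer_1"] = output.detach()

# Hook the ReLU (index 1), run a forward pass, then clean up.
handle = model[1].register_forward_hook(capture)
model(torch.randn(3, 4))
handle.remove()  # always remove hooks when done

out = captured["layer_1"]
stats = {
    "mean": out.mean().item(),
    "sparsity": (out == 0).float().mean().item(),  # fraction of exact zeros
}
print(f"Mean: {stats['mean']:.4f}, Sparsity: {stats['sparsity']:.2%}")
```

Detaching the captured tensor matters in practice: it drops the autograd graph, so long capture sessions don't hold activations alive for backprop.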
### Compose

Register multiple models and find compatible parts for hybrid architectures:

```python
from model_garage.registry.models import ModelRegistry

registry = ModelRegistry()
registry.register("gpt2", model_a)
registry.register("distilgpt2", model_b)

# Find what's swappable
comparison = registry.compare("gpt2", "distilgpt2")
print(comparison["compatible_parts"])
```
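Since each registered part carries an `input_dim` and `output_dim`, compatibility plausibly reduces to matching interface dimensions. A toy illustration of that idea (the part names and dimensions here are made up; this is not the library's actual comparison logic):

```python
# Hypothetical part specs: name -> (input_dim, output_dim)
gpt2_parts = {
    "layer_0.self_attention": (768, 768),
    "layer_0.ffn": (768, 768),
    "embeddings": (50257, 768),
}
distilgpt2_parts = {
    "layer_0.self_attention": (768, 768),
    "layer_0.ffn": (768, 768),
}

# A part is swappable if the other model has a part with the same
# name and identical interface dimensions.
compatible = [
    name for name, dims in gpt2_parts.items()
    if distilgpt2_parts.get(name) == dims
]
print(compatible)  # ['layer_0.self_attention', 'layer_0.ffn']
```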
### Debate

Add multi-perspective reasoning to any model via self-debate chambers:

```python
from model_garage.inject.debate import SelfDebate

# Two perspectives reconciled through learned gating
with SelfDebate(model, layer_idx=6, reconciliation_method="gated"):
    output = model.generate(input_ids, max_new_tokens=50)
```
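"Gated" reconciliation generally means blending two candidate hidden states through a learned sigmoid gate. A hedged sketch of the general technique, not Model Garage's implementation:

```python
import torch
import torch.nn as nn

class GatedReconciler(nn.Module):
    """Blend two candidate hidden states via a learned per-feature gate."""
    def __init__(self, dim: int):
        super().__init__()
        self.gate = nn.Linear(2 * dim, dim)

    def forward(self, a: torch.Tensor, b: torch.Tensor) -> torch.Tensor:
        # g is in (0, 1) per feature, so the output is a convex
        # combination of the two perspectives.
        g = torch.sigmoid(self.gate(torch.cat([a, b], dim=-1)))
        return g * a + (1 - g) * b

rec = GatedReconciler(dim=8)
a, b = torch.randn(1, 4, 8), torch.randn(1, 4, 8)
out = rec(a, b)
print(out.shape)  # torch.Size([1, 4, 8])
```

Because the gate is trainable, the model can learn per-feature which perspective to trust rather than averaging them uniformly.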
## Supported Model Families
| Family | Models | Status |
|---|---|---|
| GPT-2 | gpt2, gpt2-medium, gpt2-large, gpt2-xl, distilgpt2 | Full support |
| Llama | Llama-2-7b, Llama-3-8b, TinyLlama, CodeLlama | Full support |
| Phi | Phi-2, Phi-3-mini, Phi-3.5, Phi-4 | Full support |
| Mistral | Mistral-7B, Mixtral-8x7B | Full support |
| Gemma | Gemma-2b, Gemma-7b, Gemma-2-9b | Full support |
| Qwen | Qwen-1.5, Qwen-2 | Full support |
| BERT | bert-base, bert-large, distilbert | Extraction only |
Adding a new family? See CONTRIBUTING.md.
## Research Papers

Model Garage is backed by three peer-reviewed research papers demonstrating its capabilities on real problems:

**Blades: Compositional Capability Enhancement Through Hidden State Injection**

Hidden state injection between specialized models achieves +14.2% accuracy on medical reasoning tasks. Establishes 7 validated principles for capability transfer, including the N-4 layer rule and same-domain synergy (+27.8%). Uses Model Garage's `extract`, `inject`, and `snapshot` modules.

**Learned Routers Don't Learn: Expert Miscalibration in MoE Models**

Per-layer expert isolation reveals that learned MoE routers show Spearman ρ ≈ 0 between routing probability and expert quality. 207/896 expert-layer-domain combinations show statistically significant specialization (BH FDR, α = 0.05), yet the router ignores it. Uses Model Garage's `analyze` and `core.hooks` modules.

**Sparse Pathways: Domain-Aware Neuron Routing for Efficient Inference**

FFN neurons exhibit strong domain specialization (~50% in late layers), with an r = 0.999 correlation between model scale and degree of specialization. Demonstrates a potential 2-4x compute reduction via negative neuron selection. Uses Model Garage's `analyze` and `snapshot` modules.

See `research/` for the full findings, blade principles, and experiment index.
## CLI Commands

| Command | Description |
|---|---|
| `garage open <model>` | Load and display model architecture card |
| `garage extract <model>` | Extract components (attention, FFN, embeddings) |
| `garage analyze <model>` | Analyze activations, entropy, sparsity |
| `garage compare <a> <b>` | Compare architectures, find compatible parts |
| `garage registry list` | List decomposed models in local registry |
| `garage registry add <model>` | Decompose and register a model |
## Project Structure

```
model-garage/
├── src/model_garage/
│   ├── cli/        # Retro-themed CLI (Typer + Rich)
│   ├── core/       # Model loading, hooks, tensor utils
│   ├── extract/    # Component extraction engine
│   ├── inject/     # Layer injection, blades, debate
│   ├── analyze/    # Activation analysis
│   ├── compose/    # Hybrid model builder
│   ├── registry/   # Component registry & decomposition
│   ├── snapshot/   # Hidden state capture
│   └── models/     # Model family adapters
├── examples/       # Runnable example scripts
├── research/       # Validated findings & principles
├── tests/          # Unit & integration tests
└── docs/           # Documentation
```
## Philosophy

> "People used to be 'Gear Heads' — they could fine-tune actual car engines manually. They understood every part, could diagnose problems by sound, and could build a hot rod from junkyard parts.
>
> Modern AI models are the same kind of machine. They have parts — attention heads, feed-forward networks, embeddings, normalization layers. These parts can be extracted, tested, swapped, and recombined.
>
> Model Garage makes you a neural network gear head."
## Contributing
We especially welcome contributions that add support for new model families. See CONTRIBUTING.md for the guide on writing a new ModelDecomposer.
## License
Apache License 2.0. See LICENSE.