vLLM-Lens

vLLM plugin for interacting with activations during inference.

vLLM-Lens enables top-down interpretability (e.g., probes, steering, activation oracles). It offers high performance, supporting tensor parallelism & pipeline parallelism (across GPUs and nodes) out of the box. You can also apply all these techniques concurrently (in the same dynamic batch) - removing the need to switch between model instances.

Note that this performance comes at the expense of flexibility - for example, you would need to edit the source to add additional custom hooks (though it should be easy enough for coding agents to do that). For more flexibility out of the box, consider nnsight or TransformerLens.

The module auto-registers as a vLLM general plugin and an Inspect model provider on install. Interact with model internals per-call via SamplingParams.extra_args (vLLM) or GenerateConfig.extra_body (Inspect).
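
For example, with offline vLLM the plugin's arguments travel inside SamplingParams.extra_args. This is a minimal sketch - the model and layer indices are illustrative, and reading back the captured activations is covered in the examples/ folder:

from vllm import LLM, SamplingParams

llm = LLM(model="meta-llama/Llama-3.2-1B")  # illustrative model choice
params = SamplingParams(
    temperature=0.0,
    max_tokens=1,
    # vllm-lens reads its plugin-specific arguments from extra_args
    extra_args={"output_residual_stream": [4, 8, 12]},
)
outputs = llm.generate(["The capital of France is"], params)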

Install

uv add vllm-lens

Examples

These examples use the Inspect integration. See the examples/ folder for offline and online direct vLLM usage.

Inspect AI provider

An Inspect AI model provider is auto-registered as vllm-lens when you install this package. This model provider extends the built-in vLLM provider to serialize torch.Tensor steering vectors for HTTP transport and to decode base64-encoded activations from responses into ModelOutput.metadata["activations"]. It also supports LoRA adapters.

Usage is the same as the default vLLM provider but with the vllm-lens prefix (e.g. vllm-lens/meta-llama/Llama-3.2-1B).
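
For instance (the model name is illustrative):

from inspect_ai.model import get_model

model = get_model("vllm-lens/meta-llama/Llama-3.2-1B")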

Extracting activations

from inspect_ai.model import GenerateConfig

# Layers whose residual-stream activations to capture (illustrative choice)
extraction_activation_layers = [4, 8, 12]

capture_config = GenerateConfig(
    temperature=0.0,
    max_tokens=1,
    extra_body={
        # Ask vllm-lens to return residual-stream activations for these layers
        "extra_args": {"output_residual_stream": extraction_activation_layers},
        "chat_template_kwargs": {"enable_thinking": False},
    },
)
# state.messages comes from the surrounding Inspect solver/task
output = await model.generate(state.messages, config=capture_config)
residual_stream = output.metadata["activations"]["residual_stream"]
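
The exact layout of the returned activations isn't documented here; assuming each requested layer index maps to a decoded tensor, a quick sanity check might be:

for layer, acts in residual_stream.items():  # assumed: layer index -> tensor
    print(layer, tuple(acts.shape))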

Steering with an Activation Oracle

from inspect_ai.model import ChatMessageUser, GenerateConfig
from vllm_lens import SteeringVector

# Assumes act_vector (a torch.Tensor steering direction), oracle_content,
# injection_layer, steering_coefficient, special_pos and lora_path are
# defined elsewhere - see below for one plausible way to build act_vector
messages = [ChatMessageUser(content=oracle_content)]
oracle_config = GenerateConfig(
    temperature=0.0,
    max_tokens=50,
    extra_body={
        "extra_args": {
            "apply_steering_vectors": [
                SteeringVector(
                    activations=act_vector,
                    layer_indices=[injection_layer],
                    scale=steering_coefficient,
                    norm_match=True,
                    position_indices=[special_pos],
                )
            ],
        },
        # Run the oracle through a LoRA adapter
        "lora_request": {
            "lora_name": "oracle",
            "lora_int_id": 1,
            "lora_path": lora_path,
        },
        "chat_template_kwargs": {"enable_thinking": False},
    },
)
response = await model.generate(messages, config=oracle_config)
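
One plausible way to build act_vector is to reuse the capture from the previous example. This sketch assumes residual_stream maps a layer index to a [num_tokens, hidden_size] tensor, which this page doesn't spell out:

injection_layer = 8  # illustrative layer choice
# Take the activation at the final prompt token as the steering direction
act_vector = residual_stream[injection_layer][-1]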

Theory

vllm-lens registers as a vLLM plugin and injects itself into vLLM's processing pipeline at three broad stages:

  1. Intercepting generate calls. To use the plugin, you pass extra args such as output_residual_stream or apply_steering_vectors in the sampling parameters. The plugin extracts these, initialises the relevant PyTorch hooks if they are not already set up (by adding a worker extension) and sends steering vectors directly to workers (vLLM typically has one worker per GPU).
  2. Per-sample hook operations. vLLM dynamically batches tokens from multiple concurrent requests into a single forward pass, so a core challenge is book-keeping - working out which operations (e.g., activation extraction) should be applied to which parts of the batch. To do this we read the forward_context metadata, using query_start_loc (a tensor of token boundaries per request) and req_ids (mapping batch index to request ID). We then apply hooks to just the slices that correspond to each request (see the sketch after this list). Any extracted activations are moved to CPU RAM and losslessly compressed, ready to be requested by the vLLM scheduler process. Steering runs on all tensor-parallel ranks (since it modifies the forward pass), but capture only runs on TP rank 0 (residual streams are identical across TP replicas after all-reduce).
  3. Response collation. The plugin intercepts the response before it is sent to the client, at which point it queries the relevant vLLM processes for any requested activations. It trims surplus activations, just as vLLM does under the hood with tokens (the scheduler often runs ahead of the number of tokens it needs to generate before stopping). Activations are then returned to the client.
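
A minimal sketch of that stage-2 bookkeeping (the helper below is illustrative rather than the plugin's actual API - in the real plugin these fields are read from vLLM's forward context inside a layer hook):

import torch

def slice_for_request(
    hidden_states: torch.Tensor,    # [total_tokens, hidden_size] for the dynamic batch
    query_start_loc: torch.Tensor,  # [num_requests + 1] cumulative token boundaries
    req_ids: list[str],             # batch index -> request ID
    target_req_id: str,
) -> torch.Tensor | None:
    """Return the rows of hidden_states that belong to target_req_id."""
    starts = query_start_loc.tolist()
    for i, req_id in enumerate(req_ids):
        if req_id == target_req_id:
            # Request i's tokens occupy rows [starts[i], starts[i + 1])
            return hidden_states[starts[i] : starts[i + 1]]
    return None  # request not scheduled in this batch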

Credits

Developed by Alan Cooney, with credit going to Sid Black for the original vLLM worker extension idea.
