A package of useful functions to analyze transformer based language models.

These details have not been verified by PyPI

Project links

Project description

minicons: Enabling Flexible Behavioral and Representational Analyses of Transformer Language Models

This repo is a wrapper around the transformers library from Hugging Face :hugs:

Installation

Install from Pypi using:

pip install minicons

Supported Functionality

Extract word representations from Contextualized Word Embeddings
Score sequences using language model scoring techniques, including masked language models following Salazar et al. (2020), and state space models (such as Mamba).
Score sequences using VLM models (see below)
Do scoring in a quantized, multi-gpu setting.

Examples

Extract word representations from contextualized word embeddings:

from minicons import cwe

model = cwe.CWE('bert-base-uncased')

context_words = [("I went to the bank to withdraw money.", "bank"), 
                 ("i was at the bank of the river ganga!", "bank")]

print(model.extract_representation(context_words, layer = 12))

''' 
tensor([[ 0.5399, -0.2461, -0.0968,  ..., -0.4670, -0.5312, -0.0549],
        [-0.8258, -0.4308,  0.2744,  ..., -0.5987, -0.6984,  0.2087]],
       grad_fn=<MeanBackward1>)
'''

# if model is seq2seq:
model = cwe.EncDecCWE('t5-small')

print(model.extract_representation(context_words))

'''(last layer, by default)
tensor([[-0.0895,  0.0758,  0.0753,  ...,  0.0130, -0.1093, -0.2354],
        [-0.0695,  0.1142,  0.0803,  ...,  0.0807, -0.1139, -0.2888]])
'''

Compute sentence acceptability measures (surprisals) using Language Models:

from minicons import scorer

mlm_model = scorer.MaskedLMScorer('bert-base-uncased', 'cpu')
ilm_model = scorer.IncrementalLMScorer('distilgpt2', 'cpu')

stimuli = ["The keys to the cabinet are on the table.",
           "The keys to the cabinet is on the table."]

# use sequence_score with different reduction options: 
# Sequence Surprisal - lambda x: -x.sum(0).item()
# Sequence Log-probability - lambda x: x.sum(0).item()
# Sequence Surprisal, normalized by number of tokens - lambda x: -x.mean(0).item()
# Sequence Log-probability, normalized by number of tokens - lambda x: x.mean(0).item()
# and so on...

print(ilm_model.sequence_score(stimuli, reduction = lambda x: -x.sum(0).item()))

'''
[39.879737854003906, 42.75846481323242]
'''

# MLM scoring, inspired by Salazar et al., 2020
print(mlm_model.sequence_score(stimuli, reduction = lambda x: -x.sum(0).item()))
'''
[13.962685585021973, 23.415111541748047]
'''

Computing conditional sequence scoring using LMs

s2s_model = scorer.Seq2SeqScorer('t5-base', 'cpu')

# sequence scoring for batch of input, output, by default = logprobs, can change to other quantities as needed (see minicons readme)
s2s_model.conditional_score(["What is the capital of France?", "What is the capital of France?"], ["Paris.", "Lyon."]) # the same thing works with ilm_model and mlm_model as well

'''OUTPUT:
[-6.089522838592529, -8.20227336883545]
''' 

# Token-wise score of the output queries: -- <pad> token is given a score of 0.0, pass rank=True to also give token ranks
s2s_model.conditional_token_score(["What is the capital of France?", "What is the capital of France?"], ["Paris.", "Lyon."], rank=True) 

'''OUTPUT:
[[('<pad>', 0.0, 0),
  ('Paris', -7.5618486404418945, 168),
  ('.', -4.617197036743164, 11)],
 [('<pad>', 0.0, 0),
  ('Lyon', -12.044157981872559, 3459),
  ('.', -4.36038875579834, 8)]]
'''

A better version of MLM Scoring by Kauf and Ivanova

This version leverages a locally-autoregressive scoring strategy to avoid the overestimation of probabilities of tokens in multi-token words (e.g., "ostrich" -> "ostr" + "#ich"). In particular, tokens probabilities are estimated using the bidirectional context, excluding any future tokens that belong to the same word as the current target token.

For more details, refer to Kauf and Ivanova, 2023

from minicons import scorer
mlm_model = scorer.MaskedLMScorer('bert-base-uncased', 'cpu')

stimuli = ['The traveler lost the souvenir.']

# un-normalized sequence score
print(mlm_model.sequence_score(stimuli, reduction = lambda x: -x.sum(0).item(), PLL_metric='within_word_l2r'))
'''
[32.77983617782593]
'''

# original metric, for comparison:
print(mlm_model.sequence_score(stimuli, reduction = lambda x: -x.sum(0).item(), PLL_metric='original'))
'''
[18.014726161956787]
'''

print(mlm_model.token_score(stimuli, PLL_metric='within_word_l2r'))
'''
[[('the', -0.07324600219726562), ('traveler', -9.668401718139648), ('lost', -6.955361366271973),
('the', -1.1923179626464844), ('so', -7.776356220245361), ('##uven', -6.989711761474609),
('##ir', -0.037807464599609375), ('.', -0.08663368225097656)]]
'''

# original values, for comparison (notice the 'souvenir' tokens):

print(mlm_model.token_score(stimuli, PLL_metric='original'))
'''
[[('the', -0.07324600219726562), ('traveler', -9.668402671813965), ('lost', -6.955359935760498), ('the', -1.192317008972168), ('so', -3.0517578125e-05), ('##uven', -0.0009250640869140625), ('##ir', -0.03780937194824219), ('.', -0.08663558959960938)]]
'''

NEW: Vision-Language Model (VLM) Scoring

Minicons now supports VLM scoring! The following code demonstrates how one can extract log-probs of caption/descriptions from Salesforce's BLIP-2 model, conditioned on a batch of images:

from minicons import scorer
from PIL import Image

# top image
penguin = Image.open('penguin.jpg')

# bottom image
cardinal = Image.open('cardinal.jpg')

lm = scorer.VLMScorer(
  "Salesforce/blip2-opt-2.7b", 
  device="cuda:0"
)

lm.sequence_score(
  text_batch=["This bird can fly."] * 2, 
  image_batch=[penguin, cardinal]
)

#> logprobs of penguin vs cardinal -> can fly
#> [-5.644123077392578, -5.129026889801025]

OpenAI API

[!CAUTION] THIS IS NOW DEPRECATED BECAUSE OPEN-AI NO LONGER MAKES INPUT LOGPROBS AVAILABLE!**

Some models on the OpenAI API also allow for querying of log-probs (for now), and minicons now (as of Sept 29) also supports it! Here's how:

First, make sure you save your OpenAI API Key in some file (say ~/.openaikey). Register the key using:

from minicons import openai as mo

PATH = "/path/to/apikey"
mo.register_api_key(PATH)

Then,

from minicons import openai as mo

stimuli = ["the keys to the cabinet are", "the keys to the cabinet is"]

# we want to test if p(are | prefix) > p(is | prefix)
model = "gpt-3.5-turbo-instruct"
query = mo.OpenAIQuery(model, stimuli)

# run query using the above batch
query.query()

# get conditional log-probs for are and is given prior context:
query.conditional_score(["are", "is"])

#> [-2.5472614765167236, -5.633198261260986] SUCCESS!

# NOTE: this will not be 100% reproducible since it seems OpenAI adds a little noise to its outputs.
# see https://twitter.com/xuanalogue/status/1653280462935146496

Tutorials

Recent Updates

November 6, 2021: MLM scoring has been fixed! You can now use model.token_score() and model.sequence_score() with MaskedLMScorers as well!
June 4, 2022: Added support for Seq2seq models. Thanks to Aaron Mueller 🥳
June 13, 2023: Added support for within_word_l2r, a better way to do MLM scoring, thanks to Carina Kauf (https://github.com/carina-kauf) 🥳
January, 2024: minicons now supports mamba!

Citation

If you use minicons, please cite the following paper:

@article{misra2022minicons,
    title={minicons: Enabling Flexible Behavioral and Representational Analyses of Transformer Language Models},
    author={Kanishka Misra},
    journal={arXiv preprint arXiv:2203.13112},
    year={2022}
}

If you use Kauf and Ivanova's PLL scoring technique, please additionally also cite the following paper:

@inproceedings{kauf2023better,
  title={A Better Way to Do Masked Language Model Scoring},
  author={Kauf, Carina and Ivanova, Anna},
  booktitle={Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)},
  year={2023}
}

Famous users of minicons:

A non-exhaustive but fun list of ppl:

Adele Goldberg
Chris Potts
Najoung Kim
Forrest Davis
Marten van Schijndel
Valentina Pyatkin
Aaron Mueller
Sanghee Kim
Venkata Govindarajan
Kyle Mahowald
Carina Kauf

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.3.37

May 8, 2026

0.3.36

Feb 26, 2026

0.3.35

Jan 30, 2026

0.3.34

Dec 22, 2025

0.3.33

Dec 22, 2025

0.3.32

Jun 26, 2025

0.3.31

May 3, 2025

0.3.30

Apr 19, 2025

0.3.29

Apr 19, 2025

0.3.28

Apr 15, 2025

0.3.27

Apr 15, 2025

0.3.26

Apr 14, 2025

0.3.25

Apr 8, 2025

0.3.24

Apr 2, 2025

0.3.23

Apr 2, 2025

0.3.22

Mar 27, 2025

0.3.21

Mar 16, 2025

0.3.20

Mar 16, 2025

0.3.19

Mar 14, 2025

0.3.18

Mar 12, 2025

0.3.17

Mar 12, 2025

0.3.16

Mar 12, 2025

0.3.15

Mar 12, 2025

0.3.14

Feb 20, 2025

0.3.13

Feb 1, 2025

0.3.12

Feb 1, 2025

0.3.11

Jan 23, 2025

0.3.10

Jan 23, 2025

0.3.9

Jan 7, 2025

0.3.8

Nov 30, 2024

0.3.7

Nov 26, 2024

0.3.6

Nov 25, 2024

0.3.5

Nov 25, 2024

0.3.4

Nov 25, 2024

0.3.3

Nov 25, 2024

0.3.2

Nov 25, 2024

0.3.1

Nov 25, 2024

0.3.0

Nov 25, 2024

0.2.50

Oct 6, 2024

0.2.49

Sep 6, 2024

0.2.48

Aug 29, 2024

0.2.47

Aug 12, 2024

0.2.46

Aug 12, 2024

0.2.45

Jul 10, 2024

0.2.44

Apr 24, 2024

0.2.43

Apr 24, 2024

0.2.42

Apr 3, 2024

0.2.41

Mar 21, 2024

0.2.40

Mar 21, 2024

0.2.39

Mar 21, 2024

0.2.38

Feb 25, 2024

0.2.37

Feb 21, 2024

0.2.36

Feb 21, 2024

0.2.35

Feb 21, 2024

0.2.34

Feb 20, 2024

0.2.33

Jan 26, 2024

0.2.32

Jan 26, 2024

0.2.31

Jan 26, 2024

0.2.30

Jan 8, 2024

0.2.29

Jan 8, 2024

0.2.27

Dec 9, 2023

0.2.26

Nov 17, 2023

0.2.25

Nov 15, 2023

0.2.24

Nov 15, 2023

0.2.23

Nov 14, 2023

0.2.22

Nov 12, 2023

0.2.21

Nov 8, 2023

0.2.20

Oct 30, 2023

0.2.19

Sep 30, 2023

0.2.18

Jul 25, 2023

0.2.17

Jun 13, 2023

0.2.16

Jun 1, 2023

0.2.15

Jun 1, 2023

0.2.14

Mar 26, 2023

0.2.13

Mar 26, 2023

0.2.12

Mar 26, 2023

0.2.11

Mar 26, 2023

0.2.10

Mar 21, 2023

0.2.9

Oct 19, 2022

0.2.8

Oct 19, 2022

0.2.7

Oct 19, 2022

0.2.5

Jul 1, 2022

0.2.4

Jun 5, 2022

0.2.3

Jan 20, 2022

0.2.2

Jan 20, 2022

0.2.1

Jan 20, 2022

0.2.0

Jan 16, 2022

0.1.19

Dec 15, 2021

0.1.18

Dec 5, 2021

0.1.17

Dec 5, 2021

0.1.16

Nov 6, 2021

0.1.15

Nov 5, 2021

0.1.14

Nov 5, 2021

0.1.13

Nov 5, 2021

0.1.12

Oct 21, 2021

0.1.11

Sep 24, 2021

0.1.10

Aug 21, 2021

0.1.9

Aug 21, 2021

0.1.8

Aug 8, 2021

0.1.7

May 6, 2021

0.1.6

May 5, 2021

0.1.5

Apr 9, 2021

0.1.4

Apr 9, 2021

0.1.3

Mar 18, 2021

0.1.1

Mar 18, 2021

0.1.0

Mar 18, 2021

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

minicons-0.3.37.tar.gz (43.0 kB view details)

Uploaded May 8, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

minicons-0.3.37-py3-none-any.whl (41.7 kB view details)

Uploaded May 8, 2026 Python 3

File details

Details for the file minicons-0.3.37.tar.gz.

File metadata

Download URL: minicons-0.3.37.tar.gz
Upload date: May 8, 2026
Size: 43.0 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: poetry/1.5.1 CPython/3.10.10 Darwin/25.3.0

File hashes

Hashes for minicons-0.3.37.tar.gz
Algorithm	Hash digest
SHA256	`605b9d2cc85e88b87adf01dfdf9286faf47e5e8601ee264704958a052a6a02cf`
MD5	`08a6f77aa8226102ff082564c625bb89`
BLAKE2b-256	`a0d16e558921141a979fe2a513b2cea3d213b7269f242447633d56d5e1ad5152`

See more details on using hashes here.

File details

Details for the file minicons-0.3.37-py3-none-any.whl.

File metadata

Download URL: minicons-0.3.37-py3-none-any.whl
Upload date: May 8, 2026
Size: 41.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: poetry/1.5.1 CPython/3.10.10 Darwin/25.3.0

File hashes

Hashes for minicons-0.3.37-py3-none-any.whl
Algorithm	Hash digest
SHA256	`e4086b6163cf554355a680388598c345f6295f7198251110f4d267a21fb4c217`
MD5	`d1d9c7cf255c286d3e8c345ecc49c77c`
BLAKE2b-256	`e8ad2ed6ed8c321e9b731e0366d666c9a30bb3721b83fd412bded789389f06cf`

See more details on using hashes here.

minicons 0.3.37

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

minicons: Enabling Flexible Behavioral and Representational Analyses of Transformer Language Models

Installation

Supported Functionality

Examples

A better version of MLM Scoring by Kauf and Ivanova

NEW: Vision-Language Model (VLM) Scoring

OpenAI API

Tutorials

Recent Updates

Citation

Famous users of minicons:

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes