A package of useful functions to analyze transformer based language models.

These details have not been verified by PyPI

Project links

Project description

minicons

Helper functions for analyzing Transformer based representations of language

This repo is a wrapper around the transformers library from hugging face :hugs:

Installation

Install from Pypi using:

pip install minicons

Supported Functionality

Extract word representations from Contextualized Word Embeddings
Score sequences using language model scoring techniques, including masked language models following Salazar et al. (2020).

Examples

Extract word representations from contextualized word embeddings:

from minicons import cwe

model = cwe.CWE('bert-base-uncased')

context_words = [("I went to the bank to withdraw money.", "bank"), 
                 ("i was at the bank of the river ganga!", "bank")]

print(model.extract_representation(context_words, layer = 12))

''' 
tensor([[ 0.5399, -0.2461, -0.0968,  ..., -0.4670, -0.5312, -0.0549],
        [-0.8258, -0.4308,  0.2744,  ..., -0.5987, -0.6984,  0.2087]],
       grad_fn=<MeanBackward1>)
'''

Compute sentence acceptability measures (surprisals) using Word Prediction Models:

from minicons import scorer

mlm_model = scorer.MaskedLMScorer('bert-base-uncased', 'cpu')
ilm_model = scorer.IncrementalLMScorer('distilgpt2', 'cpu')

stimuli = ["The keys to the cabinet are on the table.",
           "The keys to the cabinet is on the table."]

# use sequence_score with different reduction options: 
# Sequence Surprisal - lambda x: -x.sum(1)
# Sequence Log-probability - lambda x: x.sum(1)
# Sequence Surprisal, normalized by number of tokens - lambda x: -x.mean(1)
# Sequence Log-probability, normalized by number of tokens - lambda x: x.mean(1)
# and so on...

print(ilm_model.sequence_score(stimuli, reduction = lambda x: -x.sum(0).item()))

'''
[39.879737854003906, 42.75846481323242]
'''

# MLM scoring, inspired by Salazar et al., 2020
print(mlm_model.sequence_score(stimuli, reduction = lambda x: -x.sum(0).item()))
'''
[13.962685585021973, 23.415111541748047]
'''

Tutorials

Recent Updates

November 6, 2021: MLM scoring has been fixed! You can now use model.token_score() and model.sequence_score() with MaskedLMScorers as well!

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.3.32

Jun 26, 2025

0.3.31

May 3, 2025

0.3.30

Apr 19, 2025

0.3.29

Apr 19, 2025

0.3.28

Apr 15, 2025

0.3.27

Apr 15, 2025

0.3.26

Apr 14, 2025

0.3.25

Apr 8, 2025

0.3.24

Apr 2, 2025

0.3.23

Apr 2, 2025

0.3.22

Mar 27, 2025

0.3.21

Mar 16, 2025

0.3.20

Mar 16, 2025

0.3.19

Mar 14, 2025

0.3.18

Mar 12, 2025

0.3.17

Mar 12, 2025

0.3.16

Mar 12, 2025

0.3.15

Mar 12, 2025

0.3.14

Feb 20, 2025

0.3.13

Feb 1, 2025

0.3.12

Feb 1, 2025

0.3.11

Jan 23, 2025

0.3.10

Jan 23, 2025

0.3.9

Jan 7, 2025

0.3.8

Nov 30, 2024

0.3.7

Nov 26, 2024

0.3.6

Nov 25, 2024

0.3.5

Nov 25, 2024

0.3.4

Nov 25, 2024

0.3.3

Nov 25, 2024

0.3.2

Nov 25, 2024

0.3.1

Nov 25, 2024

0.3.0

Nov 25, 2024

0.2.50

Oct 6, 2024

0.2.49

Sep 6, 2024

0.2.48

Aug 29, 2024

0.2.47

Aug 12, 2024

0.2.46

Aug 12, 2024

0.2.45

Jul 10, 2024

0.2.44

Apr 24, 2024

0.2.43

Apr 24, 2024

0.2.42

Apr 3, 2024

0.2.41

Mar 21, 2024

0.2.40

Mar 21, 2024

0.2.39

Mar 21, 2024

0.2.38

Feb 25, 2024

0.2.37

Feb 21, 2024

0.2.36

Feb 21, 2024

0.2.35

Feb 21, 2024

0.2.34

Feb 20, 2024

0.2.33

Jan 26, 2024

0.2.32

Jan 26, 2024

0.2.31

Jan 26, 2024

0.2.30

Jan 8, 2024

0.2.29

Jan 8, 2024

0.2.27

Dec 9, 2023

0.2.26

Nov 17, 2023

0.2.25

Nov 15, 2023

0.2.24

Nov 15, 2023

0.2.23

Nov 14, 2023

0.2.22

Nov 12, 2023

0.2.21

Nov 8, 2023

0.2.20

Oct 30, 2023

0.2.19

Sep 30, 2023

0.2.18

Jul 25, 2023

0.2.17

Jun 13, 2023

0.2.16

Jun 1, 2023

0.2.15

Jun 1, 2023

0.2.14

Mar 26, 2023

0.2.13

Mar 26, 2023

0.2.12

Mar 26, 2023

0.2.11

Mar 26, 2023

0.2.10

Mar 21, 2023

0.2.9

Oct 19, 2022

0.2.8

Oct 19, 2022

0.2.7

Oct 19, 2022

0.2.5

Jul 1, 2022

0.2.4

Jun 5, 2022

0.2.3

Jan 20, 2022

0.2.2

Jan 20, 2022

This version

0.2.1

Jan 20, 2022

0.2.0

Jan 16, 2022

0.1.19

Dec 15, 2021

0.1.18

Dec 5, 2021

0.1.17

Dec 5, 2021

0.1.16

Nov 6, 2021

0.1.15

Nov 5, 2021

0.1.14

Nov 5, 2021

0.1.13

Nov 5, 2021

0.1.12

Oct 21, 2021

0.1.11

Sep 24, 2021

0.1.10

Aug 21, 2021

0.1.9

Aug 21, 2021

0.1.8

Aug 8, 2021

0.1.7

May 6, 2021

0.1.6

May 5, 2021

0.1.5

Apr 9, 2021

0.1.4

Apr 9, 2021

0.1.3

Mar 18, 2021

0.1.1

Mar 18, 2021

0.1.0

Mar 18, 2021

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

minicons-0.2.1.tar.gz (19.3 kB view details)

Uploaded Jan 20, 2022 Source

Built Distribution

minicons-0.2.1-py3-none-any.whl (20.5 kB view details)

Uploaded Jan 20, 2022 Python 3

File details

Details for the file minicons-0.2.1.tar.gz.

File metadata

Download URL: minicons-0.2.1.tar.gz
Upload date: Jan 20, 2022
Size: 19.3 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: poetry/1.1.11 CPython/3.8.5 Darwin/21.2.0

File hashes

Hashes for minicons-0.2.1.tar.gz
Algorithm	Hash digest
SHA256	`e207314b20673c25835425c1a02eee0e1c86675ebf78d5b828602d3b858df9bd`
MD5	`fd39dab8c8dbff1fc54b40af6c490c81`
BLAKE2b-256	`e2d0eafdb90354dac94b74e8bfc112c84930b54b8a3e25ce871385c5d55b122f`

See more details on using hashes here.

File details

Details for the file minicons-0.2.1-py3-none-any.whl.

File metadata

Download URL: minicons-0.2.1-py3-none-any.whl
Upload date: Jan 20, 2022
Size: 20.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: poetry/1.1.11 CPython/3.8.5 Darwin/21.2.0

File hashes

Hashes for minicons-0.2.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`88117c0f6b605081b547810252f022f22607d5f037df547db63b98cecfe28840`
MD5	`8b21bfe7e197333fd1d802dda7be16f2`
BLAKE2b-256	`221431a1ecbbaa60258370d2ca5a7a8f9d3b2b5fc2a2429e976a4b2ce3fba5a1`

See more details on using hashes here.

minicons 0.2.1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

minicons

Installation

Supported Functionality

Examples

Tutorials

Recent Updates

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes