Create interactive textual heat maps for Jupiter notebooks

These details have not been verified by PyPI

Project description

textualheatmap

Create interactive textual heatmaps for Jupiter notebooks.

I originally published this visualization method in my distill paper https://distill.pub/2019/memorization-in-rnns/. In this context, it is used as a saliency map for showing which parts of a sentence are used to predict the next word. However, the visualization method is more general-purpose than that and can be used for any kind of textual heatmap purposes.

textualheatmap works with python 3.6 or newer and is distributed under the MIT license.

Gif of saliency in RNN models

An end-to-end example of how to use the HuggingFace 🤗 Transformers python module to create a textual saliency map for how each masked token is predicted.

Gif of saliency in BERT models

Install

pip install -U textualheatmap

API

textualheatmap.TextualHeatmap

Examples

Example of sequential-charecter model with metadata visible

from textualheatmap import TextualHeatmap

data = [[
    # GRU data
    {"token":" ",
     "meta":["the","one","of"],
     "heat":[1,0,0,0,0,0,0,0,0]},
    {"token":"c",
     "meta":["can","called","century"],
     "heat":[1,0.22,0,0,0,0,0,0,0]},
    {"token":"o",
     "meta":["country","could","company"],
     "heat":[0.57,0.059,1,0,0,0,0,0,0]},
    {"token":"n",
     "meta":["control","considered","construction"],
     "heat":[1,0.20,0.11,0.84,0,0,0,0,0]},
    {"token":"t",
     "meta":["control","continued","continental"],
     "heat":[0.27,0.17,0.052,0.44,1,0,0,0,0]},
    {"token":"e",
     "meta":["context","content","contested"],
     "heat":[0.17,0.039,0.034,0.22,1,0.53,0,0,0]},
    {"token":"x",
     "meta":["context","contexts","contemporary"],
     "heat":[0.17,0.0044,0.021,0.17,1,0.90,0.48,0,0]},
    {"token":"t",
     "meta":["context","contexts","contentious"],
     "heat":[0.14,0.011,0.034,0.14,0.68,1,0.80,0.86,0]},
    {"token":" ",
     "meta":["of","and","the"],
     "heat":[0.014,0.0063,0.0044,0.011,0.034,0.10,0.32,0.28,1]},
    # ...
],[
    # LSTM data
    # ...
]]

heatmap = TextualHeatmap(
    width = 600,
    show_meta = True,
    facet_titles = ['GRU', 'LSTM']
)
# Set data and render plot, this can be called again to replace
# the data.
heatmap.set_data(data)
# Focus on the token with the given index. Especially useful when
# `interactive=False` is used in `TextualHeatmap`.
heatmap.highlight(159)

Shows saliency with predicted words at metadata

Example of sequential-charecter model without metadata

When show_meta is not True, the meta part of the data object has no effect.

heatmap = TextualHeatmap(
    facet_titles = ['LSTM', 'GRU'],
    rotate_facet_titles = True
)
heatmap.set_data(data)
heatmap.highlight(159)

Shows saliency without metadata

Example of non-sequential-word model

format = True can be set in the data object to inducate tokens that are not directly used by the model. This is useful if word or sub-word tokenization is used.

data = [[
{'token': '[CLR]',
 'meta': ['', '', ''],
 'heat': [1, 0, 0, 0, 0, ...]},
{'token': ' ',
 'format': True},
{'token': 'context',
 'meta': ['today', 'and', 'thus'],
 'heat': [0.13, 0.40, 0.23, 1.0, 0.56, ...]},
{'token': ' ',
 'format': True},
{'token': 'the',
 'meta': ['##ual', 'the', '##ually'],
 'heat': [0.11, 1.0, 0.34, 0.58, 0.59, ...]},
{'token': ' ',
 'format': True},
{'token': 'formal',
 'meta': ['formal', 'academic', 'systematic'],
 'heat': [0.13, 0.74, 0.26, 0.35, 1.0, ...]},
{'token': ' ',
 'format': True},
{'token': 'study',
 'meta': ['##ization', 'study', '##ity'],
 'heat': [0.09, 0.27, 0.19, 1.0, 0.26, ...]}
]]

heatmap = TextualHeatmap(facet_titles = ['BERT'], show_meta=True)
heatmap.set_data(data)

Shows saliency in a BERT model, using sub-word tokenization

Citation

If you use this in a publication, please cite my Distill publication where I first demonstrated this visualization method.

@article{madsen2019visualizing,
  author = {Madsen, Andreas},
  title = {Visualizing memorization in RNNs},
  journal = {Distill},
  year = {2019},
  note = {https://distill.pub/2019/memorization-in-rnns},
  doi = {10.23915/distill.00016}
}

Sponsor

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

1.2.0

May 30, 2024

1.1.1

Mar 25, 2020

1.1.0

Mar 25, 2020

1.0.2

Mar 25, 2020

1.0.1

Mar 21, 2020

1.0.0

Mar 20, 2020

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

textualheatmap-1.2.0.tar.gz (11.1 kB view details)

Uploaded May 30, 2024 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

textualheatmap-1.2.0-py3-none-any.whl (10.1 kB view details)

Uploaded May 30, 2024 Python 3

File details

Details for the file textualheatmap-1.2.0.tar.gz.

File metadata

Download URL: textualheatmap-1.2.0.tar.gz
Upload date: May 30, 2024
Size: 11.1 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/5.1.0 CPython/3.11.0

File hashes

Hashes for textualheatmap-1.2.0.tar.gz
Algorithm	Hash digest
SHA256	`0e7b24f8b8815db1690fe8b29525f86f38738c950df2b000b8001e9e2565f457`
MD5	`9400d0bf158429b611c2d6d930330716`
BLAKE2b-256	`240aa2e90d14891f84c1e6e8e70e0b0ba5866523e9ae3350accfc89d53ff9171`

See more details on using hashes here.

File details

Details for the file textualheatmap-1.2.0-py3-none-any.whl.

File metadata

Download URL: textualheatmap-1.2.0-py3-none-any.whl
Upload date: May 30, 2024
Size: 10.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/5.1.0 CPython/3.11.0

File hashes

Hashes for textualheatmap-1.2.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`27beaaaa36e84d7261f7c0bc0588fc90d971e86a9df89e7af8f1414dbdbe2a9c`
MD5	`9fe04ae723f072bc14e2f92ca7a60be9`
BLAKE2b-256	`178b6a1b2fb7b3aec829c677039eab566d67a020edb2475ad4d65d26101ea673`

See more details on using hashes here.

textualheatmap 1.2.0

Navigation

Verified details

Maintainers

Unverified details

Meta

Classifiers

Project description

textualheatmap

Install

API

Examples

Example of sequential-charecter model with metadata visible

Example of sequential-charecter model without metadata

Example of non-sequential-word model

Citation

Sponsor

Project details

Verified details

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes