Attention mechanism for processing sequential data that considers the context of each time step

License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3

Project description

Keras Self-Attention

Attention mechanism for processing sequential data that considers the context of each time step.

Install

pip install keras-self-attention

Usage

Basic

By default, the attention layer uses additive attention and considers the whole context when calculating relevance. The following code creates an attention layer that follows the additive-attention equations (attention_activation is the activation function applied to e_{t, t'}):

import keras
from keras_self_attention import SeqSelfAttention


model = keras.models.Sequential()
model.add(keras.layers.Embedding(input_dim=10000,
                                 output_dim=300,
                                 mask_zero=True))
model.add(keras.layers.Bidirectional(keras.layers.LSTM(units=128,
                                                       return_sequences=True)))
model.add(SeqSelfAttention(attention_activation='sigmoid'))
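# categorical_crossentropy expects probabilities by default, so apply softmax.
model.add(keras.layers.Dense(units=5, activation='softmax'))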
model.compile(
    optimizer='adam',
    loss='categorical_crossentropy',
    metrics=['categorical_accuracy'],
)
model.summary()
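
For reference, additive self-attention of this kind is usually written as below. This is a sketch of the standard formulation, not a transcription of the layer's source: the symbols W_t, W_x, W_a, b_h and b_a are notation assumed here, x_t is the input at time step t, and \sigma stands for whatever attention_activation is set to (sigmoid in the example above).

```latex
\begin{aligned}
h_{t,t'} &= \tanh\left(x_t^\top W_t + x_{t'}^\top W_x + b_h\right) \\
e_{t,t'} &= \sigma\left(W_a h_{t,t'} + b_a\right) \\
a_{t} &= \operatorname{softmax}\left(e_{t}\right) \\
l_{t} &= \sum_{t'} a_{t,t'} \, x_{t'}
\end{aligned}
```

Here a_{t,t'} is the weight that position t assigns to position t', and l_t is the attended output for position t.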

Local Attention

The global context may be too broad for one piece of data. The parameter attention_width controls the width of the local context:

from keras_self_attention import SeqSelfAttention

SeqSelfAttention(
    attention_width=15,
    attention_activation='sigmoid',
    name='Attention',
)
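
To make the effect of attention_width concrete, the following sketch builds the kind of mask a local window implies. It is an illustration only: the helper names are hypothetical, and the assumption that the window starts at t - width // 2 and spans width positions may not match the layer's exact boundary handling.

```python
def local_window(t, seq_len, width):
    """Return the positions that position t may attend to.

    Assumption: the window starts at t - width // 2 and spans `width`
    positions, clipped to the sequence bounds.
    """
    lower = t - width // 2
    upper = lower + width  # exclusive upper bound
    return [p for p in range(seq_len) if lower <= p < upper]


def local_mask(seq_len, width):
    """Build a seq_len x seq_len 0/1 mask; row t marks the visible positions."""
    return [
        [1 if p in local_window(t, seq_len, width) else 0 for p in range(seq_len)]
        for t in range(seq_len)
    ]


# Each row shows which positions one time step can see.
for row in local_mask(seq_len=5, width=3):
    print(row)
```

Positions outside the window would receive no attention weight, so each output depends only on its neighbours.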

Multiplicative Attention

You can use multiplicative attention by setting attention_type:

import keras
from keras_self_attention import SeqSelfAttention

SeqSelfAttention(
    attention_width=15,
    attention_type=SeqSelfAttention.ATTENTION_TYPE_MUL,
    attention_activation=None,
    kernel_regularizer=keras.regularizers.l2(1e-6),
    use_attention_bias=False,
    name='Attention',
)
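
As a rough illustration of what multiplicative scoring computes, here is a minimal NumPy sketch. It assumes the bilinear form e_{t,t'} = x_t^T W_a x_{t'} + b_a followed by a softmax over t'; the function and variable names are hypothetical, and the layer itself may differ in details such as masking.

```python
import numpy as np


def softmax(scores, axis=-1):
    """Numerically stable softmax along the given axis."""
    shifted = scores - scores.max(axis=axis, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=axis, keepdims=True)


def multiplicative_self_attention(x, w_a, b_a=0.0):
    """Bilinear self-attention over one sequence.

    x:   (seq_len, dim) inputs
    w_a: (dim, dim) weight matrix (hypothetical name)
    b_a: scalar bias; 0.0 mirrors use_attention_bias=False
    """
    scores = x @ w_a @ x.T + b_a        # scores[t, t'] = x_t^T W_a x_t' + b_a
    weights = softmax(scores, axis=-1)  # each row sums to 1
    return weights @ x, weights


rng = np.random.default_rng(0)
x = rng.normal(size=(4, 3))
w_a = rng.normal(size=(3, 3))
outputs, weights = multiplicative_self_attention(x, w_a)
print(outputs.shape, weights.shape)
```

Compared with the additive form, this needs a single weight matrix and no hidden tanh layer.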

Regularizer

To use the regularizer, set attention_regularizer_weight to a positive number:

import keras
from keras_self_attention import SeqSelfAttention

inputs = keras.layers.Input(shape=(None,))
embd = keras.layers.Embedding(input_dim=32,
                              output_dim=16,
                              mask_zero=True)(inputs)
lstm = keras.layers.Bidirectional(keras.layers.LSTM(units=16,
                                                    return_sequences=True))(embd)
att = SeqSelfAttention(attention_type=SeqSelfAttention.ATTENTION_TYPE_MUL,
                       kernel_regularizer=keras.regularizers.l2(1e-4),
                       bias_regularizer=keras.regularizers.l1(1e-4),
                       attention_regularizer_weight=1e-4,
                       name='Attention')(lstm)
dense = keras.layers.Dense(units=5, activation='softmax', name='Dense')(att)
model = keras.models.Model(inputs=inputs, outputs=[dense])
model.compile(
    optimizer='adam',
    loss={'Dense': 'sparse_categorical_crossentropy'},
    metrics={'Dense': 'sparse_categorical_accuracy'},
)
model.summary(line_length=100)
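
One common form of attention regularizer penalizes overlap between the attention distributions of different time steps by pushing A A^T toward the identity. The sketch below assumes that form, scaled by the weight and averaged over the batch; the function name is hypothetical and the layer's exact formula may differ.

```python
import numpy as np


def attention_penalty(attention, weight):
    """Penalty weight * ||A A^T - I||_F^2, averaged over the batch.

    attention: (batch, seq_len, seq_len); each row is an attention distribution.
    """
    batch, seq_len, _ = attention.shape
    gram = attention @ attention.transpose(0, 2, 1)
    eye = np.eye(seq_len)
    return weight * np.sum((gram - eye) ** 2) / batch


focused = np.eye(2)[None, :, :]     # every step attends only to itself
uniform = np.full((1, 2, 2), 0.5)   # every step attends to everything equally
print(attention_penalty(focused, 1e-4))
print(attention_penalty(uniform, 1e-4))
```

Under this assumption, sharply focused and mutually distinct attention rows incur no penalty, while identical rows are penalized.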

Load the Model

Make sure to add SeqSelfAttention to custom objects:

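import keras
from keras_self_attention import SeqSelfAttention

# model_path is the path to a previously saved model file.
keras.models.load_model(model_path, custom_objects=SeqSelfAttention.get_custom_objects())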

History Only

Set history_only to True when only historical data can be used:

from keras_self_attention import SeqSelfAttention

SeqSelfAttention(
    attention_width=3,
    history_only=True,
    name='Attention',
)
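
The difference from a centered window can be sketched in a few lines. The helper below is hypothetical and assumes that, with history_only enabled, position t sees the attention_width most recent positions ending at t itself; the layer's exact boundaries may differ.

```python
def history_window(t, width):
    """Return the positions visible to t when only past context is allowed.

    Assumption: the window covers the `width` most recent positions,
    ending at t and never reaching past the start of the sequence.
    """
    lower = max(0, t - (width - 1))
    return list(range(lower, t + 1))


# No position ever sees an index greater than its own.
for t in range(5):
    print(t, history_window(t, width=3))
```

This kind of causal window is useful when predictions at time t must not depend on future inputs.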

Multi-Head

Please refer to keras-multi-head.

Release history

- 0.51.0 (this version): Jan 22, 2022
- 0.50.0: Jun 15, 2021
- 0.49.0: Dec 30, 2020
- 0.48.0: Dec 30, 2020
- 0.47.0: Jul 29, 2020
- 0.46.0: Jun 6, 2020
- 0.45.0: Jun 5, 2020
- 0.44.0: Jun 3, 2020
- 0.43.0: Jun 2, 2020
- 0.42.0: Sep 6, 2019
- 0.41.0: May 11, 2019
- 0.40.0: Apr 24, 2019
- 0.39.0: Apr 16, 2019
- 0.38.0: Apr 16, 2019
- 0.37.0: Apr 16, 2019
- 0.36.0: Apr 1, 2019
- 0.35.0: Mar 11, 2019
- 0.34.0: Feb 1, 2019
- 0.33.0: Jan 31, 2019
- 0.32.0: Nov 26, 2018
- 0.31.0: Nov 12, 2018
- 0.30.0: Nov 6, 2018
- 0.0.21: Sep 6, 2018
- 0.0.20: Aug 31, 2018
- 0.0.19: Aug 30, 2018
- 0.0.18: Aug 30, 2018
- 0.0.17: Aug 20, 2018
- 0.0.16: Aug 17, 2018
- 0.0.15: Aug 17, 2018
- 0.0.14: Aug 17, 2018
- 0.0.13: Aug 17, 2018
- 0.0.12: Aug 17, 2018
- 0.0.11: Aug 16, 2018
- 0.0.10: Aug 16, 2018
- 0.0.9: Aug 16, 2018
- 0.0.8: Aug 16, 2018

Download files

Source Distribution

keras-self-attention-0.51.0.tar.gz (11.1 kB)

Uploaded Jan 22, 2022 Source

File details

Details for the file keras-self-attention-0.51.0.tar.gz.

File metadata

Download URL: keras-self-attention-0.51.0.tar.gz
Upload date: Jan 22, 2022
Size: 11.1 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.3.0 pkginfo/1.6.1 requests/2.25.1 setuptools/51.0.0.post20201207 requests-toolbelt/0.9.1 tqdm/4.55.0 CPython/3.6.12

File hashes

Hashes for keras-self-attention-0.51.0.tar.gz
Algorithm	Hash digest
SHA256	`77fce72b12d235722cbbcf7da5b3609b89ee212f5f07352945cc088e850900e9`
MD5	`7bc0e7a51eb634705a34b7a7361261d5`
BLAKE2b-256	`d5a50a1d003e420da49791f64def11d8d2837280e1a680c2eaaab216f9f17ed7`
