Keras Simple Attention
Project description
Keras Attention Mechanism
Many-to-one attention mechanism for Keras.
Installation via pip
pip install attention
Import in the source code
from attention import Attention
# [...]
m = Sequential([
LSTM(128, input_shape=(seq_length, 1), return_sequences=True),
Attention(), # <--------- here.
Dense(1, activation='linear')
])
Examples
Install the requirements before running the examples: pip install -r requirements.txt
.
IMDB Dataset
In this experiment, we demonstrate that using attention yields a higher accuracy on the IMDB dataset. We consider two LSTM networks: one with this attention layer and the other one with a fully connected layer. Both have the same number of parameters for a fair comparison (250K).
Here are the results on 10 runs. For every run, we record the max accuracy on the test set for 10 epochs.
Measure | No Attention (250K params) | Attention (250K params) |
---|---|---|
MAX Accuracy | 88.22 | 88.76 |
AVG Accuracy | 87.02 | 87.62 |
STDDEV Accuracy | 0.18 | 0.14 |
As expected, there is a boost in accuracy for the model with attention. It also reduces the variability between the runs, which is something nice to have.
Adding two numbers
Let's consider the task of adding two numbers that come right after some delimiters (0 in this case):
x = [1, 2, 3, 0, 4, 5, 6, 0, 7, 8]
. Result is y = 4 + 7 = 11
.
The attention is expected to be the highest after the delimiters. An overview of the training is shown below, where the top represents the attention map and the bottom the ground truth. As the training progresses, the model learns the task and the attention map converges to the ground truth.
Finding max of a sequence
We consider many 1D sequences of the same length. The task is to find the maximum of each sequence.
We give the full sequence processed by the RNN layer to the attention layer. We expect the attention layer to focus on the maximum of each sequence.
After a few epochs, the attention layer converges perfectly to what we expected.
References
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
Built Distribution
Hashes for attention-4.0-py2.py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | bfbc019a700c6cfba3e1f29c590d4cbd243eeb4dcdd19b40e8b9cdb6e76d9cf6 |
|
MD5 | 919a1706cc40ec22269bcbc493b5cce7 |
|
BLAKE2b-256 | cb3fd8b19195a2f5827dcbf0ee6d7e6fe4352f42dcc60693bdb1e431440c8b59 |