A wrapper layer for stacking layers horizontally

These details have not been verified by PyPI

Project links

Homepage

License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3

Project description

Keras Multi-Head

License

A wrapper layer for stacking layers horizontally.

Install

pip install keras-multi-head

Usage

Duplicate Layers

The layer will be duplicated if only a single layer is provided. The layer_num argument controls how many layers will be duplicated eventually.

from tensorflow import keras
from keras_multi_head import MultiHead


model = keras.models.Sequential()
model.add(keras.layers.Embedding(input_dim=100, output_dim=20, name='Embedding'))
model.add(MultiHead(keras.layers.LSTM(units=32), layer_num=5, name='Multi-LSTMs'))
model.add(keras.layers.Flatten(name='Flatten'))
model.add(keras.layers.Dense(units=4, activation='softmax', name='Dense'))
model.build()
model.summary()

Use Multiple-Layers

The first argument could also be a list of layers with different configurations, however, they must have the same output shapes.

from tensorflow import keras
from keras_multi_head import MultiHead


model = keras.models.Sequential()
model.add(keras.layers.Embedding(input_dim=100, output_dim=20, name='Embedding'))
model.add(MultiHead([
    keras.layers.Conv1D(filters=32, kernel_size=3, padding='same'),
    keras.layers.Conv1D(filters=32, kernel_size=5, padding='same'),
    keras.layers.Conv1D(filters=32, kernel_size=7, padding='same'),
], name='Multi-CNNs'))
model.build()
model.summary()

Linear Transformation

The input data will be mapped to different values of the same shape for each layer when hidden_dim is given.

Regularization

The regularization is used when you expect to extract different features from the parallel layers. You can customize the indices of weights in the layers, the intervals represent the parts of the weights and the factor of the regularization.

For example, the bidirectional LSTM layer has 6 weights by default, and the first 3s belong to the forward layer. The 2nd weight (recurrent kernel) in the forward layer controls the computation of gates for recurrent connections. The kernel for computing cell states lays in units x 2 to units x 3 of the recurrent kernel. We can used the regularization for the kernels:

from tensorflow import keras
from keras_multi_head import MultiHead


model = keras.models.Sequential()
model.add(keras.layers.Embedding(input_dim=5, output_dim=3, name='Embed'))
model.add(MultiHead(
    layer=keras.layers.Bidirectional(keras.layers.LSTM(units=16), name='LSTM'),
    layer_num=5,
    reg_index=[1, 4],
    reg_slice=(slice(None, None), slice(32, 48)),
    reg_factor=0.1,
    name='Multi-Head-Attention',
))
model.add(keras.layers.Flatten(name='Flatten'))
model.add(keras.layers.Dense(units=2, activation='softmax', name='Dense'))
model.build()

reg_index: The indices of layer.get_weights(), a single integer or a list of integers.
reg_slice: slices or a tuple of slices or a list of the previous choices. If multiple indices are provided in reg_index and reg_slice is not a list, then reg_slice is assumed to be equal for all the indices. The whole array will be used if you leave this argument to None.
reg_factor: The factor of the regularization, a float or a list of floats.

Multi-Head Attention

A more specific multi-head layer is provided (since the general one is harder to use). The layer uses scaled dot product attention layers as its sub-layers and only head_num is required:

from tensorflow import keras
from keras_multi_head import MultiHeadAttention

input_layer = keras.layers.Input(
    shape=(2, 3),
    name='Input',
)
att_layer = MultiHeadAttention(
    head_num=3,
    name='Multi-Head',
)(input_layer)
model = keras.models.Model(inputs=input_layer, outputs=att_layer)
model.compile(
    optimizer='adam',
    loss='mse',
    metrics={},
)
model.summary()

The shapes of input and output tensors would be the same if only one layer is presented as input. The input layers will be considered as query, key and value when a list is given:

from tensorflow import keras
from keras_multi_head import MultiHeadAttention

input_query = keras.layers.Input(
    shape=(2, 3),
    name='Input-Q',
)
input_key = keras.layers.Input(
    shape=(4, 5),
    name='Input-K',
)
input_value = keras.layers.Input(
    shape=(4, 6),
    name='Input-V',
)
att_layer = MultiHeadAttention(
    head_num=3,
    name='Multi-Head',
)([input_query, input_key, input_value])
model = keras.models.Model(inputs=[input_query, input_key, input_value], outputs=att_layer)
model.compile(
    optimizer='adam',
    loss='mse',
    metrics={},
)
model.summary()

Project details

These details have not been verified by PyPI

Project links

Homepage

License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3

Release history Release notifications | RSS feed

This version

0.29.0

Jan 22, 2022

0.28.0

Jun 15, 2021

0.27.0

Jun 6, 2020

0.26.0

Jun 3, 2020

0.25.0

Jun 2, 2020

0.24.0

May 17, 2020

0.22.0

Aug 23, 2019

0.20.0

May 21, 2019

0.19.0

May 11, 2019

0.18.0

Apr 16, 2019

0.17.0

Apr 16, 2019

0.16.0

Mar 11, 2019

0.15.0

Feb 1, 2019

0.14.0

Nov 13, 2018

0.13.0

Nov 12, 2018

0.12.0

Nov 12, 2018

0.11.0

Nov 12, 2018

0.10.0

Nov 9, 2018

0.9.0

Nov 8, 2018

0.8.0

Nov 8, 2018

0.7.0

Nov 7, 2018

0.6.0

Nov 7, 2018

0.5

Oct 19, 2018

0.4

Sep 21, 2018

0.3

Sep 20, 2018

0.2

Sep 20, 2018

0.1

Sep 20, 2018

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

keras-multi-head-0.29.0.tar.gz (13.7 kB view details)

Uploaded Jan 22, 2022 Source

File details

Details for the file keras-multi-head-0.29.0.tar.gz.

File metadata

Download URL: keras-multi-head-0.29.0.tar.gz
Upload date: Jan 22, 2022
Size: 13.7 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.32.2 CPython/3.7.4

File hashes

Hashes for keras-multi-head-0.29.0.tar.gz
Algorithm	Hash digest
SHA256	`b0634eed2b77d6b34097a2d7ec49d080d778813218dd61374fd776e21762bbf0`
MD5	`fad0c0532a7c37b34708f3023e3707c0`
BLAKE2b-256	`2c215e1699e9d63a8e3c0d5fd0716b9a8be7d8c2c07fc8de34902e55de5ba58e`

See more details on using hashes here.

keras-multi-head 0.29.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Keras Multi-Head

Install

Usage

Duplicate Layers

Use Multiple-Layers

Linear Transformation

Regularization

Multi-Head Attention

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

File details

File metadata

File hashes