TensorFlow libs: layers, metrics, ops, etc.

Project description

MLable

TensorFlow libs:

layers:
- reshaping:
  - Divide
  - Merge
- embedding:
  - RotaryPositionalEmbedding
- transformer:
  - CachedMultiHeadAttention
  - FeedForwardGate
metrics:
- CategoricalGroupAccuracy

Installation

The package is available on pypi:

pip install -U mlable

Layers

Divide

Relative reshaping layer that divides a given axis by a factor and multiplies another axis by the same factor:

import tensorflow as tf
import mlable.layers.reshaping

__x = tf.ones(shape=(2, 4, 6, 8))
__l = mlable.layers.reshaping.Divide(
    input_axis=2, # relative to the NEW shape / rank
    output_axis=-1, # same
    factor=3,
    insert=False,) # whether to create a new axis

list(__l(__x).shape)
# [2, 4, 2, 24]
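The shape arithmetic behind the example above can be sketched in plain Python. This is only an illustration of the `insert=False` case: `divide_shape` is a hypothetical helper, not part of mlable, and it assumes the layer leaves every other axis untouched.

```python
def divide_shape(shape, input_axis, output_axis, factor):
    """Return the shape obtained by dividing one axis and multiplying another.

    Hypothetical helper: mirrors the `insert=False` example, nothing more.
    """
    rank = len(shape)
    # normalize negative axes, e.g. -1 => rank - 1
    source, target = input_axis % rank, output_axis % rank
    if shape[source] % factor != 0:
        raise ValueError('the input axis must be divisible by the factor')
    result = list(shape)
    result[source] //= factor  # 6 => 2 in the example
    result[target] *= factor   # 8 => 24 in the example
    return result

divided = divide_shape((2, 4, 6, 8), input_axis=2, output_axis=-1, factor=3)
print(divided)  # [2, 4, 2, 24]
```

The total number of elements is unchanged (2 * 4 * 6 * 8 = 2 * 4 * 2 * 24 = 384), which is what makes the operation a valid reshape.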

Merge

Relative reshaping layer that merges two axes into one:

import tensorflow as tf
import mlable.layers.reshaping

__x = tf.ones(shape=(2, 4, 6, 8))
__l = mlable.layers.reshaping.Merge(
    left_axis=1,
    right_axis=-1,
    left=False,) # whether to merge into the left axis

list(__l(__x).shape)
# [2, 6, 32]
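As a rough sketch of the shape arithmetic, the merged axis takes the product of the two dimensions and the other axis disappears. `merge_shape` below is a hypothetical helper written for this illustration, and the behavior for `left=True` is an assumption inferred from the parameter name.

```python
def merge_shape(shape, left_axis, right_axis, left=True):
    """Return the shape obtained by merging two axes into one.

    Hypothetical helper: the `left=True` branch is an assumption.
    """
    rank = len(shape)
    # normalize negative axes, e.g. -1 => rank - 1
    first, second = left_axis % rank, right_axis % rank
    result = list(shape)
    merged = result[first] * result[second]
    # the kept axis receives the product, the other one is removed
    keep, drop = (first, second) if left else (second, first)
    result[keep] = merged
    del result[drop]
    return result

merged_right = merge_shape((2, 4, 6, 8), left_axis=1, right_axis=-1, left=False)
merged_left = merge_shape((2, 4, 6, 8), left_axis=1, right_axis=-1, left=True)
print(merged_right)  # [2, 6, 32]
print(merged_left)   # [2, 32, 6]
```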

CachedMultiHeadAttention

This layer subclasses the regular MultiHeadAttention and adds a cache.

It has the same parameters:

import mlable.layers.transformer

mlable.layers.transformer.CachedMultiHeadAttention(
    num_heads,
    key_dim,
    value_dim=None,
    dropout=0.0,
    use_bias=True,
    output_shape=None,
    attention_axes=None,
    kernel_initializer='glorot_uniform',
    bias_initializer='zeros',
    kernel_regularizer=None,
    bias_regularizer=None,
    activity_regularizer=None,
    kernel_constraint=None,
    bias_constraint=None,
    **kwargs)

Its call method takes the following arguments:

mlable.layers.transformer.CachedMultiHeadAttention.call(
    query,
    value,
    key=None,
    cache=None,
    step=None,
    training=False,
    attention_mask=None,
    return_attention_scores=False,
    use_causal_mask=True,)
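The `cache` and `step` arguments point to incremental decoding, where keys and values from earlier positions are stored instead of being recomputed. The layout of the cache in mlable is not documented here, so the NumPy sketch below only illustrates the general technique. It uses a single head and a hypothetical buffer of shape (2, max_length, dim), and all the function names are made up for the illustration.

```python
import numpy as np

def softmax(x):
    """Numerically stable softmax over the last axis."""
    e = np.exp(x - x.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

def causal_attention(q, k, v):
    """Reference: full causal attention over a (length, dim) sequence."""
    scores = q @ k.T / np.sqrt(q.shape[-1])
    mask = np.tril(np.ones(scores.shape, dtype=bool))
    return softmax(np.where(mask, scores, -np.inf)) @ v

def cached_attention_step(q_t, k_t, v_t, cache, step):
    """Attend from a single position, reusing the keys / values in the cache."""
    cache[0, step] = k_t  # store the new key
    cache[1, step] = v_t  # store the new value
    keys, values = cache[0, :step + 1], cache[1, :step + 1]
    scores = keys @ q_t / np.sqrt(q_t.shape[-1])
    return softmax(scores) @ values, cache

rng = np.random.default_rng(0)
length, dim = 5, 4
q, k, v = (rng.normal(size=(length, dim)) for _ in range(3))

expected = causal_attention(q, k, v)
cache = np.zeros((2, length, dim))
outputs = []
for step in range(length):
    out, cache = cached_attention_step(q[step], k[step], v[step], cache, step)
    outputs.append(out)
outputs = np.stack(outputs)
```

Decoding one position at a time with the cache gives the same outputs as the full causal attention, while each step only computes the scores of the newest position.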

FeedForwardGate

A typical feed-forward layer with GELU activation:

import tensorflow as tf
import mlable.layers.transformer

__x = tf.ones(shape=(2, 3, 5), dtype=tf.dtypes.float32)
__l = mlable.layers.transformer.FeedForwardGate(
    input_dim=256,
    hidden_dim=1024)

__l(__x)
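For intuition, a gated feed-forward block can be sketched in NumPy as follows. This is a generic GELU-gated variant and an assumption about what the layer computes, not a copy of the mlable implementation. The weight names and sizes are hypothetical, and biases are left out.

```python
import numpy as np

def gelu(x):
    """Tanh approximation of the GELU activation."""
    return 0.5 * x * (1.0 + np.tanh(np.sqrt(2.0 / np.pi) * (x + 0.044715 * x ** 3)))

def gated_feed_forward(x, w_gate, w_linear, w_out):
    """Expand twice, gate one branch with GELU, multiply, then project back."""
    return (gelu(x @ w_gate) * (x @ w_linear)) @ w_out

rng = np.random.default_rng(0)
input_dim, hidden_dim = 5, 16  # hypothetical sizes
x = rng.normal(size=(2, 3, input_dim))
w_gate = rng.normal(size=(input_dim, hidden_dim))
w_linear = rng.normal(size=(input_dim, hidden_dim))
w_out = rng.normal(size=(hidden_dim, input_dim))

y = gated_feed_forward(x, w_gate, w_linear, w_out)
print(y.shape)  # (2, 3, 5)
```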

RotaryPositionalEmbedding

TensorFlow implementation of rotary positional embeddings (RoPE):

import tensorflow as tf
import mlable.layers.embedding

__x = tf.ones(shape=(2, 3, 5))
__l = mlable.layers.embedding.RotaryPositionalEmbedding(
    sequence_axis=1, # position along this axis
    feature_axis=-1, # output axis
    max_wavelength=10_000, # see the paper
    scaling_factor=1.) # see the paper

__l(inputs=__x, offset=2) # the offset is typically used to perform iterative decoding during inference
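The role of the offset can be illustrated with a small NumPy sketch of RoPE. It assumes the "split halves" pairing of features and an even feature dimension, and it leaves out `scaling_factor`. The mlable layer may pair features differently, so treat the `rope` function below as a hypothetical stand-in.

```python
import numpy as np

def rope(x, offset=0, max_wavelength=10_000.0):
    """Rotate pairs of features by an angle proportional to the position.

    x has shape (batch, sequence, features), with an even number of features.
    """
    length, dim = x.shape[1], x.shape[2]
    half = dim // 2
    # one frequency per pair of features
    inverse_freq = 1.0 / (max_wavelength ** (np.arange(half) / half))
    # the offset shifts the absolute positions
    positions = np.arange(length) + offset
    angles = positions[:, None] * inverse_freq[None, :]  # (sequence, half)
    cos, sin = np.cos(angles), np.sin(angles)
    x1, x2 = x[..., :half], x[..., half:]
    return np.concatenate([x1 * cos - x2 * sin, x1 * sin + x2 * cos], axis=-1)

rng = np.random.default_rng(0)
x = rng.normal(size=(1, 5, 8))

full = rope(x)                     # embed the whole sequence at once
tail = rope(x[:, 2:], offset=2)    # embed only the positions 2, 3, 4
```

Embedding the tail of the sequence with `offset=2` matches the corresponding slice of the full embedding, which is what allows decoding one token at a time during inference. Since each pair of features is only rotated, the norm of every vector is preserved.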

Metrics

CategoricalGroupAccuracy

Hierarchical models should not be scored on individual predictions but on their combination.

For example, tokun is a byte-level autoencoder. It predicts a probability distribution for each byte of the output, such as the four bytes (0, 0, 0, 97) of the UTF-32-BE encoding of "a".

A prediction of (0, 0, 0, 98) for "a" has 3 correct bytes out of 4, but it actually decodes to "b".

In this case the byte accuracy is 75% while the character accuracy is 0%. Having several hierarchies of scoring helps with training and evaluation.

import mlable.metrics

byte_accuracy = mlable.metrics.CategoricalGroupAccuracy(group=1, name='byte_accuracy')
character_accuracy = mlable.metrics.CategoricalGroupAccuracy(group=4, name='character_accuracy')
token_accuracy = mlable.metrics.CategoricalGroupAccuracy(group=64, name='token_accuracy')
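The idea of group accuracy can be sketched in plain Python on the example above: a group only counts as correct when all of its elements match. The `group_accuracy` function is a hypothetical helper working on integer labels, whereas the metric itself is presumably fed model outputs.

```python
def group_accuracy(y_true, y_pred, group):
    """Fraction of groups of consecutive items that are entirely correct."""
    if len(y_true) != len(y_pred) or len(y_true) % group != 0:
        raise ValueError('both sequences must have the same length, a multiple of the group size')
    matches = [t == p for t, p in zip(y_true, y_pred)]
    # a group is correct only if every item inside it is correct
    groups = [all(matches[i:i + group]) for i in range(0, len(matches), group)]
    return sum(groups) / len(groups)

y_true = [0, 0, 0, 97]  # UTF-32-BE bytes of "a"
y_pred = [0, 0, 0, 98]  # UTF-32-BE bytes of "b"

byte_score = group_accuracy(y_true, y_pred, group=1)
character_score = group_accuracy(y_true, y_pred, group=4)
print(byte_score, character_score)  # 0.75 0.0
```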

Credits

Andrej Karpathy reconnected my ML synapses with micrograd.

License

Licensed under the AGPLv3.

Download files

Source Distribution

mlable-0.7.9.tar.gz (17.1 kB)

Uploaded Aug 24, 2024 Source

Built Distribution

mlable-0.7.9-py3-none-any.whl (22.2 kB)

Uploaded Aug 24, 2024 Python 3

Hashes for mlable-0.7.9.tar.gz

Algorithm	Hash digest
SHA256	`6026d6d714e3ea731e2acb2cf061403189dacdc8608958f53edc9089de48bf4d`
MD5	`d8a761049013b7ef60b34c85909ebf60`
BLAKE2b-256	`f0d44df777a13afb2b951a700523e2a1381b7640504bcbedf255acd9bd542ef9`

Hashes for mlable-0.7.9-py3-none-any.whl

Algorithm	Hash digest
SHA256	`6b5d34060e17b7982e7c93b907ba46bba990ee5ace065dc0a5ae2afdcfce24d1`
MD5	`ab2aca8d980b6ab49f195a48e3468c22`
BLAKE2b-256	`adf041f8b49478cf7265af2add0d5d0497b60d1acad532c3d557727db6ea2689`