


MLable

Tensorflow libs: layers, metrics, ops, etc.

Installation

The package is available on PyPI:

pip install -U mlable

Layers

Divide

A relative reshaping layer that divides a given axis and multiplies another by the same factor:

import tensorflow as tf

import mlable.layers.reshaping

__x = tf.ones(shape=(2, 4, 6, 8))
__l = mlable.layers.reshaping.Divide(
    input_axis=2, # relative to the NEW shape / rank
    output_axis=-1, # same
    factor=3,
    insert=False,) # whether to create a new axis

list(__l(__x).shape)
# [2, 4, 2, 24]

Merge

A relative reshaping layer that merges two axes:

import tensorflow as tf

import mlable.layers.reshaping

__x = tf.ones(shape=(2, 4, 6, 8))
__l = mlable.layers.reshaping.Merge(
    left_axis=1,
    right_axis=-1,
    left=False,) # whether to merge into the left axis

list(__l(__x).shape)
# [2, 6, 32]

TokunEmbedding

These embeddings are made from the combination of elementary embeddings.

The layer inherits from keras.layers.Embedding. It expects a tensor with a shape following the structure:

  • axis -2: sequence axis, with dimension S / T
  • axis -1: token axis, with dimension T

The T values in the token axis are the indexes of the embeddings to be combined. Typically, these are byte values:

import tensorflow as tf

import mlable.layers.embedding

__x = tf.random.uniform((128, 1024, 16), minval=0, maxval=256, dtype=tf.int32)
__l = mlable.layers.embedding.TokunEmbedding(
    input_dim=256,
    output_dim=128,)

list(__l(__x).shape)
# [128, 1024, 2048]

And the output tensor has a shape (..., S / T, T * E), where T * E = H is the embedding dimension inside the LLM. In the above example, it is set to 2048.

RotaryPositionalEmbedding

Tensorflow implementation of RoPE:

import tensorflow as tf

import mlable.layers.embedding

__x = tf.ones(shape=(2, 3, 5))
__l = mlable.layers.embedding.RotaryPositionalEmbedding(
    sequence_axis=1, # position along this axis
    feature_axis=-1, # output axis
    max_wavelength=10_000, # see the paper
    scaling_factor=1.) # see the paper

__l(inputs=__x, offset=2) # the offset is typically used to perform iterative decoding during inference
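The rotation itself can be sketched in plain Python. This is an illustration that assumes feature i is paired with feature i + dim // 2, as in common RoPE implementations; the layer's exact pairing convention is not documented here:

```python
import math

def rotate(features, position, max_wavelength=10_000):
    # rotate each (i, i + half) feature pair by a position-dependent angle
    __dim = len(features)
    __half = __dim // 2
    __out = list(features)
    for __i in range(__half):
        __theta = position / (max_wavelength ** (2 * __i / __dim))
        __cos, __sin = math.cos(__theta), math.sin(__theta)
        __out[__i] = features[__i] * __cos - features[__i + __half] * __sin
        __out[__i + __half] = features[__i] * __sin + features[__i + __half] * __cos
    return __out

# at position 0 the rotation is the identity
rotate([1.0, 2.0, 3.0, 4.0], position=0)
# [1.0, 2.0, 3.0, 4.0]
```

Since each pair undergoes a pure rotation, the norm of the feature vector is preserved at every position, which is what makes RoPE encode only relative positions in the attention dot products.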

CachedMultiHeadAttention

This layer subclasses the regular MultiHeadAttention and adds a cache.

It has the same parameters:

import mlable.layers.transformer

mlable.layers.transformer.CachedMultiHeadAttention(
    num_heads,
    key_dim,
    value_dim=None,
    dropout=0.0,
    use_bias=True,
    output_shape=None,
    attention_axes=None,
    kernel_initializer='glorot_uniform',
    bias_initializer='zeros',
    kernel_regularizer=None,
    bias_regularizer=None,
    activity_regularizer=None,
    kernel_constraint=None,
    bias_constraint=None,
    **kwargs)

And its call method has the following arguments:

mlable.layers.transformer.CachedMultiHeadAttention.call(
    query,
    value,
    key=None,
    cache=None,
    step=None,
    training=False,
    attention_mask=None,
    return_attention_scores=False,
    use_causal_mask=True,)

FeedForwardGate

A typical feed-forward layer with GELU activation:

import tensorflow as tf

import mlable.layers.transformer

__x = tf.ones(shape=(2, 3, 5), dtype=tf.dtypes.float32)
__l = mlable.layers.transformer.FeedForwardGate(
    input_dim=256,
    hidden_dim=1024)

__l(__x)
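The computation can be sketched in plain Python, assuming the usual gated formulation GELU(x W_gate) * (x W_up) W_down; whether the layer uses exactly this variant is an assumption here, suggested by its name:

```python
import math

def gelu(__x):
    # exact GELU, expressed with the Gaussian CDF
    return 0.5 * __x * (1.0 + math.erf(__x / math.sqrt(2.0)))

def feed_forward_gate(__inputs, __w_gate, __w_up, __w_down):
    # project the inputs twice: one branch is activated, the other gates it
    __gate = [gelu(sum(__i * __w for __i, __w in zip(__inputs, __col))) for __col in __w_gate]
    __up = [sum(__i * __w for __i, __w in zip(__inputs, __col)) for __col in __w_up]
    __hidden = [__g * __u for __g, __u in zip(__gate, __up)]
    # project back down to the input dimension
    return [sum(__h * __w for __h, __w in zip(__hidden, __col)) for __col in __w_down]

# 1-dimensional toy weights: the output reduces to GELU(x) * x
feed_forward_gate([1.0], [[1.0]], [[1.0]], [[1.0]])
# ≈ [0.8413]
```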

Metrics

Hierarchical models should not be scored on individual predictions but on their combination.

For example, tokun is a byte-level autoencoder: it predicts probabilities for each byte of the output. The UTF-32-BE encoding of "a" is (0, 0, 0, 97).

A prediction of (0, 0, 0, 98) for "a" has 3 correct bytes out of 4, but the prediction actually decodes to "b".

In this case the byte accuracy is 75% while the character accuracy is 0%. Having several hierarchies of scoring helps with training and evaluation.

The individual predictions are evaluated in groups forming logical entities. These predictions can be in binary, categorical or raw formats. Each of these formats has a dedicated metric.
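The grouping logic can be sketched in plain Python on the "a" / "b" example above (a toy illustration with hard byte predictions rather than probabilities):

```python
def group_accuracy(targets, predictions, group=1):
    # split the byte-level predictions into groups of the given size
    __pairs = list(zip(targets, predictions))
    __groups = [__pairs[__i:__i + group] for __i in range(0, len(__pairs), group)]
    # a group counts as correct only when every prediction inside it is correct
    __correct = sum(all(__t == __p for __t, __p in __g) for __g in __groups)
    return __correct / len(__groups)

__target = (0, 0, 0, 97)  # "a" in UTF-32-BE
__output = (0, 0, 0, 98)  # actually decodes to "b"

group_accuracy(__target, __output, group=1)
# 0.75, the byte accuracy

group_accuracy(__target, __output, group=4)
# 0.0, the character accuracy
```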

BinaryGroupAccuracy

Arguments:

  • group: the number of elementary predictions that must be correct to predict a higher level entity
  • depth: the dimension of the binary embedding for each predicted value
  • threshold: probabilities below the threshold are scored as 0, and those above as 1

import mlable.metrics

byte_accuracy = mlable.metrics.BinaryGroupAccuracy(group=1, depth=8, threshold=0.6, name='byte_accuracy')
character_accuracy = mlable.metrics.BinaryGroupAccuracy(group=4, depth=8, threshold=0.6, name='character_accuracy')
token_accuracy = mlable.metrics.BinaryGroupAccuracy(group=64, depth=8, threshold=0.6, name='token_accuracy')

CategoricalGroupAccuracy

Arguments:

  • group: the number of elementary predictions that must be correct to predict a higher level entity

import mlable.metrics

byte_accuracy = mlable.metrics.CategoricalGroupAccuracy(group=1, name='byte_accuracy')
character_accuracy = mlable.metrics.CategoricalGroupAccuracy(group=4, name='character_accuracy')
token_accuracy = mlable.metrics.CategoricalGroupAccuracy(group=64, name='token_accuracy')

RawGroupAccuracy

Arguments:

  • group: the number of elementary predictions that must be correct to predict a higher level entity
  • factor: scaling factor, typically from a probability distribution to a numeric value

import mlable.metrics

byte_accuracy = mlable.metrics.RawGroupAccuracy(group=1, factor=256.0, name='byte_accuracy')
character_accuracy = mlable.metrics.RawGroupAccuracy(group=4, factor=256.0, name='character_accuracy')
token_accuracy = mlable.metrics.RawGroupAccuracy(group=64, factor=256.0, name='token_accuracy')

Credits

Andrej Karpathy reconnected my ML synapses with micrograd.

License

Licensed under the AGPLv3.
