Machine learning tools to complement the AIronSuit package.

Project description

AIronTools

AIronTools (Beta) is a Python library of deep learning tools built on TensorFlow as the backend.

Key features:

  1. Model constructor that allows multiple models to be optimized in parallel across multiple GPUs.
  2. Block constructor to build customised blocks/models.
  3. Layer constructor to build customised layers.
  4. Preprocessing utilities.

Installation

pip install airontools
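
Quick example

A minimal sketch of the layer constructor (the parameter names shown are the ones used in the VAE example below; anything beyond those is an assumption):

from tensorflow.keras.layers import Input
from airontools.constructors.layers import layer_constructor

inputs = Input(shape=(10,))
outputs = layer_constructor(
    inputs,
    name='dense_block',
    units=5,  # dense units applied within the block
    advanced_reg=True)  # advanced regularisation options, as in the VAE example below
# outputs is a Keras tensor that can be wired into a Model like any other layer output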

Custom Keras subclass to build a variational autoencoder (VAE) with airontools, compatible with aironsuit

import json

import numpy as np
import tensorflow as tf
from tensorflow.keras.layers import Input, Layer, Reshape
from tensorflow.keras.losses import binary_crossentropy
from tensorflow.keras.metrics import Mean
from tensorflow.keras.models import Model

from airontools.constructors.layers import layer_constructor


class ImageVAE(Model):

    def __init__(self, latent_dim, **kwargs):
        super(ImageVAE, self).__init__(**kwargs)

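        # Running means that report each loss component as a Keras metric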
        self.total_loss_tracker = Mean(name="total_loss")
        self.reconstruction_loss_tracker = Mean(name="reconstruction_loss")
        self.kl_loss_tracker = Mean(name="kl_loss")

        # Encoder
        encoder_inputs = Input(shape=(28, 28, 1))
        encoder_conv = layer_constructor(
            encoder_inputs,
            name='encoder_conv',
            filters=32,  # Number of filters used for the convolutional layer
            kernel_size=3,  # Kernel size used for the convolutional layer
            strides=2,  # Strides used for the convolutional layer
            sequential_axis=-1,  # the channel axis, used as the sequence dimension for the self-attention layer
            num_heads=2,  # Self-attention heads applied after the convolutional layer
            units=latent_dim,  # Dense units applied after the self-attention layer
            advanced_reg=True)
        z_mean = layer_constructor(
            encoder_conv,
            name='z_mean',
            units=latent_dim,
            advanced_reg=True)
        z_log_var = layer_constructor(
            encoder_conv,
            name='z_log_var',
            units=latent_dim,
            advanced_reg=True)
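        # Sample z with the reparameterisation trick (Sampling is defined below)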
        z = Sampling(name='z')([z_mean, z_log_var])
        self.encoder = Model(encoder_inputs, [z_mean, z_log_var, z], name="encoder")

        # Decoder
        latent_inputs = Input(shape=(latent_dim,))
        decoder_outputs = layer_constructor(
            latent_inputs,
            name='decoder_dense',
            units=7 * 7 * 64,
            advanced_reg=True)
        decoder_outputs = Reshape((7, 7, 64))(decoder_outputs)
        for i, filters, activation in zip([1, 2], [64, 32], ['relu', 'relu']):
            decoder_outputs = layer_constructor(
                decoder_outputs,
                name='decoder_conv',
                name_ext=str(i),
                filters=filters,
                kernel_size=3,
                strides=2,
                padding='same',
                conv_transpose=True,
                activation=activation,
                advanced_reg=True)
        decoder_outputs = layer_constructor(
            decoder_outputs,
            name='decoder_output',
            filters=1,
            kernel_size=3,
            padding='same',
            conv_transpose=True,
            activation='sigmoid',
            advanced_reg=True)
        self.decoder = Model(latent_inputs, decoder_outputs, name="decoder")

    @property
    def metrics(self):
        return [
            self.total_loss_tracker,
            self.reconstruction_loss_tracker,
            self.kl_loss_tracker,
        ]

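    # Custom training step: run the forward pass under a gradient tape,
    # apply gradients, and update the loss trackers.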
    def train_step(self, data):
        total_loss, reconstruction_loss, kl_loss, tape = self.loss_evaluation(data, return_tape=True)
        grads = tape.gradient(total_loss, self.trainable_weights)
        self.optimizer.apply_gradients(zip(grads, self.trainable_weights))
        self.total_loss_tracker.update_state(total_loss)
        self.reconstruction_loss_tracker.update_state(reconstruction_loss)
        self.kl_loss_tracker.update_state(kl_loss)
        return {
            "loss": self.total_loss_tracker.result(),
            "reconstruction_loss": self.reconstruction_loss_tracker.result(),
            "kl_loss": self.kl_loss_tracker.result(),
        }

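    # Simplified override of keras.Model.evaluate: returns the three loss
    # components for the given data as plain numpy values.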
    def evaluate(self, data):
        total_loss, reconstruction_loss, kl_loss = self.loss_evaluation(data)
        return {
            'total_loss': total_loss.numpy(),
            'reconstruction_loss': reconstruction_loss.numpy(),
            'kl_loss': kl_loss.numpy()
        }

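    # Computes total = reconstruction + KL loss; optionally returns the
    # GradientTape so train_step can backpropagate through the same pass.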
    def loss_evaluation(self, data, return_tape=False):
        def loss_evaluation_():
            z_mean, z_log_var, z = self.encoder(data)
            reconstruction = self.decoder(z)
            reconstruction_loss = tf.reduce_mean(
                tf.reduce_sum(
                    binary_crossentropy(data, reconstruction), axis=(1, 2)
                )
            )
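            # KL divergence between the approximate posterior and a standard normal prior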
            kl_loss = -0.5 * (1 + z_log_var - tf.square(z_mean) - tf.exp(z_log_var))
            kl_loss = tf.reduce_mean(tf.reduce_sum(kl_loss, axis=1))
            total_loss = reconstruction_loss + kl_loss
            return total_loss, reconstruction_loss, kl_loss

        if return_tape:
            with tf.GradientTape() as tape:
                total_loss, reconstruction_loss, kl_loss = loss_evaluation_()
            return total_loss, reconstruction_loss, kl_loss, tape
        else:
            total_loss, reconstruction_loss, kl_loss = loss_evaluation_()
            return total_loss, reconstruction_loss, kl_loss

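    # Serialise encoder and decoder weights to JSON, one file per submodel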
    def save_weights(self, path):
        with open(path + '_encoder', 'w') as f:
            json.dump([w.tolist() for w in self.encoder.get_weights()], f)
        with open(path + '_decoder', 'w') as f:
            json.dump([w.tolist() for w in self.decoder.get_weights()], f)

    def load_weights(self, path):
        with open(path + '_encoder', 'r') as f:
            encoder_weights = [np.array(w) for w in json.load(f)]
        self.encoder.set_weights(encoder_weights)
        with open(path + '_decoder', 'r') as f:
            decoder_weights = [np.array(w) for w in json.load(f)]
        self.decoder.set_weights(decoder_weights)

    def summary(self):
        self.encoder.summary()
        self.decoder.summary()


class Sampling(Layer):
    """Uses (z_mean, z_log_var) to sample z, the vector encoding a digit."""

    def call(self, inputs):
        z_mean, z_log_var = inputs
        batch = tf.shape(z_mean)[0]
        dim = tf.shape(z_mean)[1]
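        # Reparameterisation trick: z = mean + std * eps, so sampling stays differentiable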
        epsilon = tf.keras.backend.random_normal(shape=(batch, dim))
        return z_mean + tf.exp(0.5 * z_log_var) * epsilon
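
Training usage is standard Keras. As a minimal sketch (the MNIST preprocessing and hyperparameters here are assumptions, not from the package docs):

import numpy as np
import tensorflow as tf

(x_train, _), _ = tf.keras.datasets.mnist.load_data()
x_train = np.expand_dims(x_train, -1).astype('float32') / 255.

vae = ImageVAE(latent_dim=2)
vae.compile(optimizer=tf.keras.optimizers.Adam())  # losses are computed inside train_step
vae.fit(x_train, epochs=5, batch_size=128)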

More examples

See the usage examples in aironsuit/examples.
