A Bayesian Neural Network implementation in TensorFlow

Project description

TensorBNN

This package contains code for training Bayesian Neural Networks using Hamiltonian Monte Carlo sampling, as proposed by Radford Neal in his thesis "Bayesian Learning for Neural Networks", along with added features. The package is written in Python 3 and uses TensorFlow and TensorFlow Probability as the framework for the implementation.

Dependencies

All of the code here is written in Python 3 and depends on the packages numpy, tensorflow, tensorflow-probability, and scipy.

The package, along with numpy and scipy, can be installed via

pip install tensorBNN

Alternatively, you can install numpy and scipy directly through the command:

pip3 install numpy scipy

TensorFlow and TensorFlow Probability must be installed separately. The TensorFlow version must be 2.0; a 1.x version will not work. It is also highly recommended that this code be run on a GPU due to its high computational cost. TensorFlow 2.0 for the GPU can be installed with the command:

pip3 install tensorflow-gpu==2.0.0-beta1

In order to be compatible with this version of TensorFlow 2.0, tensorflow-probability version 0.8.0 must be installed. This is done with the following command:

pip3 install tensorflow-probability==0.8.0

Usage

In order to use this code you must import network, DenseLayer, and an activation function such as Relu. This can be done as follows:

from TensorBNN.layer import DenseLayer
from TensorBNN.network import network
from TensorBNN.activationFunctions import Relu

Next, it is highly convenient to turn off the deprecation warnings. These all come from tensorflow, tensorflow-probability, and numpy interacting with tensorflow, so they are not easily fixed, and there are a lot of them. They can be turned off with:

import warnings
warnings.filterwarnings("ignore", category=DeprecationWarning)
warnings.filterwarnings("ignore", category=UserWarning)

The other important setup task is deciding whether or not to seed the random number generators before training. Note that if you are running on a GPU there will always be some randomness that cannot be removed. To seed all CPU random number generators, use these lines of code:

import os

import numpy as np
import random as rn
import tensorflow as tf

os.environ["PYTHONHASHSEED"] = "0"
np.random.seed(42)
rn.seed(12345)
tf.random.set_seed(3)

Moving on to the actual use of this code, start with the declaration of a network object:

neuralNet = network.network(dtype, inputDims, trainX, trainY, validateX, validateY, mean, sd)

The parameters are described as follows:

  • dtype: data type for Tensors
  • inputDims: dimension of input vector
  • trainX: the training data input, shape is n by inputDims
  • trainY: the training data output
  • validateX: the validation data input, shape is n by inputDims
  • validateY: the validation data output
  • mean: the mean used to scale trainY and validateY
  • sd: standard deviation used to scale trainY and validateY

Next, add all of the desired layers and activation functions as follows:

neuralNet.add(DenseLayer(inputDims, outputDims, seed=seed, dtype=tf.float32))
neuralNet.add(Relu())

For added control, especially when using pre-trained networks, it is possible to feed in pretrained weights, biases, and values for the activation functions. This can be done as follows:

neuralNet.add(DenseLayer(inputDims,outputDims, weights=weights, biases=biases, seed=seed, dtype=dtype))
neuralNet.add(SquarePrelu(width, alpha=alpha**(0.5), activation=activation, dtype=dtype))

The parameter inputDims is the output shape of the previous layer, and width is the output shape of the layer itself. The seed is used for seeding the random number generator. Currently, only ReLU is supported for easy predictions from saved networks. The other activation functions can be used, but predicting from saved networks with them requires more custom code.
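Because each layer's inputDims must match the output width of the layer before it, a small check over the planned (inputDims, outputDims) pairs can catch mismatches before building the network. This helper is a sketch, not part of TensorBNN:

```python
def check_dims(layer_dims):
    """Verify that consecutive (inputDims, outputDims) pairs chain correctly."""
    for (in_prev, out_prev), (in_next, _) in zip(layer_dims, layer_dims[1:]):
        if out_prev != in_next:
            raise ValueError(
                f"layer expects input {in_next}, previous layer outputs {out_prev}")
    return True

# A 3-layer architecture: 4 inputs -> 50 -> 50 -> 1 output
print(check_dims([(4, 50), (50, 50), (50, 1)]))  # True
```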

Next, the Markov Chain Monte Carlo algorithm must be initialized. This can be done as follows:

neuralNet.setupMCMC(stepSize, stepMin, stepMax, stepNum, leapfrog, leapMin,
                    leapMax, leapStep, hyperStepSize, hyperLeapfrog, burnin,
                    cores, averagingSteps=2, a=4, delta=0.1)

The parameters are described as follows:

  • stepSize: the starting step size for the weights and biases
  • stepMin: the minimum step size
  • stepMax: the maximum step size
  • stepNum: the number of step sizes in grid
  • leapfrog: number of leapfrog steps for weights and biases
  • leapMin: the minimum number of leapfrog steps
  • leapMax: the maximum number of leapfrog steps
  • leapStep: the step in number of leapfrog for search grid
  • hyperStepSize: the starting step size for the hyper parameters
  • hyperLeapfrog: leapfrog steps for the hyper parameters
  • burnin: number of burn-in steps before networks are saved
  • cores: number of cores to use
  • averagingSteps: number of averaging steps
  • a: constant, 4 in paper
  • delta: constant, 0.1 in paper

This code uses the adaptive Hamiltonian Monte Carlo described in "Adaptive Hamiltonian and Riemann Manifold Monte Carlo Samplers" by Wang, Mohamed, and de Freitas. In accordance with this paper there are a few more parameters that can be adjusted, though it is recommended that their default values be kept.
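To make the grid parameters concrete, here is a sketch of the step-size and leapfrog search grids they describe, assuming uniform spacing (the exact grids setupMCMC builds internally may differ):

```python
import numpy as np

# Illustrative values for the grid parameters described above.
stepMin, stepMax, stepNum = 1e-4, 1e-2, 5
leapMin, leapMax, leapStep = 100, 1000, 100

stepGrid = np.linspace(stepMin, stepMax, stepNum)     # candidate step sizes
leapGrid = np.arange(leapMin, leapMax + 1, leapStep)  # candidate leapfrog counts

print(len(stepGrid), len(leapGrid))  # 5 10
```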

After initializing the HMC, we must declare the likelihood that we want to use as well as any metrics. This can be accomplished through the following code:

# Declare Gaussian likelihood with sd of 0.1
likelihood = GaussianLikelihood(sd=0.1)
metricList = [  # declare metrics
    SquaredError(mean=0, sd=1, scaleExp=False),
    PercentError(mean=10, sd=2, scaleExp=True)]
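The mean and sd given to each metric let it undo the target scaling before scoring, and scaleExp additionally exponentiates when the targets were log-transformed. A sketch of that unscaling step, assuming the metrics behave roughly this way:

```python
import numpy as np

def unscale(pred, mean, sd, scaleExp=False):
    """Undo target standardization (and optional log transform) before scoring."""
    real = pred * sd + mean
    return np.exp(real) if scaleExp else real

scaled = np.array([-1.0, 0.0, 1.0])
print(unscale(scaled, mean=10, sd=2))  # [ 8. 10. 12.]
```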

The last thing to do is tell the model to start learning. This is done with the following command:

neuralNet.train(
        epochs, # epochs to train for
        samplingStep, # increment between network saves
        likelihood,
        metricList=metricList,
        folderName="Regression",
        # Name of folder for saved networks
        networksPerFile=50)
        # Number of networks saved per file

The arguments have the following meanings:

  • epochs: number of training cycles
  • samplingStep: epochs between sampled networks
  • likelihood: the likelihood function, defined above, used to evaluate predictions
  • metricList: the list of metrics to evaluate during training
  • folderName: name of the folder for saved networks
  • networksPerFile: number of networks saved in a given file
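Ignoring burn-in for simplicity (the real count also depends on the burnin setting), the number of saved networks and files follows directly from these settings; for example:

```python
# Illustrative values, not defaults from the package.
epochs, samplingStep, networksPerFile = 2500, 25, 50

savedNetworks = epochs // samplingStep        # one network every samplingStep epochs
files = -(-savedNetworks // networksPerFile)  # ceiling division for file count
print(savedNetworks, files)  # 100 2
```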

Once the network has trained, which may take a while, the saved networks can be loaded and then used to make predictions using the following code:

from TensorBNN.predictor import predictor 

network = predictor(filePath,
                    dtype = dtype, 
                    # data type used by network
                    customLayerDict={"dense2": Dense2},
                    # A dense layer with a different 
                    # hyperprior
                    likelihood = Likelihood)
                    # The likelihood function is required to  
                    # calculate the probabilities for 
                    # re-weighting

initialResults = network.predict(inputData, skip, dtype)

The variable filePath is the directory from which the networks are being loaded, inputData is the normalized data for which predictions should be made, and dtype is the data type to be used for predictions. The customLayerDict is a dictionary holding the names and objects for any user defined layers. Likelihood is the likelihood function used to train the model.

The variable initialResults will be a list of numpy arrays, each corresponding to the predictions from a single network in the BNN. The skip variable instructs the predictor to use only every n-th network, where n = skip.
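Because initialResults holds one prediction array per sampled network, the BNN's point estimate and uncertainty come from reducing across that list. A sketch with numpy, using a stand-in for the predictor's output:

```python
import numpy as np

# Stand-in for predictor output: one prediction array per saved network.
initialResults = [np.array([0.9, 2.1]), np.array([1.1, 1.9]), np.array([1.0, 2.0])]

stacked = np.stack(initialResults)  # shape (networks, samples)
meanPred = stacked.mean(axis=0)     # posterior predictive mean
stdPred = stacked.std(axis=0)       # per-point uncertainty estimate
print(meanPred)  # [1. 2.]
```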

Additionally, the predictor object allows for the calculation of the correlation between different networks through:

correlations = network.correlation(dtype)

Finally, the predictor object can calculate new weights for the different networks if they were given new priors. These priors take the form of new Layer objects which must be referenced in an architecture file. The reweighting function call looks like this:

weights = network.reweight(                                            
                    trainX, # training input
                    trainY, # training output
                    skip = 10, # Use every 10 saved networks
                    architecture = "architecture2.txt")
                    # New architecture file
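The returned weights can then serve as importance weights when combining the per-network predictions. A sketch assuming the weights are (or are normalized to be) a probability vector:

```python
import numpy as np

preds = np.array([[1.0, 2.0], [3.0, 4.0]])  # one row of predictions per network
weights = np.array([0.25, 0.75])            # stand-in for network.reweight output

weights = weights / weights.sum()  # ensure normalization
weightedMean = weights @ preds     # importance-weighted prediction
print(weightedMean)  # [2.5 3.5]
```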

Download files

Download the file for your platform.

Source Distribution

tensorBNN-0.10.0.tar.gz (23.8 kB view details)

Uploaded Source

Built Distribution


tensorBNN-0.10.0-py3-none-any.whl (25.0 kB view details)

Uploaded Python 3

File details

Details for the file tensorBNN-0.10.0.tar.gz.

File metadata

  • Download URL: tensorBNN-0.10.0.tar.gz
  • Upload date:
  • Size: 23.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/45.2.0 requests-toolbelt/0.9.1 tqdm/4.49.0 CPython/3.8.2

File hashes

Hashes for tensorBNN-0.10.0.tar.gz
Algorithm Hash digest
SHA256 613ddb877613d76146a9237bb230cfedb7818875cb27e44f37d4670014cc4e51
MD5 986a17f46e007e526136d19ee338022c
BLAKE2b-256 8b2954e3dd4b0550b3462693e852c664434041c6a95f021ba9bd2da09bfb0d19


File details

Details for the file tensorBNN-0.10.0-py3-none-any.whl.

File metadata

  • Download URL: tensorBNN-0.10.0-py3-none-any.whl
  • Upload date:
  • Size: 25.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/45.2.0 requests-toolbelt/0.9.1 tqdm/4.49.0 CPython/3.8.2

File hashes

Hashes for tensorBNN-0.10.0-py3-none-any.whl
Algorithm Hash digest
SHA256 a5d6642fe5d68d45e025dfb86a6fd505309659c4e3d1ad44c859e1dba9c23490
MD5 98fa650068da232be91ef4cdf7a5c603
BLAKE2b-256 0d25f6fd23c1de870c75943d2c2e9176908f4c96423995da6100634a42c58e7e

