Math on (Hyper-Dual) Tensors with Trailing Axes

These details have not been verified by PyPI

Project links

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Project description

tensortrax

 _                            
| |                          ████████╗██████╗  █████╗ ██╗  ██╗
| |_ ___ _ __  ___  ___  _ __╚══██╔══╝██╔══██╗██╔══██╗╚██╗██╔╝
| __/ _ \ '_ \/ __|/ _ \| '__|  ██║   ██████╔╝███████║ ╚███╔╝ 
| ||  __/ | | \__ \ (_) | |     ██║   ██╔══██╗██╔══██║ ██╔██╗ 
 \__\___|_| |_|___/\___/|_|     ██║   ██║  ██║██║  ██║██╔╝ ██╗
                                ╚═╝   ╚═╝  ╚═╝╚═╝  ╚═╝╚═╝  ╚═╝

Math on (Hyper-Dual) Tensors with Trailing Axes.

Made with love in Graz (Austria)

Features

Designed to operate on input arrays with trailing axes
Essential vector/tensor Hyper-Dual number math, including limited support for einsum (restricted to max. three operands)
Forward Mode Automatic Differentiation (AD) using Hyper-Dual Tensors, up to second order derivatives
Create functions in terms of Hyper-Dual Tensors
Evaluate the function, the gradient (jacobian) and the hessian of scalar-valued functions or functionals on given input arrays
Straight-forward definition of custom functions in variational-calculus notation
Stable gradient and hessian of eigenvalues eigvalsh in case of repeated equal eigenvalues

Not Features

Not imitating a full-featured NumPy (e.g. like Autograd)
No arbitrary-order gradients (only first- and second order gradients)

Why `tensortrax`?

Compared to other libaries which introduce a new (hyper-) dual dtype (treated as dtype=object in NumPy), tensortrax relies on its own Tensor class. This approach involves a re-definition of all essential math operations (and NumPy-functions), whereas the dtype-approach supports most operations (even NumPy) out of the box. However, in tensortrax NumPy operates on default data types (e.g. dtype=float). This allows to support functions like np.einsum(). Beside the differences concerning the underlying dtype, tensortrax is formulated on (tensorial) calculus of variation. Gradient- and hessian-vector products are evaluated with very little overhead compared to analytic formulations.

Usage

Let's define a scalar-valued function which operates on a tensor.

import tensortrax as tr
import tensortrax.math as tm

def fun(F, mu=1):
    C = F.T() @ F
    I1 = tm.trace(C)
    J = tm.linalg.det(F)
    return mu / 2 * (J ** (-2 / 3) * I1 - 3)

The hessian of the scalar-valued function w.r.t. the chosen function argument (here, wrt=0 or wrt="F") is evaluated by variational calculus (Forward Mode AD implemented as Hyper-Dual Tensors). The function is called once for each component of the hessian (symmetry is taken care of). The function and the gradient are evaluated with no additional computational cost. Optionally, the function calls are executed in parallel (threaded).

import numpy as np

# some random input data
np.random.seed(125161)
F = np.random.rand(3, 3, 8, 50) / 10
for a in range(3):
    F[a, a] += 1

# W = tr.function(fun, wrt=0, ntrax=2)(F)
# dWdF = tr.gradient(fun, wrt=0, ntrax=2)(F)
# d2WdF2, dWdF, W = tr.hessian(fun, wrt="F", ntrax=2, full_output=True)(F=F)
d2WdF2 = tr.hessian(fun, wrt="F", ntrax=2, parallel=False)(F=F)

Theory

The calculus of variation deals with variations, i.e. small changes in functions and functionals. A small-change in a function is evaluated by applying small changes on the tensor components.

\psi = \psi(\boldsymbol{F})

\delta \psi = \delta \psi(\boldsymbol{F}, \delta \boldsymbol{F})

Let's take the trace of a tensor product as an example. The variation is evaluated as follows:

\psi = tr(\boldsymbol{F}^T \boldsymbol{F}) = \boldsymbol{F} : \boldsymbol{F}

\delta \psi = \delta \boldsymbol{F} : \boldsymbol{F} + \boldsymbol{F} : \delta \boldsymbol{F} = 2 \ \boldsymbol{F} : \delta \boldsymbol{F}

The $P_{ij}$ - component of the jacobian $\boldsymbol{P}$ is now numerically evaluated by setting the respective variational component $\delta P_{ij}$ of the tensor to one and all other components to zero. In total, $i \cdot j$ function calls are necessary to assemble the full jacobian. For example, the $12$ - component is evaluated as follows:

\delta \boldsymbol{F}_{(12)} = \begin{bmatrix} 0 & 1 & 0 \\ 0 & 0 & 0 \\ 0 & 0 & 0 \end{bmatrix}

\delta_{(12)} \psi = \frac{\partial \psi}{\partial F_{12}} = 2 \ \boldsymbol{F} : \delta \boldsymbol{F}_{(12)} = 2 \ \boldsymbol{F} : \begin{bmatrix} 0 & 1 & 0 \\ 0 & 0 & 0 \\ 0 & 0 & 0 \end{bmatrix}

The second order variation, i.e. a variation applied on another variation of a function is evaluated in the same way as a first order variation.

\Delta \delta \psi = 2 \ \delta \boldsymbol{F} : \Delta \boldsymbol{F} + 2 \ \boldsymbol{F} : \Delta \delta \boldsymbol{F}

Once again, each component $A_{ijkl}$ of the fourth-order hessian is numerically evaluated. In total, $i \cdot j \cdot k \cdot l$ function calls are necessary to assemble the full hessian (without considering symmetry). For example, the $1223$ - component is evaluated by setting $\Delta \delta \boldsymbol{F} = \boldsymbol{0}$ and $\delta \boldsymbol{F}$ and $\Delta \boldsymbol{F}$ as follows:

\delta \boldsymbol{F}_{(12)} = \begin{bmatrix} 0 & 1 & 0 \\ 0 & 0 & 0 \\ 0 & 0 & 0 \end{bmatrix}

\Delta \boldsymbol{F}_{(23)} = \begin{bmatrix} 0 & 0 & 0 \\ 0 & 0 & 1 \\ 0 & 0 & 0 \end{bmatrix}

\Delta \delta \boldsymbol{F} = \begin{bmatrix} 0 & 0 & 0 \\ 0 & 0 & 0 \\ 0 & 0 & 0 \end{bmatrix}

\Delta_{(23)} \delta_{(12)} \psi = \Delta_{(12)} \delta_{(23)} \psi = \frac{\partial^2 \psi}{\partial F_{12}\ \partial F_{23}}

\Delta_{(23)} \delta_{(12)} \psi = 2 \ \delta \boldsymbol{F}_{(12)} : \Delta \boldsymbol{F}_{(23)} + 2 \ \boldsymbol{F} : \Delta \delta \boldsymbol{F}

Numeric calculus of variation in `tensortrax`

Each Tensor has four attributes: the (real) tensor array and the (hyper-dual) variational arrays. To obtain the above mentioned $12$ - component of the gradient and the $1223$ - component of the hessian, a tensor has to be created with the appropriate small-changes of the tensor components (dual arrays).

import tensortrax as tr
from tensortrax import Tensor, f, δ, Δ, Δδ
from tensortrax.math import trace

δF_12 = np.array([
    [0, 1, 0], 
    [0, 0, 0], 
    [0, 0, 0],
], dtype=float)

ΔF_23 = np.array([
    [0, 0, 0], 
    [0, 0, 1], 
    [0, 0, 0],
], dtype=float)

x = np.eye(3) + np.arange(9).reshape(3, 3) / 10
F = Tensor(x=x, δx=δF_12, Δx=ΔF_23, Δδx=None)
I1_C = trace(F.T() @ F)

The function as well as the gradient and hessian components are accessible as:

ψ      =  f(I1_C)
P_12   =  δ(I1_C) # (= Δ(I1_C))
A_1223 = Δδ(I1_C)

To obtain full gradients and hessians in one function call, tensortrax provides helpers (decorators) which handle the multiple function calls.

fun = lambda F: trace(F.T() @ F)

func = tr.function(fun)(x)
grad = tr.gradient(fun)(x)
hess = tr.hessian(fun)(x)

Evaluate the gradient- as well as the hessian-vector-product:

gvp = tr.gradient_vector_product(fun)(x, δx=x)
hvp = tr.hessian_vector_product(fun)(x, δx=x, Δx=x)

Extensions

Custom functions (extensions) are easy to implement in tensortrax. Beside the function expression, three additional (dual) variation expressions have to be defined.

import numpy as np
from tensortrax import Tensor, f, δ, Δ, Δδ

def sin(A):
    return Tensor(
        x=np.sin(f(A)),
        δx=np.cos(f(A)) * δ(A),
        Δx=np.cos(f(A)) * Δ(A),
        Δδx=-np.sin(f(A)) * δ(A) * Δ(A) + np.cos(f(A)) * Δδ(A),
        ntrax=A.ntrax,
    )

x = np.eye(3)
y = sin(Tensor(x))

Hint: Feel free to contribute missing math-functions to tensortrax/math/_math_tensor.py :page_with_curl: :pencil2:.

Project details

These details have not been verified by PyPI

Project links

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Release history Release notifications | RSS feed

0.18.1

Mar 4, 2024

0.18.0

Feb 28, 2024

0.17.1

Oct 3, 2023

0.17.0

Aug 10, 2023

0.16.1

Jul 26, 2023

0.15.2

Jul 24, 2023

0.15.1

May 24, 2023

0.15.0

May 24, 2023

0.14.0

May 24, 2023

0.13.0

May 24, 2023

0.12.1

May 10, 2023

0.12.0

Apr 19, 2023

0.11.0

Apr 17, 2023

0.10.0

Mar 27, 2023

0.9.0

Feb 15, 2023

0.8.5

Feb 13, 2023

0.8.4

Feb 12, 2023

0.8.3

Feb 12, 2023

0.8.2

Feb 10, 2023

0.8.1

Feb 8, 2023

0.8.0

Feb 7, 2023

0.7.0

Feb 6, 2023

0.6.0

Feb 4, 2023

0.5.1

Feb 3, 2023

0.5.0

Jan 29, 2023

0.4.0

Jan 28, 2023

0.3.0

Jan 24, 2023

0.2.9

Jan 2, 2023

0.2.8

Jan 1, 2023

0.2.7

Dec 30, 2022

0.2.6

Dec 17, 2022

0.2.5

Dec 13, 2022

0.2.4

Dec 13, 2022

0.2.3

Dec 12, 2022

0.2.2

Dec 11, 2022

This version

0.2.1

Dec 11, 2022

0.2.0

Dec 10, 2022

0.1.9

Dec 10, 2022

0.1.8

Dec 10, 2022

0.1.7

Dec 10, 2022

0.1.6

Dec 9, 2022

0.1.5

Dec 9, 2022

0.1.4

Dec 9, 2022

0.1.3

Dec 8, 2022

0.1.2

Dec 7, 2022

0.1.0

Dec 7, 2022

0.0.14

Dec 6, 2022

0.0.13

Dec 6, 2022

0.0.11

Dec 6, 2022

0.0.10

Dec 6, 2022

0.0.9

Dec 4, 2022

0.0.8

Dec 4, 2022

0.0.7

Dec 4, 2022

0.0.6

Dec 2, 2022

0.0.5

Dec 2, 2022

0.0.4

Dec 1, 2022

0.0.3

Nov 30, 2022

0.0.2

Nov 30, 2022

0.0.1

Nov 30, 2022

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tensortrax-0.2.1.tar.gz (52.1 kB view hashes)

Uploaded Dec 11, 2022 Source

Built Distribution

tensortrax-0.2.1-py3-none-any.whl (41.3 kB view hashes)

Uploaded Dec 11, 2022 Python 3

Hashes for tensortrax-0.2.1.tar.gz

Hashes for tensortrax-0.2.1.tar.gz
Algorithm	Hash digest
SHA256	`884fe323d42cd537bb5dc6b06c8cfb57606c5cab946ab9d2ffdb70f8120760e8`
MD5	`083a823dbf12c6df85ad182c4740fc51`
BLAKE2b-256	`688a13be241d490c94c44148ade65cf1a1488a6ad563b03e40e00e4aa27f6de0`

Hashes for tensortrax-0.2.1-py3-none-any.whl

Hashes for tensortrax-0.2.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`ccf5562102e53656450432c4b1b53c47e1c093d72606fd0d3984566db53110e7`
MD5	`1af2df678cf91fe67a00eaa2b896ae7e`
BLAKE2b-256	`5e297d326ab409d62e5c91d7fa75ecb21ab8b91991b0372a83ba791da480b936`

tensortrax 0.2.1

Navigation

Verified details

Maintainers

Unverified details

Project links

GitHub Statistics

Meta

Classifiers

Project description

tensortrax

Features

Not Features

Why `tensortrax`?

Usage

Theory

Numeric calculus of variation in `tensortrax`

Extensions

Project details

Verified details

Maintainers

Unverified details

Project links

GitHub Statistics

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

tensortrax 0.2.1

Navigation

Verified details

Maintainers

Unverified details

Project links

GitHub Statistics

Meta

Classifiers

Project description

tensortrax

Features

Not Features

Why tensortrax?

Usage

Theory

Numeric calculus of variation in tensortrax

Extensions

Project details

Verified details

Maintainers

Unverified details

Project links

GitHub Statistics

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

Why `tensortrax`?

Numeric calculus of variation in `tensortrax`