nobuco

Pytorch to Tensorflow conversion made intuitive

No Bullshit Converter is a tool that helps you translate Pytorch models into Keras/Tensorflow graphs without losing your mind.

Supports a wide range of architectures
- Control flow ops (If, For, While)
- Recurrent layers (LSTM, GRU)
- Arbitrary torch functions
Simple
Flexible
Efficient
Sanity-preserving, with clear mistake messaging

Installation

pip install -U nobuco

Essentials
Channel order wizardry
In-place operations
Implementation mismatch: pick your poison
Going dynamic
- Control flows
- Dynamic shapes
Ad hoc modifications
So we put a converter inside your converter
But was it worth it?
Nobuco knowledge base

Essentials

Suppose we want to convert a Pytorch module similar to this one:

import torch
from torch import nn


class MyModule(nn.Module):
    def __init__(self):
        super().__init__()
        self.conv = nn.Conv2d(3, 16, kernel_size=(3, 3), padding=(1, 1), stride=(2, 2))

    def forward(self, x):
        x = self.conv(x)
        x = nn.Hardsigmoid()(x)
        x = 1 - x[:, ::2] * x[:, 1::2]
        return x

The process is exactly what you would expect. Instantiate the module, create dummy inputs and call the magic function:

import tensorflow as tf
import torch.nn.functional as F

import nobuco
from nobuco import ChannelOrder, ChannelOrderingStrategy
from nobuco.layers.weight import WeightLayer

dummy_image = torch.rand(size=(1, 3, 256, 256))
pytorch_module = MyModule().eval()

keras_model = nobuco.pytorch_to_keras(
    pytorch_module,
    args=[dummy_image], kwargs=None,
    inputs_channel_order=ChannelOrder.TENSORFLOW,
    outputs_channel_order=ChannelOrder.TENSORFLOW
)

Aaaand done! That's all it takes to... hold on, what's that?

Nobuco says it doesn't know how to handle hard sigmoid. Apparently, it's our job to provide a node converter for either F.hardsigmoid or the enclosing Hardsigmoid module (or the entire MyModule, but that makes little sense). Here, we'll go for the former.

Conversion is done directly. No layers upon layers of abstraction, no obscure intermediate representation. A node converter is just a Callable that takes in the same arguments as the corresponding node in Pytorch and outputs an equivalent node in Tensorflow. The converted node preserves the original node's signature, but with Pytorch tensors replaced by their Tensorflow counterparts (be that tf.Tensor, KerasTensor, tf.Variable, or ResourceVariable).

This should do the trick:

@nobuco.converter(F.hardsigmoid, channel_ordering_strategy=ChannelOrderingStrategy.MINIMUM_TRANSPOSITIONS)
def hardsigmoid(input: torch.Tensor, inplace: bool = False):
    return lambda input, inplace=False: tf.keras.activations.hard_sigmoid(input)

It works, but the outputs don't quite match. Perhaps we should check how Pytorch and Tensorflow define hard sigmoid. And sure enough, their implementations differ. Have to type in the formula manually, I guess...
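For reference, here is how the two definitions compare in plain Python. This is a sketch with floats standing in for tensors. The Pytorch formula is relu6(x + 3) / 6; the Tensorflow one is assumed to be the Keras 2.x definition clip(0.2 * x + 0.5, 0, 1), which may differ in other Keras versions. Both function names are made up for the illustration.

```python
def hardsigmoid_pytorch(x: float) -> float:
    # Pytorch: relu6(x + 3) / 6, which equals clip(x / 6 + 1 / 2, 0, 1)
    return min(max(x / 6 + 0.5, 0.0), 1.0)


def hardsigmoid_keras2(x: float) -> float:
    # Assumption: Keras 2.x definition, clip(0.2 * x + 0.5, 0, 1)
    return min(max(0.2 * x + 0.5, 0.0), 1.0)


# Largest disagreement over a coarse grid of inputs in [-4, 4]
grid = [i / 10 for i in range(-40, 41)]
max_gap = max(abs(hardsigmoid_pytorch(x) - hardsigmoid_keras2(x)) for x in grid)
```

The slopes differ (1/6 versus 0.2), so the two only agree at zero and where both are saturated.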

@nobuco.converter(F.hardsigmoid, channel_ordering_strategy=ChannelOrderingStrategy.MINIMUM_TRANSPOSITIONS)
def hardsigmoid(input: torch.Tensor, inplace: bool = False):
    return lambda input, inplace=False: tf.clip_by_value(input/6 + 1/2, clip_value_min=0, clip_value_max=1)

And the happy result:

The example above is artificial, but it illustrates the point. It's not feasible to provide a node converter for every existing Pytorch op. There are literally thousands of them! A converter that tried would constantly lack essential functionality, be riddled with bugs, do weird stuff and break apart with every other PT/TF release. The best we can do is keep the tool simple and customizable, make it clear where a problem comes from and let the user sort things out. Usually it's easy for a human to translate an isolated operation from one framework to another. Reproducing the graph structure is a different matter entirely. For that, Nobuco has you covered!

Nobuco lets you intervene in conversion at each step, asks for help where needed and doesn't bother you with routine stuff.
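To make the "a node converter is just a Callable" idea concrete, here is a toy registry in plain Python. It is a sketch of the pattern only, not Nobuco's internals: REGISTRY, converter, convert_call and source_tanh are made-up names, and floats stand in for tensors.

```python
import math

REGISTRY = {}  # hypothetical: maps a source op to its node converter


def converter(source_op):
    """Register the decorated function as the converter for source_op."""
    def register(fn):
        REGISTRY[source_op] = fn
        return fn
    return register


def source_tanh(x):
    # Stand-in for an op in the source framework
    return math.tanh(x)


@converter(source_tanh)
def convert_tanh(x):
    # Takes the same arguments the source op was called with and
    # returns a callable with the same signature for the target side.
    return lambda x: (math.exp(2 * x) - 1) / (math.exp(2 * x) + 1)


def convert_call(op, *args, **kwargs):
    """Look up the converter for op; complain clearly if there is none."""
    if op not in REGISTRY:
        raise KeyError(f"No converter registered for {op.__name__}")
    return REGISTRY[op](*args, **kwargs)


target_tanh = convert_call(source_tanh, 0.5)
```

An op without a registered converter raises a clear error instead of being silently skipped, which is the same "ask the user for help" behavior described above.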

https://user-images.githubusercontent.com/2457934/233740603-cc11acc5-cd6b-48c8-b089-ff3ead772dd0.mp4

With an IDE, you can jump right to where the node was [I]nvoked, [D]efined and [C]onverted.

Channel order wizardry

Some operations assume their input tensors have a channel dimension. And as you probably know, Pytorch and Tensorflow do not agree on the layout of such tensors. Pytorch adopts channel-first layout (BCH, BCHW, etc.) while Tensorflow works efficiently with channel-last tensors (BHC, BHWC, ...). Transposing tensors between the two layouts incurs non-trivial overhead because, in general, tensor data must be physically rearranged. In an effort to keep that overhead to a minimum, Nobuco does layout coercions lazily. A couple of things are needed to make it possible:

Tensorflow tensors are augmented with an additional property which stores their channel order, either Pytorch (channel first) or Tensorflow (channel last) style.
Node converters have requirements on what channel order their inputs must have. Said requirements are expressed with the channel_ordering_strategy argument.
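As a quick illustration of what a layout coercion amounts to, the sketch below computes the axis permutations between channel-first and channel-last layouts for a tensor of any rank. The helper names are hypothetical and only shapes are permuted here; a real transpose also moves the data.

```python
def to_channel_last(rank: int) -> list:
    """Axes permutation moving the channel axis (index 1) to the end."""
    return [0] + list(range(2, rank)) + [1]


def to_channel_first(rank: int) -> list:
    """Axes permutation moving the last axis back to index 1."""
    return [0, rank - 1] + list(range(1, rank - 1))


def permute(shape, perm):
    """Apply an axes permutation to a shape tuple."""
    return tuple(shape[i] for i in perm)


bchw = (1, 3, 256, 256)
bhwc = permute(bchw, to_channel_last(len(bchw)))
round_trip = permute(bhwc, to_channel_first(len(bhwc)))
```

For a BCHW tensor the permutation is (0, 2, 3, 1), and (0, 3, 1, 2) takes it back.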

The channel ordering strategies are:

FORCE_TENSORFLOW_ORDER
- Input tensors will be coerced to Tensorflow channel order.
- Convenient for converting channel-aware operations (convolution, batchnorm).
FORCE_PYTORCH_ORDER
- Input tensors entering the node will look exactly as they do in the original Pytorch graph.
- Use it when the node does not interpret its input tensors as having a channel dimension (linear, matmul).
MINIMUM_TRANSPOSITIONS
- The channel order is decided by a majority vote (whichever prevails among the inputs). This way the number of coercions (i.e. tensor transpositions) is kept to the minimum. It also means whenever there's only one input, it will be left untouched.
- Best choice for element-wise ops (most activations).
MANUAL
- You are on your own. In exchange for unrestricted freedom, you take responsibility to coerce input tensors to suitable channel order and to also annotate output tensors with their order.
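The majority vote behind MINIMUM_TRANSPOSITIONS can be sketched in a few lines. This is an illustration of the idea, not Nobuco's code: the function name and the string constants are made up, and the tie-breaking rule (prefer Tensorflow order) is an assumption.

```python
from collections import Counter

PYTORCH, TENSORFLOW = "pytorch", "tensorflow"


def minimum_transpositions(input_orders, tie_breaker=TENSORFLOW):
    """Pick the channel order that needs the fewest input transpositions.

    Returns the chosen order and how many inputs must be transposed.
    """
    counts = Counter(input_orders)
    if counts[PYTORCH] == counts[TENSORFLOW]:
        target = tie_breaker  # assumption: ties go to the tie_breaker order
    else:
        target = max((PYTORCH, TENSORFLOW), key=lambda order: counts[order])
    n_transposes = sum(1 for order in input_orders if order != target)
    return target, n_transposes


three_inputs = minimum_transpositions([TENSORFLOW, TENSORFLOW, PYTORCH])
single_input = minimum_transpositions([PYTORCH])
```

With a single input the vote is trivially won by that input's order, so nothing gets transposed.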

The simple lazy approach works wonders in most situations, but sometimes it produces suboptimal graphs. Consider the code below. Imagine this is some sort of text processing network. It first applies a GRU layer which assumes the inputs do not have a channel dimension, so its input/output layouts are the same in both Pytorch and Tensorflow. But then, the outputs are passed to a couple of 1D convolutions which are channel-aware. Because of that, a transpose op must be put in the converted graph.

class MyModule(nn.Module):
    def __init__(self):
        super().__init__()
        self.gru = nn.GRU(32, 128, num_layers=1, batch_first=True, bidirectional=False)
        self.conv1 = nn.Conv1d(12, 40, kernel_size=3, padding=1)
        self.conv2 = nn.Conv1d(12, 60, kernel_size=1, padding=0)

    def forward(self, x):
        x, hx = self.gru(x)
        x1 = self.conv1(x)
        x2 = self.conv2(x)
        return x1, x2

pytorch_module = MyModule().eval()

inputs = [
    torch.normal(0, 1, size=(1, 12, 32)),
]
keras_model = nobuco.pytorch_to_keras(
    pytorch_module, inputs,
    inputs_channel_order=ChannelOrder.PYTORCH,
)

The laziness shoots us in the foot here, and we get not one transpose but two:

For such occasions, there are two sibling functions: force_tensorflow_order and force_pytorch_order.

x, hx = self.gru(x)
x = nobuco.force_tensorflow_order(x)
x1 = self.conv1(x)
x2 = self.conv2(x)

In case you are curious, the implementation is trivial:

@nobuco.traceable
def force_tensorflow_order(inputs):
    return inputs


@nobuco.converter(force_tensorflow_order, channel_ordering_strategy=ChannelOrderingStrategy.FORCE_TENSORFLOW_ORDER)
def converter_force_tensorflow_order(inputs):
    return lambda inputs: inputs

force_pytorch_order is defined analogously.

In-place operations

Nobuco can handle most situations where tensors are modified in-place. For instance, these will work just fine:

class MyModule(nn.Module):
    def forward(self, x):
        x[:, 1:2, 16:25, 8::2] *= 2
        torch.relu_(x)
        return x

However, applying an in-place operation to a slice yields an incorrect result. What gives?

class MyModule(nn.Module):
    def forward(self, x):
        torch.relu_(x[:, 1:2, 16:25, 8::2])
        return x

You see, Tensorflow graphs (and many other formats like ONNX) do not support in-place ops. So when we take a slice (x[:, 1:2, 16:25, 8::2]) in TF/ONNX, the result is not a view of the original tensor but a copy. This copy is then passed to relu (which is not in-place either), and its result is not used anywhere. As you can see above, the output tensors of __getitem__ and relu_ are grayed out, and these operations are excluded from the graph. In fact, it's empty:

The easiest way of fixing this is to explicitly assign the result to the slice. Conveniently enough, most standard in-place operations in Pytorch do return their modified arguments as outputs.

class MyModule(nn.Module):
    def forward(self, x):
        x[:, 1:2, 16:25, 8::2] = torch.relu_(x[:, 1:2, 16:25, 8::2])
        return x
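The copy-versus-view distinction is easy to reproduce with plain Python lists, where slicing also returns a copy. This is only an analogy for what happens in the converted graph, not Tensorflow code; relu here is a hypothetical list helper.

```python
def relu(values):
    """Element-wise max(0, v) on a list of floats."""
    return [max(0.0, v) for v in values]


x = [-1.0, 2.0, -3.0, 4.0]

# Slicing a list makes a copy, so changing the copy leaves x untouched.
piece = x[0:2]
piece[:] = relu(piece)
after_slice_only = list(x)

# Assigning the result back to the slice is what actually updates x.
x[0:2] = relu(x[0:2])
after_write_back = list(x)
```

The first attempt changes only the throwaway copy; the explicit write-back is what makes the update visible, just like in the fixed module above.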

Implementation mismatch: pick your poison

Sometimes, Pytorch and Tensorflow just don't get along. In the case of hardsigmoid, it's a mere inconvenience, but it can be much more sinister.

Take the model below, for example.

class MyModule(nn.Module):
    def __init__(self):
        super().__init__()
        self.factor = 4
        self.conv = nn.Conv2d(3*self.factor**2, 3*self.factor**2, kernel_size=1)

    def forward(self, x):
        x = nn.PixelUnshuffle(self.factor)(x)
        x = self.conv(x)
        x = nn.PixelShuffle(self.factor)(x)
        return x

Ideally, there would only be three nodes in the converted graph. That's not what we get, though.

Tensorflow does not have pixel_unshuffle/pixel_shuffle. Their closest counterparts, tf.nn.space_to_depth/tf.nn.depth_to_space, do almost the same thing but not quite: output channels are in a different order. The order must be fixed with a pricey transpose, no way around that. Or is there?
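To see what "a different order" means, here is a sketch that computes where each input element lands along the channel axis under the two ops. It assumes the documented layouts: pixel_unshuffle keeps the original channel as the slow index, while space_to_depth (NHWC) makes the position inside the block the slow index. The function names are made up for the illustration.

```python
def pixel_unshuffle_channel(c, i, j, r):
    # Pytorch: output channel for input channel c, block offset (i, j)
    return c * r * r + i * r + j


def space_to_depth_channel(c, i, j, r, n_channels):
    # Tensorflow (NHWC): the block offset is the high-order part of the index
    return (i * r + j) * n_channels + c


def channel_permutation(n_channels, r):
    """perm[k] is the pixel_unshuffle channel that matches
    space_to_depth channel k."""
    perm = [0] * (n_channels * r * r)
    for c in range(n_channels):
        for i in range(r):
            for j in range(r):
                k = space_to_depth_channel(c, i, j, r, n_channels)
                perm[k] = pixel_unshuffle_channel(c, i, j, r)
    return perm


perm_rgb = channel_permutation(n_channels=3, r=2)
perm_gray = channel_permutation(n_channels=1, r=2)
```

With a single input channel the two orders coincide; with more channels they are interleaved differently, and undoing that interleave is the transpose in question.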

Instead of emulating an absent Pytorch op in Tensorflow, we might do the procedure in reverse: provide a Pytorch implementation for the Tensorflow node we want to convert to. The overhead would be carried by the original Pytorch model leaving the converted graph nice and clean.

from nobuco.addons.torch.space_to_depth import SpaceToDepth
from nobuco.addons.torch.depth_to_space import DepthToSpace


class MyModuleTFOptimized(nn.Module):
    def __init__(self):
        super().__init__()
        self.factor = 4
        self.conv = nn.Conv2d(3*self.factor**2, 3*self.factor**2, kernel_size=1)

    def forward(self, x):
        x = SpaceToDepth(self.factor)(x)
        x = self.conv(x)
        x = DepthToSpace(self.factor)(x)
        return x

Torch-optimized

Torch implementation

F.pixel_unshuffle

Tensorflow converter

@nobuco.converter(F.pixel_unshuffle,
                  channel_ordering_strategy=ChannelOrderingStrategy.FORCE_TENSORFLOW_ORDER)
def converter_pixel_unshuffle(input: Tensor, downscale_factor: _int):
    def func(input, downscale_factor):
        x = tf.nn.space_to_depth(input, downscale_factor)
        x = channel_interleave2d(x, downscale_factor, reverse=True)
        return x
    return func


def channel_interleave2d(x, block_size: int, reverse: bool):
    b, h, w, c = x.shape
    n_blocks = block_size ** 2

    if reverse:
        x = tf.reshape(x, (b, h, w, n_blocks, c // n_blocks))
    else:
        x = tf.reshape(x, (b, h, w, c // n_blocks, n_blocks))

    x = tf.transpose(x, (0, 1, 2, 4, 3))
    x = tf.reshape(x, (b, h, w, c))
    return x

Tensorflow-optimized

Torch implementation

class SpaceToDepth(nn.Module):
    def __init__(self, block_size):
        super().__init__()
        self.block_size = block_size

    def forward(self, input):
        x = F.pixel_unshuffle(input, self.block_size)
        x = channel_interleave2d(x, self.block_size, reverse=False)
        return x

    
def channel_interleave2d(x: torch.Tensor, block_size: int, reverse: bool) -> torch.Tensor:
    b, c, h, w = x.shape
    n_blocks = block_size ** 2

    if reverse:
        x = x.view(b, n_blocks, c // n_blocks, h, w)
    else:
        x = x.view(b, c // n_blocks, n_blocks, h, w)

    x = x.transpose(1, 2).reshape(b, c, h, w)
    return x

Tensorflow converter

@nobuco.converter(SpaceToDepth, 
                  channel_ordering_strategy=ChannelOrderingStrategy.FORCE_TENSORFLOW_ORDER)
def converter_space_to_depth(self, input: torch.Tensor):
    return lambda input: tf.nn.space_to_depth(input, self.block_size)

Going dynamic

Control flows

Introducing Python control flow statements into the compute graph is no easy feat. Tensorflow can do so via tf.autograph, but at the cost of added system complexity and with some notable limitations. Stuff like that is way above Nobuco's paygrade, so the following module cannot be properly handled without human intervention.

class ControlIf(nn.Module):
    def __init__(self):
        super().__init__()
        self.conv_pre = nn.Conv2d(3, 16, kernel_size=(1, 1))
        self.conv_true = nn.Conv2d(16, 32, kernel_size=(1, 1))
        self.conv_false = nn.Conv2d(16, 32, kernel_size=(1, 1))
        self.conv_shared = nn.Conv2d(32, 32, kernel_size=(1, 1))

    def forward(self, x):
        x = self.conv_pre(x)
        if x.mean() > 0:
            x = self.conv_true(x)
            x = torch.tanh(x)
            x = self.conv_shared(x)
            x = x + 1
        else:
            x = self.conv_false(x)
            x = torch.sigmoid(x)
            x = self.conv_shared(x)
            x = x - 1
        x = self.conv_shared(x)
        return x

Of course, it's possible to translate the dynamic module into a Tensorflow layer (don't forget to decorate it with @tf.function for autograph to kick in). But what if it contains inner modules, do you replicate them in Tensorflow all by hand? Not unless you want to! Just convert them separately and use the resulting graphs inside the parent layer.

class ControlIfKeras(tf.keras.layers.Layer):
    def __init__(self, conv_pre, conv_true, conv_false, conv_shared, *args, **kwargs):
        super().__init__(*args, **kwargs)
        self.conv_pre = conv_pre
        self.conv_true = conv_true
        self.conv_false = conv_false
        self.conv_shared = conv_shared

    def get_config(self):
        config = super().get_config()
        config.update({
            "conv_pre": self.conv_pre,
            "conv_true": self.conv_true,
            "conv_false": self.conv_false,
            "conv_shared": self.conv_shared,
        })
        return config

    @tf.function
    def call(self, x):
        x = self.conv_pre(x)
        if tf.reduce_mean(x) > 0:
            x = self.conv_true(x)
            x = tf.tanh(x)
            x = self.conv_shared(x)
            x = x + 1
        else:
            x = self.conv_false(x)
            x = tf.sigmoid(x)
            x = self.conv_shared(x)
            x = x - 1
        x = self.conv_shared(x)
        return x


@nobuco.converter(ControlIf, channel_ordering_strategy=ChannelOrderingStrategy.FORCE_TENSORFLOW_ORDER)
def converter_ControlIf(self, x):
    order = ChannelOrder.TENSORFLOW
    kwargs = {'inputs_channel_order': order, 'outputs_channel_order': order, 'return_outputs_pt': True}
    
    conv_pre, out_pre = nobuco.pytorch_to_keras(self.conv_pre, [x], **kwargs)
    conv_true, out_true = nobuco.pytorch_to_keras(self.conv_true, [out_pre], **kwargs)
    conv_false, out_false = nobuco.pytorch_to_keras(self.conv_false, [out_pre], **kwargs)
    conv_shared, _ = nobuco.pytorch_to_keras(self.conv_shared, [out_true], **kwargs)
    layer = ControlIfKeras(conv_pre, conv_true, conv_false, conv_shared)
    return layer

See examples for other ways to convert control flow ops.

Dynamic shapes

What if we wanted our module to accept images of arbitrary height and width? Can we have that? Let's try:

class DynamicShape(nn.Module):
    def __init__(self):
        super().__init__()
        self.conv = nn.Conv2d(3, 16, kernel_size=(1, 1))

    def forward(self, x):
        x = self.conv(x)

        # Produces static shape
        b, c, h, w = x.shape

        x = x[:, :, h//3:, w//3:]
        return x


input = torch.normal(0, 1, size=(1, 3, 128, 128))
pytorch_module = DynamicShape().eval()

keras_model = nobuco.pytorch_to_keras(
    pytorch_module,
    args=[input],
    input_shapes={input: (None, 3, None, None)}, # Annotate dynamic axes with None
    inputs_channel_order=ChannelOrder.TENSORFLOW,
    outputs_channel_order=ChannelOrder.TENSORFLOW,
)

Something's not right. We don't see shape extraction ops in the debug output or the graph:

That's not surprising, actually. In Pytorch, tensor shape is a tuple of regular integers, not tensors, so it's quite difficult to track them. nobuco.shape solves this problem. This function returns tensors, much like tf.shape does:

# Allows for dynamic shape
b, c, h, w = nobuco.shape(x)
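Why plain integers are a problem can be shown with a toy "tracer" in pure Python. This is an analogy, not how Nobuco traces: both functions are hypothetical, and a list stands in for one image axis.

```python
def trace_with_static_shape(example_row):
    """Mimics tracing where the shape is a plain int."""
    h = len(example_row)   # evaluated once, while tracing
    start = h // 3         # baked into the result as a constant
    return lambda row: row[start:]


def trace_with_dynamic_shape(example_row):
    """Mimics tracing where the shape stays part of the graph."""
    # The length is looked up every time the traced function runs.
    return lambda row: row[len(row) // 3:]


example = list(range(128))
static_graph = trace_with_static_shape(example)
dynamic_graph = trace_with_dynamic_shape(example)

bigger = list(range(256))
static_len = len(static_graph(bigger))    # still cuts at 128 // 3
dynamic_len = len(dynamic_graph(bigger))  # cuts at 256 // 3
```

Both versions agree on the 128-element example they were traced with and only diverge once the input size changes, which is why the problem is easy to miss.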

It's also possible to automatically substitute every .shape/.size call with nobuco.shape during the tracing phase by setting the trace_shape flag:

keras_model = nobuco.pytorch_to_keras(
  # ...
  trace_shape=True
)

Ad hoc modifications

Let's say, for illustrative purposes, that we prefer putting batchnorm before convolution. We were sure TFLiteConverter would fuse these two linear operations into one. Alas, it failed to meet our expectations. Can we still get the fusion to work without re-training or messing around with the model checkpoint?

class FusibleModule(nn.Module):
    def __init__(self):
        super().__init__()
        self.bn = nn.BatchNorm2d(3)
        self.conv = nn.Conv2d(3, 16, kernel_size=(3, 3), padding=(0, 0))
        self.act = nn.ReLU()

    def forward(self, x):
        x = self.bn(x)
        x = self.conv(x)
        x = self.act(x)
        return x

Here's one way to do it:

Wrap the two ops in a Callable. Decorate it with @nobuco.traceable.
Make a custom converter for it which does the desired optimization.

class FusibleModule(nn.Module):
    # ...

    @nobuco.traceable
    def bn_conv(self, x):
        x = self.bn(x)
        x = self.conv(x)
        return x

    def forward(self, x):
        x = self.bn_conv(x)
        x = self.act(x)
        return x

import numpy as np


@nobuco.converter(FusibleModule.bn_conv, channel_ordering_strategy=ChannelOrderingStrategy.FORCE_TENSORFLOW_ORDER)
def converter_bn_conv(self, x):
    order = ChannelOrder.TENSORFLOW
    bn, out_bn = nobuco.pytorch_to_keras(self.bn, [x], inputs_channel_order=order, outputs_channel_order=order, return_outputs_pt=True)
    conv = nobuco.pytorch_to_keras(self.conv, [out_bn], inputs_channel_order=order, outputs_channel_order=order)

    gamma, beta, moving_mean, moving_variance = bn.get_weights()
    kernel, bias = conv.get_weights()
    eps = self.bn.eps

    '''
    y = gamma * (x - moving_mean) / sqrt(moving_variance + eps) + beta
    z = kernel * y + bias
    =>
    z = kernel_fused * x + bias_fused WHERE
    kernel_fused = kernel * gamma / sqrt(moving_variance + eps)
    bias_fused = -kernel_fused * moving_mean + kernel * beta + bias
    '''
    kernel_fused = kernel * (gamma / np.sqrt(moving_variance + eps))[None, None, :, None]
    bias_fused = (-kernel_fused * moving_mean[None, None, :, None] + kernel * beta[None, None, :, None]).sum(axis=(0, 1, 2)).flatten() + bias
    conv.set_weights([kernel_fused, bias_fused])
    return lambda self, x: conv(x)
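The algebra in the docstring can be sanity-checked on the simplest possible case: one channel and a 1x1 kernel, so every quantity is a scalar. This is a sketch with arbitrary made-up parameter values, not part of the converter.

```python
import math


def batchnorm(x, gamma, beta, mean, var, eps):
    """Inference-mode batchnorm on a scalar."""
    return gamma * (x - mean) / math.sqrt(var + eps) + beta


def fuse_bn_into_conv(kernel, bias, gamma, beta, mean, var, eps):
    """Fold a preceding batchnorm into a scalar 'convolution'."""
    scale = gamma / math.sqrt(var + eps)
    kernel_fused = kernel * scale
    bias_fused = -kernel_fused * mean + kernel * beta + bias
    return kernel_fused, bias_fused


def check(x):
    """Return (bn followed by conv, fused conv) for input x."""
    params = dict(gamma=1.5, beta=-0.2, mean=0.3, var=0.8, eps=1e-5)
    kernel, bias = 0.7, 0.1
    reference = kernel * batchnorm(x, **params) + bias
    kernel_fused, bias_fused = fuse_bn_into_conv(kernel, bias, **params)
    return reference, kernel_fused * x + bias_fused


results = [check(x) for x in (-2.0, 0.0, 3.5)]
```

Note that the equivalence relies on the convolution having no padding, as in FusibleModule: zero-padded borders would see the raw zeros rather than their batchnorm output.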

So we put a converter inside your converter

As we've learned, Nobuco gets confused when an in-place operation is applied to a slice. There's a way to fix that, but let's not do it now. Instead, we'll use it as an excuse to explain the concept of nested converters. So, for this module, conversion will give us an incorrect result:

class SliceReLU(nn.Module):
    def forward(self, x):
        # Gives incorrect result after conversion
        torch.relu_(x[:, 1:2, 16:25, 8::2])
        # That's the recommended approach, but we're not going for it now
        # x[:, 1:2, 16:25, 8::2] = torch.relu_(x[:, 1:2, 16:25, 8::2])
        return x


class MyModule(nn.Module):
    def __init__(self):
        super().__init__()
        self.conv = nn.Conv2d(3, 3, kernel_size=(3, 3), padding=(1, 1))

    def forward(self, x):
        x = self.conv(x)
        SliceReLU()(x)
        return x

We've seen it's possible to invoke a Nobuco converter inside another Nobuco converter. Can we embed some third-party converter? You bet! Why? Because it might just do what we need. Let's consider the standard route: Pytorch -> ONNX -> Tensorflow, with the latter step done with onnx-tf. This library likes transposing stuff so much that converting the whole graph with it may introduce intolerable inference overhead. Nonetheless, it does the job. A sensible tradeoff would be to wrap the problematic operation into its own nn.Module and give it special treatment, while handling everything else with Nobuco.

import onnx
from onnx_tf.backend import prepare


@nobuco.converter(SliceReLU, channel_ordering_strategy=ChannelOrderingStrategy.FORCE_PYTORCH_ORDER, reusable=False)
def converter_SliceReLU(self, x):
    model_path = 'slice_relu'
    onnx_path = model_path + '.onnx'

    # NB: onnx.export is implemented via tracing, i.e. it may modify the inputs!
    torch.onnx.export(self, (x,), onnx_path, opset_version=12, input_names=['input'],
                      dynamic_axes={'input': [0, 1, 2, 3]}
                      )
    onnx_model = onnx.load(onnx_path)
    tf_rep = prepare(onnx_model)
    tf_rep.export_graph(model_path)
    model = tf.keras.models.load_model(model_path)
    return tf.keras.layers.Lambda(lambda x: model(input=x))

But was it worth it?

Let's cut to the chase, here are the numbers.

mobilenet_v3_large (26.8 Mb)

	nobuco	onnx_tf	speedup
x86 (XNNPACK)	11.1 ms	14.7 ms	1.3x
Arm CPU (XNNPACK)	24.3 ms	40.3 ms	1.6x
Arm GPU (OpenCL)	21.3 ms	192.6 ms	9x

deeplabv3_resnet50 (158.5 Mb)

	nobuco	onnx_tf	speedup
x86 (XNNPACK)	1.25 s	1.34 s	1.07x
Arm CPU (XNNPACK)	2.0 s	2.7 s	1.35x
Arm GPU (OpenCL)	1.6 s	2.6 s	1.62x

As we can see, redundant transpositions may completely ruin the performance, especially on a GPU. But that's not the only issue. Let's test this:

class SliceReLU(nn.Module):
    def forward(self, x):
        x[:, 1:2, 16:25, 8::2] = torch.relu_(x[:, 1:2, 16:25, 8::2])
        return x

	nobuco	onnx_tf	speedup
x86 (XNNPACK)	0.40 ms	1.57 ms	3.9x
Arm CPU	4.6 ms	2.9 ms	0.6x
Arm CPU (XNNPACK)	2.1 ms	FAIL	—
Arm GPU (OpenCL)	21.8 ms	FAIL	—

Again, the graph obtained with onnx_tf is much slower on x86 CPU. Worse yet, on a mobile processor, the optimized TFLite delegates for both GPU and CPU failed. No transpose ops were added this time, so who's to blame? Just look what torch.onnx.export gives us:

slice_relu.onnx

onnx_tf does a fair job optimizing the monstrosity of a graph it's given, but combining consecutive slice ops seems to be too much to ask. It also leaves out garbage nodes sometimes (note the free-floating While in this example).

Nobuco evades these types of problems by simply not dealing with onnx.

slice_relu_nobuco	slice_relu_onnxtf

Nobuco knowledge base

Don't want to convert anything but looking for a Tensorflow equivalent of a certain Pytorch node (operation or module)? Nobuco already implements quite a few node converters, most written in a concise and, hopefully, understandable way. These are located in nobuco/node_converters, and there's a utility function to help you find what you need:

node = torch.Tensor.repeat
# node = F.relu_
# node = nn.LSTM

location_link, source_code = nobuco.locate_converter(node)
print('Converter location:')
print(location_link)
print('Converter source code:')
print(source_code)

Converter location:
File "/home/user/anaconda3/envs/nb/lib/python3.9/site-packages/nobuco/node_converters/tensor_manipulation.py", line 141

Converter source code:
@converter(torch.Tensor.repeat, channel_ordering_strategy=ChannelOrderingStrategy.MINIMUM_TRANSPOSITIONS)
def converter_repeat(self, *sizes):
    def func(self, *sizes):
        if get_channel_order(self) == ChannelOrder.TENSORFLOW:
            sizes = permute_pytorch2keras(sizes)
        return tf.tile(self, sizes)
    return func

Acknowledgements

Slice assign converter is based on Zaccharie Ramzi's tf-slice-assign script.

This version: 0.11.5 (Feb 6, 2024)
