pytorch-to-returnn

Make PyTorch code runnable within RETURNN (TensorFlow)

Make PyTorch code runnable within RETURNN (on TensorFlow). This provides some wrappers (and maybe some magic) to do that.

Installation

This package is on PyPI.

pip install pytorch-to-returnn

torch drop-in replacement for RETURNN

The idea:

import torch

class Model(torch.nn.Module):
    ...

Can be changed to:

from pytorch_to_returnn import torch as torch_returnn

class Model(torch_returnn.nn.Module):
    ...

And this can be used directly in RETURNN.

This converts the model to a RETURNN model. The project provides an example of a constructed RETURNN net dict, created from PyTorch code in this way.

Why

From PyTorch perspective:

RETURNN will keep track of the meaning of tensor axes. I.e. it knows about the batch axis, and any spatial axes (width/height or time), including their sequence lengths. (This goes far beyond just named axes.) This can be used to verify whether the operations are on the right axes and to detect potential bugs.
RETURNN can do further optimizations and might make the model run faster. (If this is not the case, likely there is some bug, or non-optimal implementation on RETURNN side, which we can improve.)

From RETURNN/TF perspective:

This can serve as a new way to define your RETURNN networks (TF networks), which might be simpler to use than the existing way.
We can reuse PyTorch code, and even trained models, within RETURNN, and combine it easily with other RETURNN models.
We might find non-optimal or buggy implementations in RETURNN (e.g. when there is some module which runs better/faster in PyTorch) and can improve upon them (the corresponding RETURNN layer).

How does this work

On a high level, RETURNN layers mostly correspond to PyTorch modules, so all PyTorch modules are mapped directly or indirectly to RETURNN layers. The same is done for all functions in functional.
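The module-to-layer mapping can be pictured with a toy sketch. This is not the package's actual converter: the registry, the function names and the (name, kind, params) module description are hypothetical and only illustrate the idea of turning a chain of modules into RETURNN net dict entries.

```python
# Toy sketch only: CONVERTERS, to_net_dict and the module description format
# are hypothetical and not part of pytorch_to_returnn.
from typing import Any, Callable, Dict, List, Tuple

LayerDict = Dict[str, Any]


def convert_linear(params: Dict[str, Any], source: str) -> LayerDict:
    # A RETURNN "linear" layer takes its output dimension as n_out.
    return {"class": "linear", "from": source, "n_out": params["out_features"],
            "with_bias": params.get("bias", True), "activation": None}


def convert_relu(params: Dict[str, Any], source: str) -> LayerDict:
    return {"class": "activation", "from": source, "activation": "relu"}


# One converter per supported module kind; anything else is "not implemented yet".
CONVERTERS: Dict[str, Callable[[Dict[str, Any], str], LayerDict]] = {
    "Linear": convert_linear,
    "ReLU": convert_relu,
}


def to_net_dict(modules: List[Tuple[str, str, Dict[str, Any]]],
                source: str = "data") -> Dict[str, LayerDict]:
    """Chain the modules sequentially; each layer reads from the previous one."""
    net: Dict[str, LayerDict] = {}
    for name, kind, params in modules:
        if kind not in CONVERTERS:
            raise NotImplementedError(f"no mapping for module {kind!r} yet")
        net[name] = CONVERTERS[kind](params, source)
        source = name
    return net


net_dict = to_net_dict([
    ("fc1", "Linear", {"in_features": 80, "out_features": 128}),
    ("act1", "ReLU", {}),
])
```

Here net_dict["act1"] reads from "fc1", and an unknown module kind raises NotImplementedError, mirroring the "no one has implemented it yet" case.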

All RETURNN layers have further meta information about tensors, especially their axes/dimensions, and they might reorder axes when this is more efficient. We keep track of the axis mapping.
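A minimal sketch of what keeping track of the axis mapping could look like. The function names and the axis labels "B", "F", "T" are assumptions made for illustration; the package's real bookkeeping is more involved.

```python
# Illustrative sketch: axis_mapping and translate_dim are hypothetical helpers.
from typing import Dict, Sequence


def axis_mapping(torch_axes: Sequence[str], returnn_axes: Sequence[str]) -> Dict[int, int]:
    """Map each PyTorch axis index to the index of the same axis in RETURNN order."""
    if sorted(torch_axes) != sorted(returnn_axes):
        raise ValueError("both layouts must contain the same axes")
    return {i: list(returnn_axes).index(name) for i, name in enumerate(torch_axes)}


def translate_dim(torch_dim: int, mapping: Dict[int, int]) -> int:
    """Translate a PyTorch `dim` argument (negative values allowed) to the RETURNN axis."""
    return mapping[torch_dim % len(mapping)]


# PyTorch layout (batch, channel, time); assumed RETURNN layout (batch, time, feature).
mapping = axis_mapping(["B", "F", "T"], ["B", "T", "F"])
```

With this mapping, a PyTorch operation on dim=-1 (time) would be applied to axis translate_dim(-1, mapping) on the RETURNN side, which is axis 1 in the assumed layout.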

See the documentation of the pytorch_to_returnn.torch package for details about how this works and what can be done with it. Obviously, this is incomplete. For the current status of what is not supported, see the unsupported document. Otherwise, when you hit a Module, a functional function, or a Tensor function that is not implemented, it just means that no one has implemented it yet.

Somewhat related is also the torch.fx module.

Direct use in RETURNN

A RETURNN config could be written in this way.

Use some PyTorch model as a component / subnetwork:

from pytorch_to_returnn import torch as torch_returnn

class MyTorchModel(torch_returnn.nn.Module):
  ...

my_torch_model = MyTorchModel()

extern_data = {...}  # as usual

# RETURNN network dict
network = {
    "prenet": my_torch_model.as_returnn_layer_dict(extern_data["data"]),

    # Other RETURNN layers
    ...
}

Or directly using a PyTorch model as-is:

from pytorch_to_returnn import torch as torch_returnn

class MyTorchModel(torch_returnn.nn.Module):
  ...

my_torch_model = MyTorchModel()

extern_data = {...}  # as usual

# RETURNN network dict
network = my_torch_model.as_returnn_net_dict(extern_data["data"])

Model converter

To convert a model from PyTorch to RETURNN, including a PyTorch model checkpoint, we provide some utilities that automate the process and verify whether all outputs match. This is in pytorch_to_returnn.converter.

Example for Parallel WaveGAN:

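import typing
import numpy
import torch  # only needed here for the type annotation below
import yaml

# `args` is assumed to hold parsed command-line arguments
# (pwg_config, pwg_checkpoint, features).

def model_func(wrapped_import, inputs: torch.Tensor):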
    if typing.TYPE_CHECKING or not wrapped_import:
        import torch
        from parallel_wavegan import models as pwg_models
        from parallel_wavegan import layers as pwg_layers

    else:
        torch = wrapped_import("torch")
        wrapped_import("parallel_wavegan")
        pwg_models = wrapped_import("parallel_wavegan.models")
        pwg_layers = wrapped_import("parallel_wavegan.layers")

    # Initialize PWG
    pwg_config = yaml.load(open(args.pwg_config), Loader=yaml.Loader)
    generator = pwg_models.MelGANGenerator(**pwg_config['generator_params'])
    generator.load_state_dict(
        torch.load(args.pwg_checkpoint, map_location="cpu")["model"]["generator"])
    generator.remove_weight_norm()
    pwg_model = generator.eval()
    pwg_pqmf = pwg_layers.PQMF(pwg_config["generator_params"]["out_channels"])

    return pwg_pqmf.synthesis(pwg_model(inputs))


feature_data = numpy.load(args.features)  # shape (Batch,Channel,Time) (1,80,80)

from pytorch_to_returnn.converter import verify_torch_and_convert_to_returnn
verify_torch_and_convert_to_returnn(model_func, inputs=feature_data)

The wrapped_import uses some import wrappers, which automatically convert the import torch statements.

This will automatically do the conversion, i.e. create a RETURNN model, including the RETURNN net dict and TF checkpoint file, and verify all the outputs at several steps (PyTorch module outputs vs RETURNN layer outputs).
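The step-wise verification idea can be sketched as follows. This is a simplified illustration, not the converter's code: the helper name, the tolerances and the plain lists of floats standing in for tensors are all assumptions.

```python
# Simplified sketch: first_mismatch is a hypothetical helper; real outputs are tensors.
import math
from typing import Dict, Optional, Sequence


def first_mismatch(reference: Dict[str, Sequence[float]],
                   converted: Dict[str, Sequence[float]],
                   rel_tol: float = 1e-5, abs_tol: float = 1e-5) -> Optional[str]:
    """Return the name of the first step whose outputs differ, or None if all match."""
    for name, ref_values in reference.items():
        new_values = converted.get(name)
        if new_values is None or len(new_values) != len(ref_values):
            return name
        if not all(math.isclose(a, b, rel_tol=rel_tol, abs_tol=abs_tol)
                   for a, b in zip(ref_values, new_values)):
            return name
    return None


# Outputs recorded per module/layer, in execution order (made-up numbers).
torch_outputs = {"conv1": [0.1, 0.2], "act1": [0.1, 0.2], "out": [1.0, 2.0]}
returnn_outputs = {"conv1": [0.1, 0.2], "act1": [0.1, 0.2000001], "out": [1.0, 2.5]}
mismatch = first_mismatch(torch_outputs, returnn_outputs)
```

Comparing at every step rather than only at the final output points to the first layer where the two models diverge, which is usually where the bug is.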

Import wrapper

We also support transforming external PyTorch code on the fly, without the need to rewrite it: the code is translated at the AST level, in the way described above. It basically replaces import torch by from pytorch_to_returnn import torch; that's all it does.

This is via our generic Python import wrapper pytorch_to_returnn.import_wrapper.
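To illustrate the kind of AST-level rewrite described here, the following sketch uses only the standard ast module. It is not the implementation of pytorch_to_returnn.import_wrapper; the class and function names are made up, and it handles only plain import torch statements (not, for example, import torch.nn).

```python
# Illustrative sketch: TorchImportRewriter and rewrite_source are hypothetical names.
import ast


class TorchImportRewriter(ast.NodeTransformer):
    """Rewrite `import torch` into `from pytorch_to_returnn import torch`."""

    def visit_Import(self, node: ast.Import):
        new_nodes = []
        for alias in node.names:
            if alias.name == "torch":
                # Keep an optional `as` name, e.g. `import torch as th`.
                new_nodes.append(ast.ImportFrom(
                    module="pytorch_to_returnn",
                    names=[ast.alias(name="torch", asname=alias.asname)],
                    level=0))
            else:
                new_nodes.append(ast.Import(names=[alias]))
        return new_nodes


def rewrite_source(source: str) -> str:
    tree = TorchImportRewriter().visit(ast.parse(source))
    ast.fix_missing_locations(tree)
    return ast.unparse(tree)  # ast.unparse needs Python 3.9+


rewritten = rewrite_source("import torch\nimport numpy\n")
```

Other imports are left untouched, so only the torch import changes in the rewritten source.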

Example for Parallel WaveGAN:

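# `args` (parsed command-line arguments) and `inputs` (a NumPy feature array)
# are assumed to be defined by the caller.
import tensorflow as tf
import yaml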
from pytorch_to_returnn.import_wrapper import wrapped_import_torch_returnn
from pytorch_to_returnn.naming import Naming
from returnn.tf.util.data import Data

torch = wrapped_import_torch_returnn("torch")
wrapped_import_torch_returnn("parallel_wavegan")
pwg_models = wrapped_import_torch_returnn("parallel_wavegan.models")
pwg_layers = wrapped_import_torch_returnn("parallel_wavegan.layers")

naming = Naming.get_instance()  # default instance

inputs = torch.from_numpy(inputs)  # shape (Batch,Channel,Time), e.g. (1,80,80)
x = naming.register_input(
    inputs, Data("data", shape=(80, None), feature_dim_axis=1, time_dim_axis=2))
assert isinstance(x, Data)

# Initialize PWG
pwg_config = yaml.load(open(args.pwg_config), Loader=yaml.Loader)
generator = pwg_models.MelGANGenerator(**pwg_config['generator_params'])
generator.load_state_dict(
    torch.load(args.pwg_checkpoint, map_location="cpu")["model"]["generator"])
generator.remove_weight_norm()
pwg_model = generator.eval()
pwg_pqmf = pwg_layers.PQMF(pwg_config["generator_params"]["out_channels"])

outputs = pwg_pqmf.synthesis(pwg_model(inputs))

outputs = naming.register_output(outputs)
y = outputs.returnn_data
assert isinstance(y, Data)
assert isinstance(y.placeholder, tf.Tensor)

(RETURNN Data encapsulates a tensor and adds a lot of meta information about it and its axes, such as sequence lengths, beam, vocabulary of class indices, etc.)
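As a rough picture of such meta information, here is a toy stand-in for Data. ToyData is hypothetical and far simpler than the real class; it only mirrors the convention used in the example above, where shape excludes the batch axis and None marks a dynamic axis.

```python
# Toy stand-in only: ToyData is hypothetical and much simpler than RETURNN's Data.
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass
class ToyData:
    name: str
    shape: Tuple[Optional[int], ...]  # without the batch axis; None = dynamic length
    batch_dim_axis: int = 0
    feature_dim_axis: Optional[int] = None
    time_dim_axis: Optional[int] = None

    @property
    def batch_shape(self) -> Tuple[Optional[int], ...]:
        """Shape including the batch axis, which is always dynamic."""
        dims = list(self.shape)
        dims.insert(self.batch_dim_axis, None)
        return tuple(dims)

    def describe_axis(self, axis: int) -> str:
        """Return the role of an axis counted with the batch axis included."""
        if axis == self.batch_dim_axis:
            return "batch"
        if axis == self.feature_dim_axis:
            return "feature"
        if axis == self.time_dim_axis:
            return "time"
        return "spatial"


toy = ToyData("data", shape=(80, None), feature_dim_axis=1, time_dim_axis=2)
```

For this input, toy.batch_shape is (None, 80, None), matching the (Batch,Channel,Time) layout of the features.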

Examples

See the examples provided with the project.

Tests

See tests. They are automatically run via GitHub Actions for CI.

https://github.com/rwth-i6/pytorch-to-returnn/workflows/CI/badge.svg

pytorch-to-returnn 1.20201216.170027
