Neural machine translation and sequence learning using TensorFlow

These details have not been verified by PyPI

Project links

Project description

OpenNMT-tf

OpenNMT-tf is a general purpose sequence learning toolkit using TensorFlow 2. While neural machine translation is the main target task, it has been designed to more generally support:

sequence to sequence mapping
sequence tagging
sequence classification
language modeling

The project is production-oriented and comes with backward compatibility guarantees.

Key features

Modular model architecture

Models are described with code to allow training custom architectures and overriding default behavior. For example, the following instance defines a sequence to sequence model with 2 concatenated input features, a self-attentional encoder, and an attentional RNN decoder sharing its input and output embeddings:

opennmt.models.SequenceToSequence(
    source_inputter=opennmt.inputters.ParallelInputter(
        [
            opennmt.inputters.WordEmbedder(embedding_size=256),
            opennmt.inputters.WordEmbedder(embedding_size=256),
        ],
        reducer=opennmt.layers.ConcatReducer(axis=-1),
    ),
    target_inputter=opennmt.inputters.WordEmbedder(embedding_size=512),
    encoder=opennmt.encoders.SelfAttentionEncoder(num_layers=6),
    decoder=opennmt.decoders.AttentionalRNNDecoder(
        num_layers=4,
        num_units=512,
        attention_mechanism_class=tfa.seq2seq.LuongAttention,
    ),
    share_embeddings=opennmt.models.EmbeddingsSharingLevel.TARGET,
)

The opennmt package exposes other building blocks that can be used to design:

Standard models such as the Transformer are defined in a model catalog and can be used without additional configuration.

Find more information about model configuration in the documentation.

Full TensorFlow 2 integration

OpenNMT-tf is fully integrated in the TensorFlow 2 ecosystem:

Reusable layers extending tf.keras.layers.Layer
Multi-GPU training with tf.distribute and distributed training with Horovod
Mixed precision training with tf.keras.mixed_precision
Visualization with TensorBoard
tf.function graph tracing that can be exported to a SavedModel and served with TensorFlow Serving or Python

Compatibility with CTranslate2

CTranslate2 is an optimized inference engine for OpenNMT models featuring fast CPU and GPU execution, model quantization, parallel translations, dynamic memory usage, interactive decoding, and more! OpenNMT-tf can automatically export models to be used in CTranslate2.

Dynamic data pipeline

OpenNMT-tf does not require to compile the data before the training. Instead, it can directly read text files and preprocess the data when needed by the training. This allows on-the-fly tokenization and data augmentation by injecting random noise.

Model fine-tuning

OpenNMT-tf supports model fine-tuning workflows:

Model weights can be transferred to new word vocabularies, e.g. to inject domain terminology before fine-tuning on in-domain data
Contrastive learning to reduce word omission errors

Source-target alignment

Sequence to sequence models can be trained with guided alignment and alignment information are returned as part of the translation API.

OpenNMT-tf also implements most of the techniques commonly used to train and evaluate sequence models, such as:

automatic evaluation during the training
multiple decoding strategy: greedy search, beam search, random sampling
N-best rescoring
gradient accumulation
scheduled sampling
checkpoint averaging
... and more!

See the documentation to learn how to use these features.

Usage

OpenNMT-tf requires:

Python 3.7 or above
TensorFlow 2.6, 2.7, 2.8, 2.9, 2.10, 2.11, 2.12, or 2.13

We recommend installing it with pip:

pip install --upgrade pip
pip install OpenNMT-tf

See the documentation for more information.

Command line

OpenNMT-tf comes with several command line utilities to prepare data, train, and evaluate models.

For all tasks involving a model execution, OpenNMT-tf uses a unique entrypoint: onmt-main. A typical OpenNMT-tf run consists of 3 elements:

the model type
the parameters described in a YAML file
the run type such as train, eval, infer, export, score, average_checkpoints, or update_vocab

that are passed to the main script:

onmt-main --model_type <model> --config <config_file.yml> --auto_config <run_type> <run_options>

For more information and examples on how to use OpenNMT-tf, please visit our documentation.

Library

OpenNMT-tf also exposes well-defined and stable APIs, from high-level training utilities to low-level model layers and dataset transformations.

For example, the Runner class can be used to train and evaluate models with few lines of code:

import opennmt

config = {
    "model_dir": "/data/wmt-ende/checkpoints/",
    "data": {
        "source_vocabulary": "/data/wmt-ende/joint-vocab.txt",
        "target_vocabulary": "/data/wmt-ende/joint-vocab.txt",
        "train_features_file": "/data/wmt-ende/train.en",
        "train_labels_file": "/data/wmt-ende/train.de",
        "eval_features_file": "/data/wmt-ende/valid.en",
        "eval_labels_file": "/data/wmt-ende/valid.de",
    }
}

model = opennmt.models.TransformerBase()
runner = opennmt.Runner(model, config, auto_config=True)
runner.train(num_devices=2, with_eval=True)

Here is another example using OpenNMT-tf to run efficient beam search with a self-attentional decoder:

decoder = opennmt.decoders.SelfAttentionDecoder(num_layers=6, vocab_size=32000)

initial_state = decoder.initial_state(
    memory=memory, memory_sequence_length=memory_sequence_length
)

batch_size = tf.shape(memory)[0]
start_ids = tf.fill([batch_size], opennmt.START_OF_SENTENCE_ID)

decoding_result = decoder.dynamic_decode(
    target_embedding,
    start_ids=start_ids,
    initial_state=initial_state,
    decoding_strategy=opennmt.utils.BeamSearch(4),
)

More examples using OpenNMT-tf as a library can be found online:

The directory examples/library contains additional examples that use OpenNMT-tf as a library
nmt-wizard-docker uses the high-level opennmt.Runner API to wrap OpenNMT-tf with a custom interface for training, translating, and serving

For a complete overview of the APIs, see the package documentation.

Additional resources

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

2.32.0

Aug 4, 2023

2.31.0

Jan 13, 2023

2.30.0

Dec 12, 2022

2.29.1

Oct 3, 2022

2.29.0

Sep 26, 2022

2.28.0

Jul 29, 2022

2.27.1

Jun 2, 2022

2.27.0

May 30, 2022

2.26.1

Mar 31, 2022

2.26.0

Mar 31, 2022

2.25.0

Feb 21, 2022

2.24.0

Dec 17, 2021

2.23.0

Nov 15, 2021

2.22.0

Sep 30, 2021

2.21.0

Aug 30, 2021

2.20.1

Jul 1, 2021

2.20.0

Jun 17, 2021

2.19.0

May 31, 2021

2.18.1

Apr 27, 2021

2.18.0

Apr 19, 2021

2.17.1

Mar 23, 2021

2.17.0

Mar 15, 2021

2.16.0

Feb 25, 2021

2.15.0

Jan 28, 2021

2.14.0

Dec 28, 2020

2.13.0

Oct 20, 2020

2.12.1

Sep 16, 2020

2.12.0

Aug 31, 2020

2.11.1

Jun 25, 2020

2.11.0

Jun 17, 2020

2.10.1

Jun 4, 2020

2.10.0

May 28, 2020

2.9.3

May 6, 2020

2.9.2

Apr 22, 2020

2.9.1

Apr 14, 2020

2.9.0

Apr 7, 2020

2.8.1

Mar 24, 2020

2.8.0

Mar 2, 2020

2.7.0

Feb 14, 2020

2.6.0

Jan 28, 2020

2.5.1

Jan 20, 2020

2.5.0

Jan 16, 2020

2.4.0

Dec 10, 2019

2.3.0

Nov 25, 2019

2.2.1

Nov 7, 2019

2.2.0

Nov 6, 2019

2.1.1

Oct 18, 2019

2.1.0

Oct 10, 2019

2.0.1

Oct 4, 2019

2.0.0

Oct 1, 2019

1.25.3

Nov 25, 2019

1.25.2

Oct 22, 2019

1.25.1

Sep 25, 2019

1.25.0

Sep 13, 2019

1.24.1

Aug 29, 2019

1.24.0

Jun 26, 2019

1.23.1

Jun 7, 2019

1.23.0

May 30, 2019

1.22.2

May 17, 2019

1.22.1

Apr 29, 2019

1.22.0

Apr 6, 2019

1.21.7

Mar 20, 2019

1.21.6

Mar 12, 2019

1.21.5

Mar 11, 2019

1.21.4

Mar 7, 2019

1.21.3

Mar 6, 2019

1.21.2

Mar 5, 2019

1.21.1

Mar 4, 2019

1.21.0

Mar 1, 2019

1.20.1

Feb 22, 2019

1.20.0

Feb 15, 2019

1.19.2

Feb 13, 2019

1.19.1

Feb 13, 2019

1.19.0

Feb 8, 2019

1.18.0

Feb 1, 2019

1.17.1

Jan 21, 2019

1.17.0

Jan 10, 2019

1.16.0

Dec 21, 2018

1.15.0

Nov 30, 2018

1.14.1

Nov 28, 2018

1.14.0

Nov 22, 2018

1.13.1

Nov 19, 2018

1.13.0

Nov 14, 2018

1.12.0

Nov 7, 2018

1.11.0

Oct 24, 2018

1.10.1

Oct 15, 2018

1.10.0

Oct 11, 2018

1.9.0

Oct 5, 2018

1.8.1

Sep 28, 2018

1.8.0

Sep 25, 2018

1.7.0

Aug 7, 2018

1.6.2

Jul 14, 2018

1.6.1

Jul 11, 2018

1.6.0

Jul 5, 2018

1.5.0

Jun 8, 2018

1.4.1

May 25, 2018

1.4.0

May 25, 2018

1.3.0

May 14, 2018

1.2.0

Apr 28, 2018

1.1.0

Apr 12, 2018

1.0.3

Apr 5, 2018

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

OpenNMT-tf-2.32.0.tar.gz (133.0 kB view details)

Uploaded Aug 4, 2023 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

OpenNMT_tf-2.32.0-py3-none-any.whl (162.0 kB view details)

Uploaded Aug 4, 2023 Python 3

File details

Details for the file OpenNMT-tf-2.32.0.tar.gz.

File metadata

Download URL: OpenNMT-tf-2.32.0.tar.gz
Upload date: Aug 4, 2023
Size: 133.0 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.2 CPython/3.11.4

File hashes

Hashes for OpenNMT-tf-2.32.0.tar.gz
Algorithm	Hash digest
SHA256	`dcc7d80046bf6e94f3d7f2e477d1c89c7c57ef6b99f7d9b92235341442833c79`
MD5	`1c360d336c32891c0cd6e303f5707e64`
BLAKE2b-256	`39a7aa003550a746f2c843718b55f21a6697a61b2c9e10587c5ba0fa4b62708b`

See more details on using hashes here.

File details

Details for the file OpenNMT_tf-2.32.0-py3-none-any.whl.

File metadata

Download URL: OpenNMT_tf-2.32.0-py3-none-any.whl
Upload date: Aug 4, 2023
Size: 162.0 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.2 CPython/3.11.4

File hashes

Hashes for OpenNMT_tf-2.32.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`41b4221777fed15a84247123eebba6a1f6becef824a85af33248416a6d331cd5`
MD5	`994570a8a71dd5fd184a53fb2bb366d9`
BLAKE2b-256	`ff2db655f288685a9c6b62384d97d2b9132861980e84f8c15af0de48d8a3e5f9`

See more details on using hashes here.

OpenNMT-tf 2.32.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

OpenNMT-tf

Key features

Modular model architecture

Full TensorFlow 2 integration

Compatibility with CTranslate2

Dynamic data pipeline

Model fine-tuning

Source-target alignment

Usage

Command line

Library

Additional resources

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes