Diverse and extensible generation decoding libraries for transformers.

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Project description

Decoders for 🤗 transformers

This package provides a convenient interface for extensible and customizable generation strategies -aka decoders- in 🤗 transformers.

It also provides extra implementations out of the box, like the Stochastic Beam Search decoder.

Installation

pip install decoders

Usage

Simple use of the new interface:

from decoders import inject_supervitamined_decoders
from transformers import T5ForConditionalGeneration

model = T5ForConditionalGeneration.from_pretrained('t5-small')
inject_supervitamined_decoders(model)
model.generate(...)

Decoders

Stochastic Beam Search

This decoder is a stochastic version of the Beam Search decoder. It is a HF implementation of the paper Stochastic Beam Search.

It can be used as follows:

from decoders import OldStochasticBeamSearchDecoder, inject_supervitamined_decoders
from transformers import T5ForConditionalGeneration

model = T5ForConditionalGeneration.from_pretrained('t5-small')
inject_supervitamined_decoders(model)

decoder = OldStochasticBeamSearchDecoder()
outputs = model.generate(input_ids, generation_strategy=decoder,
                         num_beams=4, num_return_sequences=4,  # sample without repl. = return all beams
                         length_penalty=0.0,  # for correct probabilities, disable length penalty
                         return_dict_in_generate=True, output_scores=True, early_stopping=True,
                         # early stopping because without length penalty, we can discard worse sequences
                         # return_dict_in_generate and output_scores are required for sbs for now,
                         # as scores keep the past generated gumbel noise, which is used by the logits processor
                         )

Note that when sampling without replacement, you must set num_beams and num_return_sequences to the same value, the number of SWOR samples that you want to obtain.

Of course, the samples for the same input are not independent. If you want R different groups of SWOR samples of size n, you should replicate your batched input tensor by R, and then set num_beams and num_return_sequences to n.

See here for a full example.

Included goodies

BinaryCodeTransformer

The BinaryCodeTransformer is a custom transformer model that acts like a probabilistic binary sequence generator. Given a discrete probability distribution over all possible binary sequences of a given length, it generates a sequence of that length according to that distribution. It is useful to test HF compatible sample-without-replacement decoders, like the Stochastic Beam Search decoder.

The code maps each of the 2^n possible binary sequences of length n to its positive integer decimal representation. Then, it uses that number as the index of the corresponding probability in the input distribution. Since we are interested in autoregressive generation, the model computes the conditional probabilities by summing over the possible continuations of the sequence.

FakeTransformer

The FakeTransformer operates as a very simple Probabilistic Finite State Automaton. See here for a full explanation.

Project details

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Release history Release notifications | RSS feed

0.3.0

Feb 27, 2024

0.2.19

Feb 26, 2024

0.2.18

Feb 23, 2024

0.2.17

Feb 23, 2024

0.2.16

Feb 22, 2024

This version

0.2.15

Feb 14, 2024

0.2.14

Feb 14, 2024

0.2.13

Feb 14, 2024

0.2.12

Feb 14, 2024

0.2.11

Feb 14, 2024

0.2.10

Feb 13, 2024

0.2.9

Feb 9, 2024

0.2.8

Feb 9, 2024

0.2.7

Feb 8, 2024

0.2.6

Jan 24, 2024

0.2.5

Dec 31, 2023

0.2.4

Dec 29, 2023

0.2.3

Dec 29, 2023

0.2.2

Dec 29, 2023

0.2.1

Dec 29, 2023

0.2.0

Dec 29, 2023

0.1.9

Dec 29, 2023

0.1.8

Dec 29, 2023

0.1.7

Dec 28, 2023

0.1.6

Dec 27, 2023

0.1.5

Dec 27, 2023

0.1.4

Dec 20, 2023

0.1.3

Dec 19, 2023

0.1.2

Nov 26, 2023

0.1.1

Nov 21, 2023

0.1.0

Nov 9, 2023

0.0.12

Nov 2, 2023

0.0.11

Nov 2, 2023

0.0.10

Nov 2, 2023

0.0.9

Nov 2, 2023

0.0.8

Nov 2, 2023

0.0.7

Nov 1, 2023

0.0.6

Nov 1, 2023

0.0.5

Nov 1, 2023

0.0.4

Jul 27, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

decoders-0.2.15.tar.gz (94.8 kB view hashes)

Uploaded Feb 14, 2024 Source

Built Distribution

decoders-0.2.15-py3-none-any.whl (122.9 kB view hashes)

Uploaded Feb 14, 2024 Python 3

Hashes for decoders-0.2.15.tar.gz

Hashes for decoders-0.2.15.tar.gz
Algorithm	Hash digest
SHA256	`d55c1653fd6e0ca54ea3cfdc15a5bdaa26c12c3b7d13571016f9594d1b5e3069`
MD5	`2e58be2a4c3a3865f49f02718f8c81e0`
BLAKE2b-256	`b187ac8c8b1745486e210e7296977f4f4079d3515bef7c945bab68722965e5ac`

Hashes for decoders-0.2.15-py3-none-any.whl

Hashes for decoders-0.2.15-py3-none-any.whl
Algorithm	Hash digest
SHA256	`657f58cbc1e54b3411ad62c31704166a5efe0b84cab61cea1cf8141449367721`
MD5	`1b8c36387baeb8e49c7ed37eb68af149`
BLAKE2b-256	`d83f65e2199a7162b57d0b7bee110dc2252d8895b50ef22ae8a9bc9a915f5773`