XLNet implemented in Keras

These details have not been verified by PyPI

Project links

Homepage

Project description

Keras XLNet

Downloads License

Unofficial implementation of XLNet. Embedding extraction and embedding extract with memory show how to get the outputs of the last transformer layer using pre-trained checkpoints.

Install

pip install keras-xlnet

Usage

Fine-tuning on GLUE

Click the task name to see the demos with base model:

Task Name	Metrics	Approximate Results on Dev Set
CoLA	Matthew Corr.	52
SST-2	Accuracy	93
MRPC	Accuracy/F1	86/89
STS-B	Pearson Corr. / Spearman Corr.	86/87
QQP	Accuracy/F1	90/86
MNLI	Accuracy	84/84
QNLI	Accuracy	86
RTE	Accuracy	64
WNLI	Accuracy	56

(Only 0s are predicted in WNLI dataset)

Load Pretrained Checkpoints

import os
from keras_xlnet import Tokenizer, load_trained_model_from_checkpoint, ATTENTION_TYPE_BI

checkpoint_path = '.../xlnet_cased_L-24_H-1024_A-16'

tokenizer = Tokenizer(os.path.join(checkpoint_path, 'spiece.model'))
model = load_trained_model_from_checkpoint(
    config_path=os.path.join(checkpoint_path, 'xlnet_config.json'),
    checkpoint_path=os.path.join(checkpoint_path, 'xlnet_model.ckpt'),
    batch_size=16,
    memory_len=512,
    target_len=128,
    in_train_phase=False,
    attention_type=ATTENTION_TYPE_BI,
)
model.summary()

Arguments batch_size, memory_len and target_len are maximum sizes used for initialization of memories. The model used for training a language model is returned if in_train_phase is True, otherwise a model used for fine-tuning will be returned.

About I/O

Note that shuffle should be False in either fit or fit_generator if memories are used.

`in_train_phase` is `False`

3 inputs:

IDs of tokens, with shape (batch_size, target_len).
IDs of segments, with shape (batch_size, target_len).
Length of memories, with shape (batch_size, 1).

1 output:

The feature for each token, with shape (batch_size, target_len, units).

`in_train_phase` is `True`

4 inputs:

IDs of tokens, with shape (batch_size, target_len).
IDs of segments, with shape (batch_size, target_len).
Length of memories, with shape (batch_size, 1).
Masks of tokens, with shape (batch_size, target_len).

1 output:

The probability of each token in each position, with shape (batch_size, target_len, num_token).

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

0.20.0

Jan 22, 2022

0.19.0

Jun 19, 2021

0.18.0

Nov 5, 2019

0.17.0

Sep 3, 2019

0.16.0

Aug 23, 2019

0.15.0

Aug 23, 2019

This version

0.14.1

Aug 6, 2019

0.14.0

Aug 6, 2019

0.13.0

Aug 5, 2019

0.11.0

Jul 30, 2019

0.10.0

Jul 30, 2019

0.9.0

Jul 30, 2019

0.8.0

Jul 30, 2019

0.7.0

Jul 30, 2019

0.5.0

Jul 18, 2019

0.4.0

Jul 18, 2019

0.3.0

Jul 18, 2019

0.2.0

Jul 18, 2019

0.1.0

Jul 16, 2019

0.0.3

Jul 15, 2019

0.0.2

Jul 15, 2019

0.0.1

Jul 15, 2019

0.0.0

Jul 15, 2019

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

keras-xlnet-0.14.1.tar.gz (20.8 kB view hashes)

Uploaded Aug 6, 2019 Source

Hashes for keras-xlnet-0.14.1.tar.gz

Hashes for keras-xlnet-0.14.1.tar.gz
Algorithm	Hash digest
SHA256	`3652564d6af8b3d641f8e1ba1781ca023d84ad48f8341848594bce86738f1636`
MD5	`ad2740642a5266077a4a1ea75cd41d61`
BLAKE2b-256	`2332b9e7bf730f79613541567c46e4cd4a1395fa8441e6b3c602882ea4612fdb`

keras-xlnet 0.14.1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Keras XLNet

Install

Usage

Fine-tuning on GLUE

Load Pretrained Checkpoints

About I/O

`in_train_phase` is `False`

`in_train_phase` is `True`

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

keras-xlnet 0.14.1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Keras XLNet

Install

Usage

Fine-tuning on GLUE

Load Pretrained Checkpoints

About I/O

in_train_phase is False

in_train_phase is True

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

`in_train_phase` is `False`

`in_train_phase` is `True`