
An easy-to-use wrapper library for the Transformers library.

Project description

Simple Transformers

This library is based on the Pytorch-Transformers library by HuggingFace. Using this library, you can quickly train and evaluate Transformer models. Only three lines of code are needed to initialize, train, and evaluate a model.

Setup

With Conda

  1. Install the Anaconda or Miniconda package manager.

  2. Create a new virtual environment and install packages.
    conda create -n transformers python pandas tqdm
    conda activate transformers
    If using cuda:
        conda install pytorch cudatoolkit=10.0 -c pytorch
    else:
        conda install pytorch cpuonly -c pytorch
    conda install -c anaconda scipy
    conda install -c anaconda scikit-learn
    pip install transformers
    pip install tensorboardx

  3. Install simpletransformers.
    pip install simpletransformers
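
After installation, a quick way to confirm the setup is to check that the packages installed above are importable. This is a generic Python sanity check, not part of simpletransformers itself:

```python
import importlib.util

# Check that the key packages from the setup steps can be found.
for pkg in ("torch", "transformers", "simpletransformers"):
    found = importlib.util.find_spec(pkg) is not None
    print(f"{pkg}: {'OK' if found else 'missing'}")
```

If any package prints "missing", repeat the corresponding install step inside the activated environment.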

Usage

Minimal Start

from simpletransformers.model import TransformerModel
import pandas as pd


# Train and Evaluation data needs to be in a Pandas Dataframe of two columns. The first column is the text with type str, and the second column is the label with type int.
train_data = [['Example sentence belonging to class 1', 1], ['Example sentence belonging to class 0', 0]]
train_df = pd.DataFrame(train_data)

eval_data = [['Example eval sentence belonging to class 1', 1], ['Example eval sentence belonging to class 0', 0]]
eval_df = pd.DataFrame(eval_data)

# Create a TransformerModel
model = TransformerModel('roberta', 'roberta-base')

# Train the model
model.train_model(train_df)

# Evaluate the model
result, model_outputs, wrong_predictions = model.eval_model(eval_df)

To make predictions on arbitrary data, the predict(to_predict) function can be used. Given a list of text, it returns the model predictions and the raw model outputs.

predictions, raw_outputs = model.predict(['Some arbitrary sentence'])

Please refer to this Medium article for an example of using the library on the Yelp Reviews Dataset.

Default Settings

The default args used are given below. Any of these can be overridden by passing a dict containing the corresponding key: value pairs to the init method of TransformerModel.

self.args = {
   'model_type':  'roberta',
   'model_name': 'roberta-base',
   'output_dir': 'outputs/',
   'cache_dir': 'cache/',

   'fp16': True,
   'fp16_opt_level': 'O1',
   'max_seq_length': 128,
   'train_batch_size': 8,
   'eval_batch_size': 8,
   'gradient_accumulation_steps': 1,
   'num_train_epochs': 1,
   'weight_decay': 0,
   'learning_rate': 4e-5,
   'adam_epsilon': 1e-8,
   'warmup_ratio': 0.06,
   'warmup_steps': 0,
   'max_grad_norm': 1.0,

   'logging_steps': 50,
   'evaluate_during_training': False,
   'save_steps': 2000,
   'eval_all_checkpoints': True,
   'use_tensorboard': True,

   'overwrite_output_dir': False,
   'reprocess_input_data': False,
}
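
Overriding works like a plain dict update: keys you pass replace the defaults, and every other key keeps its default value. A minimal sketch of the merge, using a subset of the defaults above:

```python
# A subset of the default args shown above.
default_args = {
    'num_train_epochs': 1,
    'learning_rate': 4e-5,
    'train_batch_size': 8,
}

# Keys passed to TransformerModel(..., args=custom_args) override the defaults.
custom_args = {'num_train_epochs': 3, 'learning_rate': 2e-5}
merged = {**default_args, **custom_args}

print(merged)
```

So `TransformerModel('roberta', 'roberta-base', args={'num_train_epochs': 3})` trains for three epochs while leaving every other setting at its default.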

Args Explained

output_dir: str

The directory where all outputs will be stored. This includes model checkpoints and evaluation results.

cache_dir: str

The directory where cached files will be saved.

fp16: bool

Whether or not fp16 mode should be used. Requires NVIDIA's Apex library.

fp16_opt_level: str

Can be 'O1', 'O2', or 'O3'. See the Apex docs for an explanation of the different optimization levels (opt_levels).

max_seq_length: int

Maximum sequence length the model will support.

train_batch_size: int

The training batch size.

gradient_accumulation_steps: int

The number of training steps to execute before performing an optimizer.step(). Effectively increases the training batch size while sacrificing training time to lower memory consumption.
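
The effective batch size seen by the optimizer is the product of the per-step batch size and the accumulation steps; for example:

```python
train_batch_size = 8
gradient_accumulation_steps = 4

# Gradients from 4 batches of 8 examples are accumulated before each
# optimizer.step(), so each update is computed over 32 examples.
effective_batch_size = train_batch_size * gradient_accumulation_steps
print(effective_batch_size)
```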

eval_batch_size: int

The evaluation batch size.

num_train_epochs: int

The number of epochs the model will be trained for.

weight_decay: float

The weight decay (L2 penalty) applied by the optimizer.

learning_rate: float

The learning rate for training.

adam_epsilon: float

The epsilon hyperparameter used by the Adam optimizer.

max_grad_norm: float

The maximum gradient norm used for gradient clipping.

logging_steps: int

Log training loss and learning rate at every specified number of steps.

save_steps: int

Save a model checkpoint at every specified number of steps.
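
Both logging_steps and save_steps count optimizer steps, so how often checkpoints land within an epoch depends on dataset size, batch size, and gradient accumulation. A rough calculation, with illustrative numbers:

```python
import math

num_train_examples = 10000
train_batch_size = 8
gradient_accumulation_steps = 1
save_steps = 2000

batches_per_epoch = math.ceil(num_train_examples / train_batch_size)
optimizer_steps_per_epoch = batches_per_epoch // gradient_accumulation_steps

# With 1250 optimizer steps per epoch and save_steps=2000, the first
# checkpoint is written partway through the second epoch.
print(optimizer_steps_per_epoch)
```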

overwrite_output_dir: bool

If True, the trained model will be saved to the output_dir and will overwrite existing saved models in the same directory.

reprocess_input_data: bool

If True, the input data will be reprocessed even if a cached file of the input data exists in the cache_dir.

process_count: int

Number of cpu cores (processes) to use when converting examples to features. Default is (number of cores - 2) or 1 if (number of cores <= 2)
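
That default rule can be sketched in one line (a hypothetical reimplementation of the behaviour described above, not the library's actual code):

```python
from multiprocessing import cpu_count

# (number of cores - 2), falling back to 1 on machines with 2 or fewer cores.
process_count = cpu_count() - 2 if cpu_count() > 2 else 1
print(process_count)
```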

TransformerModel

class simpletransformers.model.TransformerModel (model_type, model_name, args=None, use_cuda=True)
This is the main class of this library. All configuration, training, and evaluation is performed using this class.

Class attributes

  • tokenizer: The tokenizer to be used.
  • model: The model to be used.
  • device: The device on which the model will be trained and evaluated.
  • results: A python dict of past evaluation results for the TransformerModel object.
  • args: A python dict of arguments used for training and evaluation.

Parameters

  • model_type: (required) str - The type of model to use. Currently, BERT, XLNet, XLM, and RoBERTa models are available.
  • model_name: (required) str - The exact model to use. See Current Pretrained Models for all available models.
  • args: (optional) python dict - A dictionary containing any settings that should be overwritten from the default values.
  • use_cuda: (optional) bool - Default = True. Flag used to indicate whether CUDA should be used.

class methods
train_model(self, train_df, output_dir=None)

Trains the model using 'train_df'

Args:

train_df: Pandas Dataframe (no header) of two columns, first column containing the text, and the second column containing the label. The model will be trained on this Dataframe.

output_dir: The directory where model files will be saved. If not given, self.args['output_dir'] will be used.

Returns:

None

eval_model(self, eval_df, output_dir=None, verbose=False)

Evaluates the model on eval_df. Saves results to output_dir.

Args:

eval_df: Pandas Dataframe (no header) of two columns, first column containing the text, and the second column containing the label. The model will be evaluated on this Dataframe.

output_dir: The directory where model files will be saved. If not given, self.args['output_dir'] will be used.

verbose: If verbose, results will be printed to the console on completion of evaluation.

Returns:

result: Dictionary containing evaluation results. (Matthews correlation coefficient, tp, tn, fp, fn)

model_outputs: List of model outputs for each row in eval_df

wrong_preds: List of InputExample objects corresponding to each incorrect prediction by the model

predict(self, to_predict)

Performs predictions on a list of text.

Args:

to_predict: A python list of text (str) to be sent to the model for prediction.

Returns:

preds: A python list of the predictions (0 or 1) for each text.

model_outputs: A python list of the raw model outputs for each text.

train(self, train_dataset, output_dir)

Trains the model on train_dataset. Utility function to be used by the train_model() method. Not intended to be used directly.

evaluate(self, eval_df, output_dir, prefix="")

Evaluates the model on eval_df. Utility function to be used by the eval_model() method. Not intended to be used directly.

load_and_cache_examples(self, examples, evaluate=False)

Converts a list of InputExample objects to a TensorDataset containing InputFeatures. Caches the InputFeatures. Utility function for the train() and evaluate() methods. Not intended to be used directly.

compute_metrics(self, preds, labels, eval_examples)

Computes the evaluation metrics for the model predictions.

Args:

preds: Model predictions

labels: Ground truth labels

eval_examples: List of examples on which evaluation was performed

Returns:

result: Dictionary containing evaluation results. (Matthews correlation coefficient, tp, tn, fp, fn)

wrong: List of InputExample objects corresponding to each incorrect prediction by the model
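
The metrics in the result dictionary are standard binary-classification quantities. A self-contained sketch of how they could be computed (the library's own implementation may differ, e.g. by using sklearn's matthews_corrcoef):

```python
import math

def confusion_counts(preds, labels):
    """Count true/false positives/negatives for binary (0/1) labels."""
    tp = sum(1 for p, l in zip(preds, labels) if p == 1 and l == 1)
    tn = sum(1 for p, l in zip(preds, labels) if p == 0 and l == 0)
    fp = sum(1 for p, l in zip(preds, labels) if p == 1 and l == 0)
    fn = sum(1 for p, l in zip(preds, labels) if p == 0 and l == 1)
    return tp, tn, fp, fn

def matthews_corrcoef(tp, tn, fp, fn):
    """Matthews correlation coefficient; defined as 0.0 when the denominator is zero."""
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return (tp * tn - fp * fn) / denom if denom else 0.0

preds = [1, 0, 1, 1]
labels = [1, 0, 0, 1]
tp, tn, fp, fn = confusion_counts(preds, labels)
print(tp, tn, fp, fn, round(matthews_corrcoef(tp, tn, fp, fn), 3))
```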

Current Pretrained Models

The table below shows the currently available model types and their models. You can use any of these by setting the model_type and model_name in the args dictionary. For more information about pretrained models, see HuggingFace docs.

Architecture Model Type Model Name Details
BERT bert bert-base-uncased 12-layer, 768-hidden, 12-heads, 110M parameters.
Trained on lower-cased English text.
BERT bert bert-large-uncased 24-layer, 1024-hidden, 16-heads, 340M parameters.
Trained on lower-cased English text.
BERT bert bert-base-cased 12-layer, 768-hidden, 12-heads, 110M parameters.
Trained on cased English text.
BERT bert bert-large-cased 24-layer, 1024-hidden, 16-heads, 340M parameters.
Trained on cased English text.
BERT bert bert-base-multilingual-uncased (Original, not recommended) 12-layer, 768-hidden, 12-heads, 110M parameters.
Trained on lower-cased text in the top 102 languages with the largest Wikipedias
BERT bert bert-base-multilingual-cased (New, recommended) 12-layer, 768-hidden, 12-heads, 110M parameters.
Trained on cased text in the top 104 languages with the largest Wikipedias
BERT bert bert-base-chinese 12-layer, 768-hidden, 12-heads, 110M parameters.
Trained on cased Chinese Simplified and Traditional text.
BERT bert bert-base-german-cased 12-layer, 768-hidden, 12-heads, 110M parameters.
Trained on cased German text by Deepset.ai
BERT bert bert-large-uncased-whole-word-masking 24-layer, 1024-hidden, 16-heads, 340M parameters.
Trained on lower-cased English text using Whole-Word-Masking
BERT bert bert-large-cased-whole-word-masking 24-layer, 1024-hidden, 16-heads, 340M parameters.
Trained on cased English text using Whole-Word-Masking
BERT bert bert-large-uncased-whole-word-masking-finetuned-squad 24-layer, 1024-hidden, 16-heads, 340M parameters.
The bert-large-uncased-whole-word-masking model fine-tuned on SQuAD
BERT bert bert-large-cased-whole-word-masking-finetuned-squad 24-layer, 1024-hidden, 16-heads, 340M parameters
The bert-large-cased-whole-word-masking model fine-tuned on SQuAD
BERT bert bert-base-cased-finetuned-mrpc 12-layer, 768-hidden, 12-heads, 110M parameters.
The bert-base-cased model fine-tuned on MRPC
XLNet xlnet xlnet-base-cased 12-layer, 768-hidden, 12-heads, 110M parameters.
XLNet English model
XLNet xlnet xlnet-large-cased 24-layer, 1024-hidden, 16-heads, 340M parameters.
XLNet Large English model
XLM xlm xlm-mlm-en-2048 12-layer, 2048-hidden, 16-heads
XLM English model
XLM xlm xlm-mlm-ende-1024 6-layer, 1024-hidden, 8-heads
XLM English-German Multi-language model
XLM xlm xlm-mlm-enfr-1024 6-layer, 1024-hidden, 8-heads
XLM English-French Multi-language model
XLM xlm xlm-mlm-enro-1024 6-layer, 1024-hidden, 8-heads
XLM English-Romanian Multi-language model
XLM xlm xlm-mlm-xnli15-1024 12-layer, 1024-hidden, 8-heads
XLM Model pre-trained with MLM on the 15 XNLI languages
XLM xlm xlm-mlm-tlm-xnli15-1024 12-layer, 1024-hidden, 8-heads
XLM Model pre-trained with MLM + TLM on the 15 XNLI languages
XLM xlm xlm-clm-enfr-1024 12-layer, 1024-hidden, 8-heads
XLM English model trained with CLM (Causal Language Modeling)
XLM xlm xlm-clm-ende-1024 6-layer, 1024-hidden, 8-heads
XLM English-German Multi-language model trained with CLM (Causal Language Modeling)
RoBERTa roberta roberta-base 12-layer, 768-hidden, 12-heads, 125M parameters
RoBERTa using the BERT-base architecture
RoBERTa roberta roberta-large 24-layer, 1024-hidden, 16-heads, 355M parameters
RoBERTa using the BERT-large architecture
RoBERTa roberta roberta-large-mnli 24-layer, 1024-hidden, 16-heads, 355M parameters
roberta-large fine-tuned on MNLI.

Acknowledgements

None of this would have been possible without the hard work by the HuggingFace team in developing the Pytorch-Transformers library.

Project details

This version: 0.2.7