transformers

State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch

These details have been verified by PyPI

Maintainers

ArthurZucker Cyril123456789 lysandre Thomwolf vasqu

These details have not been verified by PyPI

Project links

Homepage

Project description

State-of-the-art Natural Language Processing for PyTorch and TensorFlow 2.0

🤗 Transformers provides thousands of pretrained models to perform tasks on texts such as classification, information extraction, question answering, summarization, translation, text generation, etc in 100+ languages. Its aim is to make cutting-edge NLP easier to use for everyone.

🤗 Transformers provides APIs to quickly download and use those pretrained models on a given text, fine-tune them on your own datasets then share them with the community on our model hub. At the same time, each python module defining an architecture can be used as a standalone and modified to enable quick research experiments.

🤗 Transformers is backed by the two most popular deep learning libraries, PyTorch and TensorFlow, with a seamless integration between them, allowing you to train your models with one then load it for inference with the other.

Online demos

You can test most of our models directly on their pages from the model hub. We also offer private model hosting, versioning, & an inference API to use those models.

Here are a few examples:

Write With Transformer, built by the Hugging Face team, is the official demo of this repo’s text generation capabilities.

Quick tour

To immediately use a model on a given text, we provide the pipeline API. Pipelines group together a pretrained model with the preprocessing that was used during that model training. Here is how to quickly use a pipeline to classify positive versus negative texts

>>> from transformers import pipeline

# Allocate a pipeline for sentiment-analysis
>>> classifier = pipeline('sentiment-analysis')
>>> classifier('We are very happy to include pipeline into the transformers repository.')
[{'label': 'POSITIVE', 'score': 0.9978193640708923}]

The second line of code downloads and caches the pretrained model used by the pipeline, the third line evaluates it on the given text. Here the answer is "positive" with a confidence of 99.8%.

This is another example of pipeline used for that can extract question answers from some context:

>>> from transformers import pipeline

# Allocate a pipeline for question-answering
>>> question_answerer = pipeline('question-answering')
>>> question_answerer({
...     'question': 'What is the name of the repository ?',
...     'context': 'Pipeline have been included in the huggingface/transformers repository'
... })
{'score': 0.5135612454720828, 'start': 35, 'end': 59, 'answer': 'huggingface/transformers'}

On top of the answer, the pretrained model used here returned its confidence score, along with the start position and its end position in the tokenized sentence. You can learn more about the tasks supported by the pipeline API in this tutorial.

To download and use any of the pretrained models on your given task, you just need to use those three lines of codes (PyTorch version):

>>> from transformers import AutoTokenizer, AutoModel

>>> tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
>>> model = AutoModel.from_pretrained("bert-base-uncased")

>>> inputs = tokenizer("Hello world!", return_tensors="pt")
>>> outputs = model(**inputs)

or for TensorFlow:

>>> from transformers import AutoTokenizer, TFAutoModel

>>> tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
>>> model = TFAutoModel.from_pretrained("bert-base-uncased")

>>> inputs = tokenizer("Hello world!", return_tensors="tf")
>>> outputs = model(**inputs)

The tokenizer is responsible for all the preprocessing the pretrained model expects, and can be called directly on one (or list) of texts (as we can see on the fourth line of both code examples). It will output a dictionary you can directly pass to your model (which is done on the fifth line).

The model itself is a regular Pytorch nn.Module or a TensorFlow tf.keras.Model (depending on your backend) which you can use normally. For instance, this tutorial explains how to integrate such a model in classic PyTorch or TensorFlow training loop, or how to use our Trainer API to quickly fine-tune the on a new dataset.

Why should I use transformers?

Easy-to-use state-of-the-art models:
- High performance on NLU and NLG tasks.
- Low barrier to entry for educators and practitioners.
- Few user-facing abstractions with just three classes to learn.
- A unified API for using all our pretrained models.
Lower compute costs, smaller carbon footprint:
- Researchers can share trained models instead of always retraining.
- Practitioners can reduce compute time and production costs.
- Dozens of architectures with over 2,000 pretrained models, some in more than 100 languages.
Choose the right framework for every part of a model's lifetime:
- Train state-of-the-art models in 3 lines of code.
- Move a single model between TF2.0/PyTorch frameworks at will.
- Seamlessly pick the right framework for training, evaluation, production.
Easily customize a model or an example to your needs:
- Examples for each architecture to reproduce the results by the official authors of said architecture.
- Expose the models internal as consistently as possible.
- Model files can be used independently of the library for quick experiments.

Why shouldn't I use transformers?

This library is not a modular toolbox of building blocks for neural nets. The code in the model files is not refactored with additional abstractions on purpose, so that researchers can quickly iterate on each of the models without diving in additional abstractions/files.
The training API is not intended to work on any model but is optimized to work with the models provided by the library. For generic machine learning loops, you should use another library.
While we strive to present as many use cases as possible, the scripts in our examples folder are just that: examples. It is expected that they won't work out-of-the box on your specific problem and that you will be required to change a few lines of code to adapt them to your needs.

Installation

With pip

This repository is tested on Python 3.6+, PyTorch 1.0.0+ (PyTorch 1.3.1+ for examples) and TensorFlow 2.0.

You should install 🤗 Transformers in a virtual environment. If you're unfamiliar with Python virtual environments, check out the user guide.

First, create a virtual environment with the version of Python you're going to use and activate it.

Then, you will need to install at least one of TensorFlow 2.0, PyTorch or Flax. Please refer to TensorFlow installation page, PyTorch installation page regarding the specific install command for your platform and/or Flax installation page.

When TensorFlow 2.0 and/or PyTorch has been installed, 🤗 Transformers can be installed using pip as follows:

pip install transformers

If you'd like to play with the examples or need the bleeding edge of the code and can't wait for a new release, you must install the library from source.

With conda

Since Transformers version v4.0.0, we now have a conda channel: huggingface.

🤗 Transformers can be installed using conda as follows:

conda install -c huggingface transformers

Follow the installation pages of TensorFlow, PyTorch or Flax to see how to install them with conda.

Models architectures

All the model checkpoints provided by 🤗 Transformers are seamlessly integrated from the huggingface.co model hub where they are uploaded directly by users and organizations.

Current number of checkpoints:

🤗 Transformers currently provides the following architectures (see here for a high-level summary of each them):

ALBERT (from Google Research and the Toyota Technological Institute at Chicago) released with the paper ALBERT: A Lite BERT for Self-supervised Learning of Language Representations, by Zhenzhong Lan, Mingda Chen, Sebastian Goodman, Kevin Gimpel, Piyush Sharma, Radu Soricut.
BART (from Facebook) released with the paper BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension by Mike Lewis, Yinhan Liu, Naman Goyal, Marjan Ghazvininejad, Abdelrahman Mohamed, Omer Levy, Ves Stoyanov and Luke Zettlemoyer.
BARThez (from École polytechnique) released with the paper BARThez: a Skilled Pretrained French Sequence-to-Sequence Model by Moussa Kamal Eddine, Antoine J.-P. Tixier, Michalis Vazirgiannis.
BERT (from Google) released with the paper BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding by Jacob Devlin, Ming-Wei Chang, Kenton Lee and Kristina Toutanova.
BERT For Sequence Generation (from Google) released with the paper Leveraging Pre-trained Checkpoints for Sequence Generation Tasks by Sascha Rothe, Shashi Narayan, Aliaksei Severyn.
BigBird-RoBERTa (from Google Research) released with the paper Big Bird: Transformers for Longer Sequences by Manzil Zaheer, Guru Guruganesh, Avinava Dubey, Joshua Ainslie, Chris Alberti, Santiago Ontanon, Philip Pham, Anirudh Ravula, Qifan Wang, Li Yang, Amr Ahmed.
Blenderbot (from Facebook) released with the paper Recipes for building an open-domain chatbot by Stephen Roller, Emily Dinan, Naman Goyal, Da Ju, Mary Williamson, Yinhan Liu, Jing Xu, Myle Ott, Kurt Shuster, Eric M. Smith, Y-Lan Boureau, Jason Weston.
BlenderbotSmall (from Facebook) released with the paper Recipes for building an open-domain chatbot by Stephen Roller, Emily Dinan, Naman Goyal, Da Ju, Mary Williamson, Yinhan Liu, Jing Xu, Myle Ott, Kurt Shuster, Eric M. Smith, Y-Lan Boureau, Jason Weston.
BORT (from Alexa) released with the paper Optimal Subarchitecture Extraction For BERT by Adrian de Wynter and Daniel J. Perry.
CamemBERT (from Inria/Facebook/Sorbonne) released with the paper CamemBERT: a Tasty French Language Model by Louis Martin*, Benjamin Muller*, Pedro Javier Ortiz Suárez*, Yoann Dupont, Laurent Romary, Éric Villemonte de la Clergerie, Djamé Seddah and Benoît Sagot.
ConvBERT (from YituTech) released with the paper ConvBERT: Improving BERT with Span-based Dynamic Convolution by Zihang Jiang, Weihao Yu, Daquan Zhou, Yunpeng Chen, Jiashi Feng, Shuicheng Yan.
CTRL (from Salesforce) released with the paper CTRL: A Conditional Transformer Language Model for Controllable Generation by Nitish Shirish Keskar*, Bryan McCann*, Lav R. Varshney, Caiming Xiong and Richard Socher.
DeBERTa (from Microsoft) released with the paper DeBERTa: Decoding-enhanced BERT with Disentangled Attention by Pengcheng He, Xiaodong Liu, Jianfeng Gao, Weizhu Chen.
DeBERTa-v2 (from Microsoft) released with the paper DeBERTa: Decoding-enhanced BERT with Disentangled Attention by Pengcheng He, Xiaodong Liu, Jianfeng Gao, Weizhu Chen.
DialoGPT (from Microsoft Research) released with the paper DialoGPT: Large-Scale Generative Pre-training for Conversational Response Generation by Yizhe Zhang, Siqi Sun, Michel Galley, Yen-Chun Chen, Chris Brockett, Xiang Gao, Jianfeng Gao, Jingjing Liu, Bill Dolan.
DistilBERT (from HuggingFace), released together with the paper DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter by Victor Sanh, Lysandre Debut and Thomas Wolf. The same method has been applied to compress GPT2 into DistilGPT2, RoBERTa into DistilRoBERTa, Multilingual BERT into DistilmBERT and a German version of DistilBERT.
DPR (from Facebook) released with the paper Dense Passage Retrieval for Open-Domain Question Answering by Vladimir Karpukhin, Barlas Oğuz, Sewon Min, Patrick Lewis, Ledell Wu, Sergey Edunov, Danqi Chen, and Wen-tau Yih.
ELECTRA (from Google Research/Stanford University) released with the paper ELECTRA: Pre-training text encoders as discriminators rather than generators by Kevin Clark, Minh-Thang Luong, Quoc V. Le, Christopher D. Manning.
FlauBERT (from CNRS) released with the paper FlauBERT: Unsupervised Language Model Pre-training for French by Hang Le, Loïc Vial, Jibril Frej, Vincent Segonne, Maximin Coavoux, Benjamin Lecouteux, Alexandre Allauzen, Benoît Crabbé, Laurent Besacier, Didier Schwab.
Funnel Transformer (from CMU/Google Brain) released with the paper Funnel-Transformer: Filtering out Sequential Redundancy for Efficient Language Processing by Zihang Dai, Guokun Lai, Yiming Yang, Quoc V. Le.
GPT (from OpenAI) released with the paper Improving Language Understanding by Generative Pre-Training by Alec Radford, Karthik Narasimhan, Tim Salimans and Ilya Sutskever.
GPT-2 (from OpenAI) released with the paper Language Models are Unsupervised Multitask Learners by Alec Radford*, Jeffrey Wu*, Rewon Child, David Luan, Dario Amodei** and Ilya Sutskever**.
GPT Neo (from EleutherAI) released in the repository EleutherAI/gpt-neo by Sid Black, Stella Biderman, Leo Gao, Phil Wang and Connor Leahy.
I-BERT (from Berkeley) released with the paper I-BERT: Integer-only BERT Quantization by Sehoon Kim, Amir Gholami, Zhewei Yao, Michael W. Mahoney, Kurt Keutzer
LayoutLM (from Microsoft Research Asia) released with the paper LayoutLM: Pre-training of Text and Layout for Document Image Understanding by Yiheng Xu, Minghao Li, Lei Cui, Shaohan Huang, Furu Wei, Ming Zhou.
LED (from AllenAI) released with the paper Longformer: The Long-Document Transformer by Iz Beltagy, Matthew E. Peters, Arman Cohan.
Longformer (from AllenAI) released with the paper Longformer: The Long-Document Transformer by Iz Beltagy, Matthew E. Peters, Arman Cohan.
LXMERT (from UNC Chapel Hill) released with the paper LXMERT: Learning Cross-Modality Encoder Representations from Transformers for Open-Domain Question Answering by Hao Tan and Mohit Bansal.
M2M100 (from Facebook) released with the paper Beyond English-Centric Multilingual Machine Translation by by Angela Fan, Shruti Bhosale, Holger Schwenk, Zhiyi Ma, Ahmed El-Kishky, Siddharth Goyal, Mandeep Baines, Onur Celebi, Guillaume Wenzek, Vishrav Chaudhary, Naman Goyal, Tom Birch, Vitaliy Liptchinsky, Sergey Edunov, Edouard Grave, Michael Auli, Armand Joulin.
MarianMT Machine translation models trained using OPUS data by Jörg Tiedemann. The Marian Framework is being developed by the Microsoft Translator Team.
MBart (from Facebook) released with the paper Multilingual Denoising Pre-training for Neural Machine Translation by Yinhan Liu, Jiatao Gu, Naman Goyal, Xian Li, Sergey Edunov, Marjan Ghazvininejad, Mike Lewis, Luke Zettlemoyer.
MBart-50 (from Facebook) released with the paper Multilingual Translation with Extensible Multilingual Pretraining and Finetuning by Yuqing Tang, Chau Tran, Xian Li, Peng-Jen Chen, Naman Goyal, Vishrav Chaudhary, Jiatao Gu, Angela Fan.
MPNet (from Microsoft Research) released with the paper MPNet: Masked and Permuted Pre-training for Language Understanding by Kaitao Song, Xu Tan, Tao Qin, Jianfeng Lu, Tie-Yan Liu.
MT5 (from Google AI) released with the paper mT5: A massively multilingual pre-trained text-to-text transformer by Linting Xue, Noah Constant, Adam Roberts, Mihir Kale, Rami Al-Rfou, Aditya Siddhant, Aditya Barua, Colin Raffel.
Pegasus (from Google) released with the paper PEGASUS: Pre-training with Extracted Gap-sentences for Abstractive Summarization> by Jingqing Zhang, Yao Zhao, Mohammad Saleh and Peter J. Liu.
ProphetNet (from Microsoft Research) released with the paper ProphetNet: Predicting Future N-gram for Sequence-to-Sequence Pre-training by Yu Yan, Weizhen Qi, Yeyun Gong, Dayiheng Liu, Nan Duan, Jiusheng Chen, Ruofei Zhang and Ming Zhou.
Reformer (from Google Research) released with the paper Reformer: The Efficient Transformer by Nikita Kitaev, Łukasz Kaiser, Anselm Levskaya.
RoBERTa (from Facebook), released together with the paper a Robustly Optimized BERT Pretraining Approach by Yinhan Liu, Myle Ott, Naman Goyal, Jingfei Du, Mandar Joshi, Danqi Chen, Omer Levy, Mike Lewis, Luke Zettlemoyer, Veselin Stoyanov.
SpeechToTextTransformer (from Facebook), released together with the paper fairseq S2T: Fast Speech-to-Text Modeling with fairseq by Changhan Wang, Yun Tang, Xutai Ma, Anne Wu, Dmytro Okhonko, Juan Pino.
SqueezeBert released with the paper SqueezeBERT: What can computer vision teach NLP about efficient neural networks? by Forrest N. Iandola, Albert E. Shaw, Ravi Krishna, and Kurt W. Keutzer.
T5 (from Google AI) released with the paper Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer by Colin Raffel and Noam Shazeer and Adam Roberts and Katherine Lee and Sharan Narang and Michael Matena and Yanqi Zhou and Wei Li and Peter J. Liu.
TAPAS (from Google AI) released with the paper TAPAS: Weakly Supervised Table Parsing via Pre-training by Jonathan Herzig, Paweł Krzysztof Nowak, Thomas Müller, Francesco Piccinno and Julian Martin Eisenschlos.
Transformer-XL (from Google/CMU) released with the paper Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context by Zihang Dai*, Zhilin Yang*, Yiming Yang, Jaime Carbonell, Quoc V. Le, Ruslan Salakhutdinov.
Vision Transformer (ViT) (from Google AI) released with the paper An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale by Alexey Dosovitskiy, Lucas Beyer, Alexander Kolesnikov, Dirk Weissenborn, Xiaohua Zhai, Thomas Unterthiner, Mostafa Dehghani, Matthias Minderer, Georg Heigold, Sylvain Gelly, Jakob Uszkoreit, Neil Houlsby.
Wav2Vec2 (from Facebook AI) released with the paper wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations by Alexei Baevski, Henry Zhou, Abdelrahman Mohamed, Michael Auli.
XLM (from Facebook) released together with the paper Cross-lingual Language Model Pretraining by Guillaume Lample and Alexis Conneau.
XLM-ProphetNet (from Microsoft Research) released with the paper ProphetNet: Predicting Future N-gram for Sequence-to-Sequence Pre-training by Yu Yan, Weizhen Qi, Yeyun Gong, Dayiheng Liu, Nan Duan, Jiusheng Chen, Ruofei Zhang and Ming Zhou.
XLM-RoBERTa (from Facebook AI), released together with the paper Unsupervised Cross-lingual Representation Learning at Scale by Alexis Conneau*, Kartikay Khandelwal*, Naman Goyal, Vishrav Chaudhary, Guillaume Wenzek, Francisco Guzmán, Edouard Grave, Myle Ott, Luke Zettlemoyer and Veselin Stoyanov.
XLNet (from Google/CMU) released with the paper XLNet: Generalized Autoregressive Pretraining for Language Understanding by Zhilin Yang*, Zihang Dai*, Yiming Yang, Jaime Carbonell, Ruslan Salakhutdinov, Quoc V. Le.
XLSR-Wav2Vec2 (from Facebook AI) released with the paper Unsupervised Cross-Lingual Representation Learning For Speech Recognition by Alexis Conneau, Alexei Baevski, Ronan Collobert, Abdelrahman Mohamed, Michael Auli.
Want to contribute a new model? We have added a detailed guide and templates to guide you in the process of adding a new model. You can find them in the templates folder of the repository. Be sure to check the contributing guidelines and contact the maintainers or open an issue to collect feedbacks before starting your PR.

To check if each model has an implementation in PyTorch/TensorFlow/Flax or has an associated tokenizer backed by the 🤗 Tokenizers library, refer to this table

These implementations have been tested on several datasets (see the example scripts) and should match the performances of the original implementations. You can find more details on the performances in the Examples section of the documentation.

Learn more

Section	Description
Documentation	Full API documentation and tutorials
Task summary	Tasks supported by 🤗 Transformers
Preprocessing tutorial	Using the `Tokenizer` class to prepare data for the models
Training and fine-tuning	Using the models provided by 🤗 Transformers in a PyTorch/TensorFlow training loop and the `Trainer` API
Quick tour: Fine-tuning/usage scripts	Example scripts for fine-tuning models on a wide range of tasks
Model sharing and uploading	Upload and share your fine-tuned models with the community
Migration	Migrate to 🤗 Transformers from `pytorch-transformers` or `pytorch-pretrained-bert`

Citation

We now have a paper you can cite for the 🤗 Transformers library:

@inproceedings{wolf-etal-2020-transformers,
    title = "Transformers: State-of-the-Art Natural Language Processing",
    author = "Thomas Wolf and Lysandre Debut and Victor Sanh and Julien Chaumond and Clement Delangue and Anthony Moi and Pierric Cistac and Tim Rault and Rémi Louf and Morgan Funtowicz and Joe Davison and Sam Shleifer and Patrick von Platen and Clara Ma and Yacine Jernite and Julien Plu and Canwen Xu and Teven Le Scao and Sylvain Gugger and Mariama Drame and Quentin Lhoest and Alexander M. Rush",
    booktitle = "Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations",
    month = oct,
    year = "2020",
    address = "Online",
    publisher = "Association for Computational Linguistics",
    url = "https://www.aclweb.org/anthology/2020.emnlp-demos.6",
    pages = "38--45"
}

Project details

These details have been verified by PyPI

Maintainers

ArthurZucker Cyril123456789 lysandre Thomwolf vasqu

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

5.2.0

Feb 16, 2026

5.1.0

Feb 5, 2026

5.0.0

Jan 26, 2026

5.0.0rc3 pre-release

Jan 14, 2026

5.0.0rc2 pre-release

Jan 7, 2026

5.0.0rc1 pre-release

Dec 11, 2025

5.0.0rc0 pre-release

Dec 1, 2025

4.57.6

Jan 16, 2026

4.57.5

Jan 13, 2026

4.57.4

Jan 13, 2026

4.57.3

Nov 25, 2025

4.57.2

Nov 24, 2025

4.57.1

Oct 14, 2025

4.57.0 yanked

Oct 3, 2025

Reason this release was yanked:

Error in the setup causing installation issues

4.56.2

Sep 19, 2025

4.56.1

Sep 4, 2025

4.56.0

Aug 29, 2025

4.55.4

Aug 22, 2025

4.55.3

Aug 21, 2025

4.55.2

Aug 13, 2025

4.55.1

Aug 13, 2025

4.55.0

Aug 5, 2025

4.54.1

Jul 29, 2025

4.54.0

Jul 25, 2025

4.53.3

Jul 22, 2025

4.53.2

Jul 11, 2025

4.53.1

Jul 4, 2025

4.53.0

Jun 26, 2025

4.52.4

May 30, 2025

4.52.3

May 22, 2025

4.52.2

May 21, 2025

4.52.1

May 20, 2025

4.52.0 yanked

May 20, 2025

4.51.3

Apr 14, 2025

4.51.2

Apr 10, 2025

4.51.1

Apr 8, 2025

4.51.0

Apr 5, 2025

4.50.3

Mar 28, 2025

4.50.2

Mar 27, 2025

4.50.1

Mar 25, 2025

4.50.0

Mar 21, 2025

4.49.0

Feb 17, 2025

4.48.3

Feb 7, 2025

4.48.2

Jan 30, 2025

4.48.1

Jan 20, 2025

4.48.0

Jan 10, 2025

4.47.1

Dec 17, 2024

4.47.0

Dec 5, 2024

4.46.3

Nov 18, 2024

4.46.2

Nov 5, 2024

4.46.1

Oct 29, 2024

4.46.0 yanked

Oct 24, 2024

Reason this release was yanked:

This version unfortunately does not work with 3.8 but we did not drop the support yet

4.45.2

Oct 7, 2024

4.45.1

Sep 26, 2024

4.45.0

Sep 25, 2024

4.44.2

Aug 22, 2024

4.44.1

Aug 20, 2024

4.44.0

Aug 6, 2024

4.43.4

Aug 5, 2024

4.43.3

Jul 26, 2024

4.43.2

Jul 24, 2024

4.43.1

Jul 23, 2024

4.43.0

Jul 23, 2024

4.42.4

Jul 11, 2024

4.42.3

Jun 28, 2024

4.42.2

Jun 28, 2024

4.42.1

Jun 27, 2024

4.42.0

Jun 27, 2024

4.41.2

May 30, 2024

4.41.1

May 22, 2024

4.41.0

May 17, 2024

4.40.2

May 6, 2024

4.40.1

Apr 23, 2024

4.40.0

Apr 18, 2024

4.39.3

Apr 2, 2024

4.39.2

Mar 28, 2024

4.39.1

Mar 22, 2024

4.39.0

Mar 21, 2024

4.38.2

Mar 1, 2024

4.38.1

Feb 22, 2024

4.38.0

Feb 21, 2024

4.37.2

Jan 29, 2024

4.37.1

Jan 24, 2024

4.37.0

Jan 22, 2024

4.36.2

Dec 18, 2023

4.36.1

Dec 14, 2023

4.36.0

Dec 11, 2023

4.35.2

Nov 15, 2023

4.35.1

Nov 14, 2023

4.35.0

Nov 2, 2023

4.34.1

Oct 18, 2023

4.34.0

Oct 3, 2023

4.33.3

Sep 27, 2023

4.33.2

Sep 15, 2023

4.33.1

Sep 6, 2023

4.33.0

Sep 5, 2023

4.32.1

Aug 28, 2023

4.32.0

Aug 22, 2023

4.31.0

Jul 18, 2023

4.30.2

Jun 13, 2023

4.30.1

Jun 9, 2023

4.30.0

Jun 8, 2023

4.29.2

May 16, 2023

4.29.1

May 11, 2023

4.29.0

May 10, 2023

4.28.1

Apr 14, 2023

4.28.0

Apr 13, 2023

4.27.4

Mar 29, 2023

4.27.3

Mar 23, 2023

4.27.2

Mar 20, 2023

4.27.1

Mar 15, 2023

4.27.0

Mar 15, 2023

4.26.1

Feb 9, 2023

4.26.0

Jan 24, 2023

4.25.1

Dec 1, 2022

4.25.0 yanked

Dec 1, 2022

Reason this release was yanked:

Version was not properly set

4.24.0

Nov 1, 2022

4.23.1

Oct 11, 2022

4.23.0

Oct 10, 2022

4.22.2

Sep 27, 2022

4.22.1

Sep 16, 2022

4.22.0

Sep 14, 2022

4.21.3

Sep 5, 2022

4.21.2

Aug 24, 2022

4.21.1

Aug 4, 2022

4.21.0

Jul 27, 2022

4.20.1

Jun 21, 2022

4.20.0

Jun 16, 2022

4.19.4

Jun 10, 2022

4.19.3

Jun 9, 2022

4.19.2

May 16, 2022

4.19.1

May 13, 2022

4.19.0

May 12, 2022

4.18.0

Apr 6, 2022

4.17.0

Mar 3, 2022

4.16.2

Jan 31, 2022

4.16.1

Jan 28, 2022

4.16.0

Jan 27, 2022

4.15.0

Dec 22, 2021

4.14.1

Dec 15, 2021

4.14.0 yanked

Dec 15, 2021

Reason this release was yanked:

Circular import when both TensorFlow and Onnx are in the env

4.13.0

Dec 9, 2021

4.12.5

Nov 17, 2021

4.12.4

Nov 16, 2021

4.12.3

Nov 3, 2021

4.12.2

Oct 29, 2021

4.12.1

Oct 29, 2021

4.12.0

Oct 28, 2021

4.11.3

Oct 6, 2021

4.11.2

Sep 30, 2021

4.11.1

Sep 29, 2021

4.11.0

Sep 27, 2021

4.10.3

Sep 22, 2021

4.10.2

Sep 10, 2021

4.10.1

Sep 10, 2021

4.10.0

Aug 31, 2021

4.9.2

Aug 9, 2021

4.9.1

Jul 26, 2021

4.9.0

Jul 22, 2021

4.8.2

Jun 30, 2021

4.8.1

Jun 24, 2021

4.8.0

Jun 23, 2021

4.7.0

Jun 17, 2021

4.6.1

May 20, 2021

4.6.0

May 12, 2021

This version

4.5.1

Apr 13, 2021

4.5.0

Apr 6, 2021

4.4.2

Mar 18, 2021

4.4.1

Mar 16, 2021

4.4.0

Mar 16, 2021

4.3.3

Feb 24, 2021

4.3.2

Feb 9, 2021

4.3.1

Feb 9, 2021

4.3.0

Feb 8, 2021

4.3.0rc1 pre-release

Feb 4, 2021

4.2.2

Jan 21, 2021

4.2.1

Jan 14, 2021

4.2.0

Jan 13, 2021

4.1.1

Dec 17, 2020

4.1.0

Dec 17, 2020

4.0.1

Dec 9, 2020

4.0.0

Nov 30, 2020

4.0.0rc1 pre-release

Nov 19, 2020

3.5.1

Nov 13, 2020

3.5.0

Nov 10, 2020

3.4.0

Oct 20, 2020

3.3.1

Sep 29, 2020

3.3.0

Sep 28, 2020

3.2.0

Sep 22, 2020

3.1.0

Sep 1, 2020

3.0.2

Jul 6, 2020

3.0.1

Jul 3, 2020

3.0.0

Jun 29, 2020

2.11.0

Jun 2, 2020

2.10.0

May 22, 2020

2.9.1

May 14, 2020

2.9.0

May 7, 2020

2.8.0

Apr 6, 2020

2.7.0

Mar 30, 2020

2.6.0

Mar 24, 2020

2.5.1

Feb 24, 2020

2.5.0

Feb 19, 2020

2.4.1

Jan 31, 2020

2.4.0

Jan 31, 2020

2.3.0

Dec 20, 2019

2.2.2

Dec 13, 2019

2.2.1

Dec 3, 2019

2.2.0

Nov 26, 2019

2.1.1

Oct 11, 2019

2.1.0

Oct 9, 2019

2.0.0

Sep 26, 2019

0.1

Aug 17, 2016

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

transformers-4.5.1.tar.gz (1.7 MB view details)

Uploaded Apr 13, 2021 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

transformers-4.5.1-py3-none-any.whl (2.1 MB view details)

Uploaded Apr 13, 2021 Python 3

File details

Details for the file transformers-4.5.1.tar.gz.

File metadata

Download URL: transformers-4.5.1.tar.gz
Upload date: Apr 13, 2021
Size: 1.7 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/47.1.0 requests-toolbelt/0.9.1 tqdm/4.49.0 CPython/3.7.9

File hashes

Hashes for transformers-4.5.1.tar.gz
Algorithm	Hash digest
SHA256	`3508e3b032cf0f5342c67836de4b121aa5c435c959472a28054ba895ea59cca7`
MD5	`9f750399fe3f1e6b8bcf849b3ace0f27`
BLAKE2b-256	`2270d2a2283be01546862aea76e7cd41724c0e5a61322cedf5694c0d4eda9bcf`

See more details on using hashes here.

File details

Details for the file transformers-4.5.1-py3-none-any.whl.

File metadata

Download URL: transformers-4.5.1-py3-none-any.whl
Upload date: Apr 13, 2021
Size: 2.1 MB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/47.1.0 requests-toolbelt/0.9.1 tqdm/4.49.0 CPython/3.7.9

File hashes

Hashes for transformers-4.5.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`0a57d1cd9301a617c7015d7184228984abdfb1ae2158c29cfb32582219756d23`
MD5	`ee8fac1c29358573cf15baed139c3bca`
BLAKE2b-256	`d8b257495b5309f09fa501866e225c84532d1fd89536ea62406b2181933fb418`

See more details on using hashes here.

transformers 4.5.1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

State-of-the-art Natural Language Processing for PyTorch and TensorFlow 2.0

Online demos

Quick tour

Why should I use transformers?

Why shouldn't I use transformers?

Installation

With pip

With conda

Models architectures

Learn more

Citation

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes