
Ask2Transformers is a library for zero-shot classification based on Transformers.

Project description

Ask2Transformers

A Framework for Textual Entailment-based Zero-Shot Text Classification


This repository contains the code for out-of-the-box, ready-to-use zero-shot classifiers for different tasks, such as Topic Labelling or Relation Extraction. It is built on top of the 🤗 HuggingFace Transformers library, so you are free to choose among hundreds of models. You can either use a dataset-specific classifier or define one yourself with just label descriptions or templates! The repository contains the code for the following publications:

Supported (and benchmarked) tasks:

Follow the links to see some examples of how to use the library on each task.
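Under the hood, each candidate label is verbalised as a natural-language hypothesis and an NLI model scores how strongly the input text entails it. Below is a minimal sketch of that mechanism written directly against the HuggingFace Transformers API (not the a2t API itself); the example text, labels and hypothesis template are illustrative assumptions.

from transformers import AutoTokenizer, AutoModelForSequenceClassification
import torch

# Entailment-based zero-shot classification sketched with plain Transformers.
# The premise is the text to classify; each candidate label becomes a hypothesis.
model_name = "roberta-large-mnli"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSequenceClassification.from_pretrained(model_name)

premise = "The cell nucleus contains most of the cell's genetic material."  # illustrative text
labels = ["biology", "economy", "politics"]                                 # illustrative labels
hypotheses = [f"The text is about {label}." for label in labels]

inputs = tokenizer([premise] * len(labels), hypotheses,
                   return_tensors="pt", padding=True, truncation=True)
with torch.no_grad():
    logits = model(**inputs).logits

# For roberta-large-mnli the output classes are 0=contradiction, 1=neutral, 2=entailment.
entailment_scores = logits.softmax(dim=-1)[:, 2]
print(labels[entailment_scores.argmax()])  # label whose hypothesis is most strongly entailed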

Installation

Using pip (check the latest release)

pip install a2t

Or by cloning the repository

git clone https://github.com/osainz59/Ask2Transformers.git
cd Ask2Transformers
python -m pip install .

Available models

By default, the roberta-large-mnli checkpoint is used to perform the inference. You can try different models for zero-shot classification, but they need to be fine-tuned on an NLI task and be compatible with the AutoModelForSequenceClassification class from Transformers. For example (a usage sketch follows the list):

  • roberta-large-mnli
  • joeddav/xlm-roberta-large-xnli
  • facebook/bart-large-mnli
  • microsoft/deberta-v2-xlarge-mnli
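Any of these checkpoints can be plugged into the Transformers zero-shot-classification pipeline, which wraps the same entailment formulation. This is only a sketch of swapping checkpoints, not the a2t API; the input text, candidate labels and template are illustrative assumptions.

from transformers import pipeline

# Zero-shot classification via the pipeline; swap the model name for any
# NLI-fine-tuned checkpoint from the list above.
classifier = pipeline("zero-shot-classification", model="facebook/bart-large-mnli")

result = classifier(
    "The stock market closed higher after the central bank announcement.",  # illustrative text
    candidate_labels=["economy", "sports", "science"],                       # illustrative labels
    hypothesis_template="This text is about {}.",
)
print(result["labels"][0], result["scores"][0])  # best label and its score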

Coming soon: support for t5-large-style generative models.

Citation

Cite this paper for work related to Relation Extraction and the other entailment-based tasks:

Coming soon.

Cite this paper for work related to the library or topic labelling (A2TDomains or our paper results):

@inproceedings{sainz-rigau-2021-ask2transformers,
    title = "{A}sk2{T}ransformers: Zero-Shot Domain labelling with Pretrained Language Models",
    author = "Sainz, Oscar  and
      Rigau, German",
    booktitle = "Proceedings of the 11th Global Wordnet Conference",
    month = jan,
    year = "2021",
    address = "University of South Africa (UNISA)",
    publisher = "Global Wordnet Association",
    url = "https://www.aclweb.org/anthology/2021.gwc-1.6",
    pages = "44--52",
    abstract = "In this paper we present a system that exploits different pre-trained Language Models for assigning domain labels to WordNet synsets without any kind of supervision. Furthermore, the system is not restricted to use a particular set of domain labels. We exploit the knowledge encoded within different off-the-shelf pre-trained Language Models and task formulations to infer the domain label of a particular WordNet definition. The proposed zero-shot system achieves a new state-of-the-art on the English dataset used in the evaluation.",
}



Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

a2t-0.2.0.tar.gz (48.4 kB)

Uploaded Source

Built Distribution

a2t-0.2.0-py3-none-any.whl (56.8 kB)

Uploaded Python 3
