Explainable Leaderboards for Natural Language Processing
Project description
ExplainaBoard: An Explainable Leaderboard for NLP
What is ExplainaBoard?
When developing a natural language processing (NLP or AI) system, often one of the hardest things is to understand where your system is working and where it is failing, and deciding what to do next. ExplainaBoard is a tool that inspects your system outputs, identifies what is working and what is not working, and helps inspire you with ideas of where to go next.
It offers a number of different ways with which you can evaluate and understand your data:
- Single-system Analysis: What is a system good or bad at?
- Pairwise Analysis: Where is one system better (worse) than another?
- Data Bias Analysis: What are the characteristics of different evaluated datasets?
- Common Errors: What are common mistakes that top-5 systems made?
- Fine-grained Error Analysis: where do errors occur?
- System Combination: Is there potential complementarity between different systems?
Using Explainaboard
ExplainaBoard can be used online or offline.
Online Usage
Browse the web interface, which gives you the ability to browse outputs and upload your own system outputs.
Offline Usage
First, follow the installation directions below, then take a look at our CLI examples.
Install Method 1 - Standard Use: Simple installation from PyPI (Python 3 only)
pip install --upgrade pip # recommending the newest version of pip.
pip install explainaboard
python -m spacy download en_core_web_sm # if you plan to use the TextClassificationProcessor
Install Method 2 - Development: Install from the source and develop locally (Python 3 only)
# Clone current repo
git clone https://github.com/neulab/ExplainaBoard.git
cd ExplainaBoard
# Install the required dependencies and dev dependencies
pip install ."[dev]"
pre-commit install
- Testing: To run tests, you can run
python -m unittest
. - Linting and Code Style: This project uses flake8 (linter) and black (formatter). They are enforced in the pre-commit hook and in the CI pipeline.
- run
python -m black .
to format code - run
flake8
to lint code - You can also configure your IDE to automatically format and lint the files as you are writing code.
- run
After trying things out in the CLI, you can read how to add new features, tasks, or file formats.
Acknowledgement
ExplainaBoard is developed by Carnegie Mellon University, Inspired Cognition Inc., and other collaborators. If you find it useful in research, you can cite it in papers:
@inproceedings{liu-etal-2021-explainaboard,
title = "{E}xplaina{B}oard: An Explainable Leaderboard for {NLP}",
author = "Liu, Pengfei and Fu, Jinlan and Xiao, Yang and Yuan, Weizhe and Chang, Shuaichen and Dai, Junqi and Liu, Yixin and Ye, Zihuiwen and Neubig, Graham",
booktitle = "Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing: System Demonstrations",
month = aug,
year = "2021",
address = "Online",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2021.acl-demo.34",
doi = "10.18653/v1/2021.acl-demo.34",
pages = "280--289",
}
We thanks all authors who share their system outputs with us: Ikuya Yamada, Stefan Schweter, Colin Raffel, Yang Liu, Li Dong. We also thank Vijay Viswanathan, Yiran Chen, Hiroaki Hayashi for useful discussion and feedback about ExplainaBoard.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file explainaboard-0.11.2.tar.gz
.
File metadata
- Download URL: explainaboard-0.11.2.tar.gz
- Upload date:
- Size: 144.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.9.14
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 96e3ddb788ac828bf304bab89783f03bb4cda9ad63de3def1167ef022be74b39 |
|
MD5 | 003bbbd9f64b74a773d0e72202510f1d |
|
BLAKE2b-256 | 33829943dd43fbf2f2c9ed8f15c0fc20281c40919a214bc52a974d019a3e9465 |
File details
Details for the file explainaboard-0.11.2-py2.py3-none-any.whl
.
File metadata
- Download URL: explainaboard-0.11.2-py2.py3-none-any.whl
- Upload date:
- Size: 225.9 kB
- Tags: Python 2, Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.9.14
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 2b4482188476b62c535a5422effc0f9f8bd23afc94db08ab04531406d4e46db6 |
|
MD5 | 90f4870697fd5a3b6833826559801935 |
|
BLAKE2b-256 | d64db1dc81a0c69f8c031e7777ef6b5c7dc4d7dcbcf4500d3762c9b7132fac37 |