Evaluation as a Service for Natural Language Processing

Project description

Documentation

Documentation is available at https://expressai.github.io/autoeval/.

Usage

To install the API, simply run

pip install eaas
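
To verify that the installation succeeded, a quick import check (this assumes nothing beyond the package name above) is:

python -c "import eaas"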

To use the API, you need to go through the following two steps.

  • Step 1: Load the default configuration and modify it to fit your needs.
from eaas import Config
config = Config()
# To see the metrics we support, run
print(config.metrics())
# dict_keys(['bart_score_summ', 'bart_score_mt', 'bert_score', 'bleu', 'chrf', 'comet', 'comet_qe', 'mover_score', 'prism', 'prism_qe', 'rouge1', 'rouge2', 'rougeL'])

# To see the default configuration of a metric, run
print(config.bleu.to_dict())
# {'smooth_method': 'exp', 'smooth_value': None, 'force': False, 'lowercase': False, 'use_effective_order': False}

# To modify the config, run
config.bleu.set_property("smooth_method", "floor")
print(config.bleu.to_dict())
# {'smooth_method': 'floor', 'smooth_value': None, 'force': False, 'lowercase': False, 'use_effective_order': False}
  • Step 2: Initialize the client and send your inputs.
from eaas import Client
client = Client()
client.load_config(config)  # The config you have created above

# To use this API for scoring, you need to format your input as a list of dictionaries.
# Each dictionary consists of `source` (string, optional), `references` (list of strings, optional)
# and `hypothesis` (string, required). Whether `source` and `references` are needed depends on the
# metrics you want to use. Please do not apply any preprocessing to `source`, `references` or
# `hypothesis`; we expect normal-cased, detokenized text. All preprocessing is handled by the metrics.
# Below is a simple example.

inputs = [{"source": "This is the source.", 
           "references": ["This is the reference one.", "This is the reference two."],
           "hypothesis": "This is the generated hypothesis."}]
metrics = ["bleu", "chrf"]  # Set to None to use all supported metrics

score_dic = client.score(inputs, task="sum", metrics=metrics, lang="en")
# inputs is a list of dicts, task is the task name, metrics is the list of metrics to compute,
# lang is the two-letter language code

The output looks like this:

# sample_level is a list of dicts; corpus_level is a dict
{
    'sample_level': [
        {'bleu': 32.46679154750991,
         'attr_compression': 1.2,
         'attr_copy_len': 2.0,
         'attr_coverage': 0.8,
         'attr_density': 2.0,
         'attr_hypothesis_len': 5,
         'attr_novelty': 0.5,
         'attr_repetition': 0.0,
         'attr_source_len': 6,
         'chrf': 38.56890099861521}
    ],
    'corpus_level': {
        'corpus_bleu': 32.46679154750991,
        'corpus_attr_compression': 1.2,
        'corpus_attr_copy_len': 2.0,
        'corpus_attr_coverage': 0.8,
        'corpus_attr_density': 2.0,
        'corpus_attr_hypothesis_len': 5.0,
        'corpus_attr_novelty': 0.5,
        'corpus_attr_repetition': 0.0,
        'corpus_attr_source_len': 6.0,
        'corpus_chrf': 38.56890099861521
    }
}
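
Individual scores can then be read straight out of the returned dictionary. A minimal sketch, assuming only the structure shown above:

# score_dic follows the structure shown above
sample_scores = score_dic["sample_level"]   # one dict per input example
corpus_scores = score_dic["corpus_level"]   # aggregates over the whole corpus

print(sample_scores[0]["bleu"])      # BLEU score of the first (and only) example
print(corpus_scores["corpus_chrf"])  # corpus-level chrF score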

Long-term TODO

  • Polish the functionality
  • Restrict access to the AWS IP only (set up a domain along the lines of api.eaas)
  • Package the code for distribution
  • Corpus-level metric computation: double-check the corpus-level BLEU computation (and whether other metrics have similar issues); we may need to redesign the JSON format of the returned results
  • Write a document summarizing each metric's default preprocessing and hyperparameter settings, and consider exposing an interface for users to set them
  • Confidence interval computation
  • Fine-grained analysis
  • Optimize API access efficiency
