Skip to main content

Alignment and word error rate calculations

Project description

WordErrorRate.py

Alignment and word error rate calculations

Python version of WordErrorrate.jl, which, because of Julia, is much faster.

Installation

pip install git+https://github.com/davidavdav/WordErrorRate.py.git

Synopsis

Usage:

>>> import worderrorrate
>>> ## A single comparison / alignment between reference and hypothesis
>>> w = worderrorrate.WER(["this", "is", "a", "test"], "this is another".split())
>>> print(f"Full alignment:\n{w}")
Full alignment:
REF: this is a    test
HYP: this is * another
Eval    c  c d       s
>>> print("The word error rate is", w.wer())
The word error rate is 0.5
>>> refali, hypali, evalali = w.align()
>>> print("Alignment indices:", refali, hypali, evalali)
Alignment indices: [0, 1, 2, 3] [0, 1, None, 2] ['c', 'c', 'd', 's']
>>> w.nref ## number of words in the reference
4
>>> w.nhyp ## in hypothesis
3
>>> w.nsub ## substitutions
1
>>> w.nins ## insertions
0
>>> w.ndel ## deletions
1
>>> ## Aggregate over multiple pairs of reference and hypothesis
>>> ws = worderrorrate.WERs() ## multiple utterances
>>> ws.append(["this", "is", "a", "test"], "this is another".split())
>>> ws.append("a b c d".split(), "a c e".split())
>>> ws.wer()
0.5
>>> ws.nref
8
>>> ws.nhyp
6
>>> ws.nsub
2
>>> ws.nins
0
>>> ws.ndel
2

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

worderrorrate-0.1.2.tar.gz (4.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

worderrorrate-0.1.2-py3-none-any.whl (5.1 kB view details)

Uploaded Python 3

File details

Details for the file worderrorrate-0.1.2.tar.gz.

File metadata

  • Download URL: worderrorrate-0.1.2.tar.gz
  • Upload date:
  • Size: 4.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.10.9

File hashes

Hashes for worderrorrate-0.1.2.tar.gz
Algorithm Hash digest
SHA256 bba39608f52099eb50c1b65bbc5bd41778a81d141a6d98a384cfa477896b9fac
MD5 af94899bf3be917df9f9244b9036eaa2
BLAKE2b-256 f21ff4393d581c3e5428558c99971de3cdb4613600c01cca02fbea4f1a0427b9

See more details on using hashes here.

File details

Details for the file worderrorrate-0.1.2-py3-none-any.whl.

File metadata

  • Download URL: worderrorrate-0.1.2-py3-none-any.whl
  • Upload date:
  • Size: 5.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.10.9

File hashes

Hashes for worderrorrate-0.1.2-py3-none-any.whl
Algorithm Hash digest
SHA256 900ddbf8a7903328202f584a10ed8a9b4febc4e38034bdb8a09c59c143e1cc22
MD5 d5ad3ab4cf002de3775077f9ae9216ca
BLAKE2b-256 5931d59c808b6488f3ce4c9065bf19b844f6cb5f942f79dc514e00dfae3bc5d5

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page