Voicegain Compare transcription

A utility for comparing an output transcript against a reference transcript.

Uses Ukkonen's algorithm to efficiently compute the Levenshtein distance and the character error rate (CER).
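As a rough illustration of the idea, the sketch below computes the Levenshtein distance with a diagonal band that is doubled until the result fits inside it, which is the cutoff strategy usually attributed to Ukkonen. It is not the package's own code: the function names `banded_distance`, `levenshtein`, and `cer` are hypothetical, and CER is assumed here to be the edit distance divided by the reference length.

```python
def banded_distance(a, b, k):
    """Return the Levenshtein distance between a and b if it is <= k, else None.

    Only cells with |i - j| <= k are evaluated, so the cost is O(k * len(a)).
    """
    n, m = len(a), len(b)
    if abs(n - m) > k:
        return None  # the length difference alone already exceeds k
    inf = k + 1  # stands in for "more than k"
    # Row 0: turning the empty prefix of a into b[:j] costs j insertions.
    prev = [j if j <= k else inf for j in range(m + 1)]
    for i in range(1, n + 1):
        cur = [inf] * (m + 1)
        if i <= k:
            cur[0] = i  # deleting the first i characters of a
        for j in range(max(1, i - k), min(m, i + k) + 1):
            cost = 0 if a[i - 1] == b[j - 1] else 1
            cur[j] = min(prev[j - 1] + cost,  # match or substitution
                         prev[j] + 1,         # deletion from a
                         cur[j - 1] + 1,      # insertion into a
                         inf)
        prev = cur
    return prev[m] if prev[m] <= k else None


def levenshtein(a, b):
    """Exact Levenshtein distance, found by doubling the band width."""
    k = max(1, abs(len(a) - len(b)))
    while True:
        d = banded_distance(a, b, k)
        if d is not None:
            return d
        k *= 2


def cer(reference, output):
    """Character error rate: edit distance divided by the reference length."""
    if not reference:
        raise ValueError("reference must not be empty")
    return levenshtein(reference, output) / len(reference)
```

When the two transcripts are close, the band stays narrow and most of the full distance matrix is never touched, which is where the efficiency gain comes from.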

Additionally, it can output alignment information.
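To give a sense of what alignment information can look like, here is a minimal sketch that fills the full edit-distance table and walks it backwards to recover one optimal alignment. The function name `align` and the tuple layout `(operation, reference_item, output_item)` are assumptions made for this example; the format that transcribe-compare actually emits may differ.

```python
def align(reference, output):
    """Return one optimal alignment as a list of (op, ref_item, out_item).

    op is one of "match", "substitute", "delete", "insert". Works on
    strings (characters) or on lists of words.
    """
    n, m = len(reference), len(output)
    # dist[i][j] = edit distance between reference[:i] and output[:j]
    dist = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(n + 1):
        dist[i][0] = i
    for j in range(m + 1):
        dist[0][j] = j
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            cost = 0 if reference[i - 1] == output[j - 1] else 1
            dist[i][j] = min(dist[i - 1][j - 1] + cost,
                             dist[i - 1][j] + 1,
                             dist[i][j - 1] + 1)

    # Walk back from the bottom-right corner, preferring diagonal moves.
    ops = []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0:
            cost = 0 if reference[i - 1] == output[j - 1] else 1
            if dist[i][j] == dist[i - 1][j - 1] + cost:
                op = "match" if cost == 0 else "substitute"
                ops.append((op, reference[i - 1], output[j - 1]))
                i -= 1
                j -= 1
                continue
        if i > 0 and dist[i][j] == dist[i - 1][j] + 1:
            ops.append(("delete", reference[i - 1], None))
            i -= 1
        else:
            ops.append(("insert", None, output[j - 1]))
            j -= 1
    ops.reverse()
    return ops
```

Calling `align("cat", "cart")` yields four steps, exactly one of which is an insertion; the number of non-match steps always equals the edit distance.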

Usage

Usage: transcribe-compare [OPTIONS]

  Transcription compare tool provided by VoiceGain

Options:
  -r, --reference TEXT            source string
  -o, --output TEXT               target string
  -R, --reference_file FILENAME   source file path
  -O, --output_file FILENAME      target file path
  -a, --alignment                 Do you want to see the alignment result?
                                  True/False
  -e, --error_type [CER|WER]
  -j, --output_format [JSON|TABLE]
  -l, --to_lower                  Do you want to lower all the words?
                                  True/False
  -p, --remove_punctuation        Do you want to remove all the punctuation?
                                  True/False
  -P, --to_save_plot              Do you want to see the windows? True/False
  -s, --to_edit_step INTEGER      Please enter the step
  -w, --to_edit_width INTEGER     Please enter the width
  --help                          Show this message and exit.
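The effect of --to_lower, --remove_punctuation, and --error_type can be pictured with the following sketch. It is only an approximation written for this README: `normalize`, `edit_distance`, and `error_rate` are hypothetical helpers, punctuation is assumed to mean the ASCII set in `string.punctuation`, and words are assumed to be split on whitespace. The tool itself may normalize text differently.

```python
import string


def normalize(text, to_lower=False, remove_punctuation=False):
    """Apply the optional clean-up steps before comparison."""
    if to_lower:
        text = text.lower()
    if remove_punctuation:
        # Assumption: only ASCII punctuation is stripped.
        text = text.translate(str.maketrans("", "", string.punctuation))
    return text


def edit_distance(ref, hyp):
    """Plain two-row Levenshtein distance over any two sequences."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        cur = [i]
        for j, h in enumerate(hyp, start=1):
            cur.append(min(prev[j - 1] + (r != h),  # match or substitution
                           prev[j] + 1,             # deletion
                           cur[j - 1] + 1))         # insertion
        prev = cur
    return prev[-1]


def error_rate(reference, output, error_type="CER",
               to_lower=False, remove_punctuation=False):
    """CER compares characters; WER compares whitespace-separated words."""
    ref = normalize(reference, to_lower, remove_punctuation)
    out = normalize(output, to_lower, remove_punctuation)
    if error_type == "WER":
        ref, out = ref.split(), out.split()
    if len(ref) == 0:
        raise ValueError("reference is empty after normalization")
    return edit_distance(ref, out) / len(ref)
```

For example, comparing "Hello, world!" with "hello world" at word level counts both words as errors when no normalization is applied, and none once lowercasing and punctuation removal are both enabled.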

Dependencies

  • click
  • inflect
  • re (part of the Python standard library)
  • nltk
  • metaphone
  • matplotlib

Sample Commands

python transcribe-compare -R sample_data/The_Princess_and_the_Pea-reference.txt -O sample_data/The_Princess_and_the_Pea-output-1.txt -e CER

HTML Output

HTML Output: Single Compare - Stats

Related code

A script is available that uses transcribe-compare to compare results from the Voicegain and Google recognizers. You can find it here: https://github.com/voicegain/platform/tree/master/utility-scripts/test-transcribe

Acknowledgements

Contributed by VoiceGain.

VoiceGain provides Deep-Neural-Network-based Speech-to-Text (ASR), available in the Cloud and also as an Edge Deployment. It is accessible via a RESTful Web API or an MRCP v2 interface, and is suitable both for continuous large-vocabulary transcription (real-time or off-line) and for recognition using context-free grammars (e.g. GRXML). In addition, the VoiceGain platform provides an API-driven method for modifying the models used in speech-to-text: the language model, the pronunciation model, and the acoustic DNN model can all be modified.

License

MIT © VoiceGain
