ftvstt

Transcription APIs encapsulation

These details have not been verified by PyPI

Project description

#################################Description##################################

ftvstt is a France Télévisions Python library that encapsulates multiple online speech-to-text APIs so that they can be called as easily as possible.

It currently supports Amazon Transcribe, Google Cloud speech-to-text, Vocapia Voxsigma, and Bertin Mediaspeech.

#################################Quickstart###################################

ftvstt is not currently available through pip. Download the package and import it directly:

import ftvstt

#################################Usage######################################

Example of transcription through the services:

Vocapia Voxsigma:

    vocapiaTranscriber = ftvstt.Vocapia("https://rest1.vocapia.com:8093/voxsigma")
    vocapiaTranscriber.authenticate("EXAMPLE_ID", "EXAMPLE_PASS")
    transcript = vocapiaTranscriber.transcribe("/path/to/file.wav")
    vocapiaTranscriber.deauthenticate()

Bertin Mediaspeech:

    bertinTranscriber = ftvstt.Bertin("https://demo02.mediaspeech.com:4433/api")
    bertinTranscriber.authenticate("EXAMPLE_ID", "EXAMPLE_PASS")
    transcript = bertinTranscriber.transcribe("/path/to/file.wav")
    bertinTranscriber.deauthenticate()

Amazon Transcribe:

    amazonTranscriber = ftvstt.Amazon("AMAZON_S3_BUCKET_NAME")
    amazonTranscriber.authenticate("/path/to/amazon/credentials.csv")
    transcript = amazonTranscriber.transcribe("/path/to/file.wav")
    amazonTranscriber.deauthenticate()

In addition to Amazon Transcribe, you need an Amazon S3 bucket to run transcriptions.

Google Cloud speech-to-text:

    googleTranscriber = ftvstt.Google()
    googleTranscriber.authenticate("/path/to/google/credentials.json")
    transcript = googleTranscriber.transcribe("/path/to/file.wav")
    googleTranscriber.deauthenticate()
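Every provider above follows the same authenticate / transcribe / deauthenticate sequence. As a minimal sketch, that sequence can be wrapped in a context manager so that deauthenticate() is called even when transcription fails. The helper name authenticated is hypothetical and is not part of ftvstt; it only relies on the three method names shown in the examples above.

```python
from contextlib import contextmanager


@contextmanager
def authenticated(transcriber, *credentials):
    """Authenticate a transcriber, then always deauthenticate it on exit.

    Hypothetical helper, not part of ftvstt. It works with any object that
    exposes authenticate() and deauthenticate() as in the examples above.
    """
    transcriber.authenticate(*credentials)
    try:
        yield transcriber
    finally:
        # Runs even if transcribe() raises inside the with-block.
        transcriber.deauthenticate()


# Intended use with a real provider (shown as a comment, not executed here):
#
#     import ftvstt
#     with authenticated(ftvstt.Google(), "/path/to/google/credentials.json") as t:
#         transcript = t.transcribe("/path/to/file.wav")
```

The credentials are passed through unchanged, so the same helper covers providers that take an ID and password as well as those that take a credentials file path.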

################################Custom vocabulary file##################################

For every provider except Bertin, you can supply a custom vocabulary file of likely words, as shown:

    googleTranscriber = ftvstt.Google()
    googleTranscriber.authenticate("/path/to/google/credentials.json")
    googleTranscriber.set_vocabulary_file("/path/to/vocabulary/file.txt")
    transcript = googleTranscriber.transcribe("/path/to/file.wav")
    googleTranscriber.deauthenticate()

The vocabulary file should be of the form: word1 word2 word3 ...
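A vocabulary file can be generated from a list of words before it is passed to set_vocabulary_file. The sketch below is only an illustration: write_vocabulary_file is a hypothetical helper, and the separator between words is an assumption (one word per line by default), since the form shown above does not state it. Pass separator=" " if your provider setup expects space-separated words.

```python
from pathlib import Path


def write_vocabulary_file(words, path, separator="\n"):
    """Write a cleaned, de-duplicated word list to a vocabulary file.

    Hypothetical helper, not part of ftvstt. The default separator
    (one word per line) is an assumption about the expected format.
    """
    cleaned = []
    seen = set()
    for word in words:
        word = word.strip()
        # Skip empty entries and repeated words, keeping the first occurrence.
        if word and word not in seen:
            seen.add(word)
            cleaned.append(word)
    Path(path).write_text(separator.join(cleaned) + "\n", encoding="utf-8")
    return cleaned
```

The returned list contains the words that were actually written, which is convenient for logging what was sent to the provider.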

#################################Results handling######################################

Once a transcription is done, the transcribe method of a Transcriber returns a Transcript instance from the ftvstt.transcripts sub-module.

A Transcript instance, such as transcript in the preceding examples, has several useful attributes:

transcript.text: a string containing the textual transcript of the audio file.

transcript.words: a list of Word instances from the ftvstt.transcripts sub-module. Each one has a content (str), a startTime (float), an endTime (float), and a speaker (Speaker instance from the ftvstt.transcripts sub-module) attribute, and may have a confidence (float) attribute depending on the provider used.

transcript.speakers: a list of Speaker instances from the ftvstt.transcripts sub-module. Each one has an id (int) and may have a gender (str: "M" or "F") depending on the provider used.

transcript.raw: a string containing the raw transcription result received from the provider. Its format is given by transcript.rawType (str: "json" or "xml").
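As an illustration of how the word-level attributes can be used, the sketch below groups consecutive words by speaker into turns. The Word and Speaker dataclasses are stand-ins that only mirror the attribute names described above, so that the example runs without ftvstt; the function speaker_turns is hypothetical, and it assumes the words are listed in chronological order.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Speaker:
    """Stand-in mirroring the Speaker attributes described above."""
    id: int
    gender: Optional[str] = None


@dataclass
class Word:
    """Stand-in mirroring the Word attributes described above."""
    content: str
    startTime: float
    endTime: float
    speaker: Speaker
    confidence: Optional[float] = None


def speaker_turns(words):
    """Merge consecutive words from the same speaker into one turn.

    Assumes words are in chronological order (an assumption, not a
    documented guarantee of the library).
    """
    turns = []
    for word in words:
        if turns and turns[-1]["speaker"] == word.speaker.id:
            # Same speaker keeps talking: extend the current turn.
            turns[-1]["end"] = word.endTime
            turns[-1]["text"] += " " + word.content
        else:
            # Speaker changed (or first word): start a new turn.
            turns.append({
                "speaker": word.speaker.id,
                "start": word.startTime,
                "end": word.endTime,
                "text": word.content,
            })
    return turns
```

With a real result, the call would be speaker_turns(transcript.words); each turn is a plain dict with speaker, start, end, and text keys.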

#################################Error handling######################################

If an error occurs during transcription, a custom Python exception from the ftvstt.exceptions sub-module is raised. The error is also accessible through the exception attribute of the transcript result, as shown in this example:

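    googleTranscriber = ftvstt.Google()
    googleTranscriber.authenticate("/path/to/google/credentials.json")
    transcript = None
    try:
        transcript = googleTranscriber.transcribe("/path/to/file.wav")
    except Exception as error:
        # An exception from ftvstt.exceptions was raised during transcription.
        print(error)
    finally:
        googleTranscriber.deauthenticate()
    # If a transcript was returned, its exception attribute holds the error
    # (assumed here to be None when no error occurred).
    if transcript is not None and transcript.exception:
        raise transcript.exception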
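Because every provider exposes the same transcribe call, one way to handle errors is to fall back to another provider when the first one fails. The following is a minimal sketch under that assumption: first_successful is a hypothetical helper, not part of ftvstt, and it expects transcribers that have already been authenticated. It catches Exception broadly because the specific class names in ftvstt.exceptions are not listed here.

```python
def first_successful(transcribers, audio_path):
    """Try each transcriber in order and return the first transcript.

    Hypothetical helper, not part of ftvstt. Returns a tuple
    (transcript, errors), where errors lists (provider class name, exception)
    for every provider that failed. transcript is None if all of them failed.
    """
    errors = []
    for transcriber in transcribers:
        try:
            return transcriber.transcribe(audio_path), errors
        except Exception as error:
            # Record the failure and move on to the next provider.
            errors.append((type(transcriber).__name__, error))
    return None, errors
```

The collected errors keep the original exception objects, so they can be logged or re-raised once every provider has been tried.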

#################################Testing######################################

Coming soon.

Project details

Release history

2.0.0 (Jan 20, 2020)
1.2.5 (Dec 12, 2019)
1.2.4 (Dec 9, 2019)
1.2.3 (Dec 2, 2019)
1.2.2 (Dec 2, 2019)
1.2.1 (Nov 21, 2019)
1.2.0 (Nov 21, 2019)
1.1.10 (Nov 15, 2019)
1.1.9 (Nov 14, 2019)
1.1.8 (Nov 14, 2019)
0.0.0 (Nov 14, 2019, this version)

Download files

Source Distribution

ftvstt-1.0.57.tar.gz (9.1 kB)

Uploaded Nov 14, 2019 Source

Built Distribution

ftvstt-1.0.57-py3-none-any.whl (11.4 kB)

Uploaded Nov 14, 2019 Python 3

Hashes for ftvstt-1.0.57.tar.gz
Algorithm	Hash digest
SHA256	`06ec9ee3dc9cc2e4bceb2d255b7045add2bd826ebe8271f4defc5c354dc2649c`
MD5	`f0533a096b4989248affe21e83eb38d2`
BLAKE2b-256	`9c49bdee5b98be705cf9ad0dfee9c7a771cdded7f3b297d0d037edc7e6655d7c`

Hashes for ftvstt-1.0.57-py3-none-any.whl
Algorithm	Hash digest
SHA256	`2e4e0d4e9e091b21ab899da34f5858f24ca04c67c0dde7beab24878a85983396`
MD5	`c73de72d74c293965a22ee2a42af2c12`
BLAKE2b-256	`acfd7b0e5f431d0e7c192ec26a980f52c34b8e5ce02c92d0965573526a2a18eb`