Skip to main content

Tatt creates a uniform API for multiple speech-to-text (STT) services.

Project description

Transcribe All The Things™

tatt is a CLI for creating and managing speech-to-text transcripts.

demo

Installation

pip install tatt

Dependencies

  1. A recording to transcribe.
  2. a) An AWS account or b) a Google Cloud account with the speech-to-text API and Cloud Storage enabled.
  3. ffmpeg for some operations, but tatt will let you know when it's needed.

Usage

List All Commands

$ transcribe --help

Usage: transcribe [OPTIONS] COMMAND [ARGS]...

Options:
  --help  Show this message and exit.

Commands:
  get       Downloads and/or saves completed transcript.
  list      Lists available STT services.
  services  Lists available speech-to-text services.
  status    Check the status of a transcription job.
  this      Sends a media file to be transcribed.

List All STT Services

$ transcribe services

Here are all the available speech-to-text services:

  amazon		$0.006 per 15 seconds
  google		$0.009 per 15 seconds

Get Something Transcribed

$ transcribe this <path_to_media_file> <service_name>

Okay, transcribing <path_to_media_file> using <service_name>...
Okay, job <job_name> is being transcribed.  Use "get" command to download it.

List Transcripts

$ transcribe list

Service Job Name                                Status
------- --------                                ------
amazon  tester.mp3                              IN_PROGRESS
amazon  messed_up.mp4                           FAILED
amazon  done_test.mp3                           COMPLETED
amazon  also_done.MP3                           COMPLETED
google  hey_there.mp3                           COMPLETED


$ transcribe list <job_name>

Service Job Name                                Status
------- --------                                ------
amazon  <job_name>                              IN_PROGRESS

Get A Completed Transcript

$ transcribe get <job_name> # prints to stdout

{'accountId': '416321668733',
 'jobName': 'a1bace5e-8b08-4ce4-b08c-834a23aafcf1',
 'results': {'items': [{'alternatives': [{'confidence': '0.9774',
                                          'content': 'Hi'}],
                        'end_time': '1.5',
                        'start_time': '1.23',
                        'type': 'pronunciation'},
                       {'alternatives': [{'confidence': '0.9429',
                                          'content': 'is'}],
                        'end_time': '1.71',
                        'start_time': '1.5',
                        'type': 'pronunciation'},
                       ...

$ transcribe get --save <job_name>

Saved transcript to <job_name>.json

Services Supported

Planned

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tatt-0.977.tar.gz (10.7 kB view hashes)

Uploaded Source

Built Distribution

tatt-0.977-py3-none-any.whl (25.8 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page