A wrapper for transcription results.

Transcriptor

A transcription service wrapper that makes it easier to work with transcriptions.

Transcription services provide a cost-effective way to add accessibility to your audio and video, but they are often a challenge to integrate into your system.

Transcriptor aims to make working with transcriptions easier.

Transcriptor models each transcription as an object, with its own class.

Features:

  • An object-oriented approach to Transcriptions, Markers, and Speakers
  • Nondestructive manipulation of text and references

Installation

Install transcriptor using pip.

pip install transcriptor

Quickstart

Transcriptor does not currently support automated transcription upload, but it can create a readable Transcription Job object from a transcription service's output.

Importing Transcriptions from AWS Transcribe

Transcriptor is designed to wrap around AWS Transcribe. If you have configured your environment with your AWS information, you can convert AWS Transcribe jobs directly to Transcriptor Jobs with the amazon.from_job() method.

from transcriptor import amazon

amazon.from_job(job="<TranscriptionJobName>")

Alternatively, you can load the object via the TranscriptFileUri (amazon.from_uri()) or the JSON object directly (amazon.from_json()).

A Transcription Job loaded from AWS Transcribe gives you access to its Markers, its Speakers (if included), and the provided Alternatives. You can also interact with the original job object.
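For orientation, here is a minimal sketch of the kind of JSON document that AWS Transcribe produces for a job, which is the input amazon.from_json() is described as accepting. It uses only the standard library and does not call Transcriptor. The sample values are invented, words_with_timing is a hypothetical helper name, and the field names reflect the AWS Transcribe output format as I understand it, so check them against a real job before relying on them.

```python
import json

# Invented sample shaped like AWS Transcribe output (all values are made up).
SAMPLE = """
{
  "jobName": "example-job",
  "results": {
    "transcripts": [{"transcript": "Hello world."}],
    "items": [
      {"start_time": "0.04", "end_time": "0.45", "type": "pronunciation",
       "alternatives": [{"confidence": "0.99", "content": "Hello"}]},
      {"start_time": "0.45", "end_time": "0.91", "type": "pronunciation",
       "alternatives": [{"confidence": "0.97", "content": "world"}]},
      {"type": "punctuation",
       "alternatives": [{"confidence": "0.0", "content": "."}]}
    ]
  },
  "status": "COMPLETED"
}
"""


def words_with_timing(transcribe_json):
    """Return (start, end, word) tuples for spoken items, skipping punctuation."""
    words = []
    for item in transcribe_json["results"]["items"]:
        if item["type"] != "pronunciation":
            continue  # punctuation items carry no start_time or end_time
        best = item["alternatives"][0]  # this sketch only reads the first alternative
        words.append(
            (float(item["start_time"]), float(item["end_time"]), best["content"])
        )
    return words


data = json.loads(SAMPLE)
full_text = data["results"]["transcripts"][0]["transcript"]
words = words_with_timing(data)
```

Each timed item is roughly what a Marker has to represent: a span of time, the text spoken in it, and the alternatives the service offered for that text.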

Importing from a SubRip Subtitle (SRT) File

Text transcriptions from the web vary widely in style, but the most common format is the srt file.

You can load an srt file into transcriptor and use that to interact with the individual markers.

from transcriptor import Job

# Assumption: from_srt is available on Job, as the import suggests.
job = Job.from_srt('FILENAME.srt')

A Transcription Job loaded from an srt file creates Markers.

These Markers are given in order as a list, so you can modify a single Marker or apply your changes across a range or all of the Marker objects.
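To illustrate the idea without depending on Transcriptor's internals, here is a minimal, standard-library sketch that turns srt text into an ordered list of markers and then changes one marker and a range of markers without altering the originals. SimpleMarker, parse_srt, and shift are hypothetical names invented for this example; they are not Transcriptor's classes or functions, and the real Marker object may expose different attributes.

```python
import re
from dataclasses import dataclass, replace
from datetime import timedelta

SRT_TEXT = """1
00:00:01,000 --> 00:00:03,500
Hello and welcome.

2
00:00:04,000 --> 00:00:06,250
Today we talk about
transcription.
"""

TIMESTAMP = re.compile(r"(\d+):(\d{2}):(\d{2})[,.](\d{3})")


@dataclass(frozen=True)
class SimpleMarker:
    """Hypothetical stand-in for a Marker; not Transcriptor's class."""
    start: timedelta
    end: timedelta
    text: str


def to_timedelta(stamp):
    """Convert an srt timestamp such as 00:00:01,000 to a timedelta."""
    hours, minutes, seconds, millis = map(
        int, TIMESTAMP.fullmatch(stamp.strip()).groups()
    )
    return timedelta(
        hours=hours, minutes=minutes, seconds=seconds, milliseconds=millis
    )


def parse_srt(text):
    """Parse srt text into a list of SimpleMarker objects, in file order."""
    markers = []
    for block in re.split(r"\n\s*\n", text.strip()):
        lines = block.splitlines()
        # lines[0] is the cue index, lines[1] the time range, the rest is text
        start, end = lines[1].split("-->")
        markers.append(
            SimpleMarker(to_timedelta(start), to_timedelta(end), " ".join(lines[2:]))
        )
    return markers


def shift(markers, offset):
    """Return new markers moved by offset; the input list is left untouched."""
    return [replace(m, start=m.start + offset, end=m.end + offset) for m in markers]


markers = parse_srt(SRT_TEXT)
# Change a single marker by building a corrected copy of it.
first_fixed = replace(markers[0], text="Hello, and welcome!")
# Apply a change across a range (here, every marker from index 1 onward).
shifted_tail = shift(markers[1:], timedelta(seconds=2))
```

Using frozen dataclasses and returning copies is one way to keep edits nondestructive: the parsed list always reflects the source file, and every change lives in a new object.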

Supported Services

  • Amazon Transcribe (boto3)

Planned Support For

  • Google Speech-To-Text

Sponsors

This project, and much of the work that I do, is made possible by those who sponsor me on GitHub.

Sponsors at the $20/month and higher Level

Thank you to them and all of those that continue to support this project!

Sponsor this Project

Download files

Source Distribution

transcriptor-2020.7.3.tar.gz (6.9 kB)
