Skip to main content

A wrapper for transcription results.

Project description

Transcriptor Logo

Transcriptor

A transcription service wrapper that makes it easier to work with transcriptions.

Transcription services provide a cost-effective way to add accessibility to your audio and video, but they are often a challenge to integrate into your system.

Transcriptor aims to make working with transcriptions easier.

Transcriptor looks at transcription objects as a class.

Features:

  • An object-oriented approach to Transcriptions, Markers, and Speakers
  • Nondestructive manipulation of text and references.

Installation

Install transcriptor using pip.

pip install transcriptor

Quickstart

Transcriptor currently supports transcription upload and creating a READABLE Transcription Job Object from their output.

Importing Transcriptions from AWS Transcribe

from transcriptor import amazon, AmazonJob

job = AmazonJob(filepath='<filepath>', bucket='<bucket_name>', )
job.start()
# After Some Time
job.status() # If it says COMPLETED
job.build()

Alternatively, you can load the object via the TranscriptFileUri (AmazonJob.from_uri()) or the JSON object directly (AmazonJob.from_json()).

A loaded Transcription Job from AWS Transcribe will give you access to Markers, Speakers (if included), the provided Alternatives. You also have the original job object that you can interact with.

Importing from an SubRip Subtitle (SRT)

Text Transcriptions from the Web can be very unique in style, but the most common format is that of at srt file.

You can load an srt file into transcriptor and use that to interact with the individual markers.

from transcriptor import Job

amazon.from_srt('FILENAME.srt')

A loaded Transcription Job from srt files will make Markers.

These markers are given in order as a list. This gives you the ability to modify a single Marker or iterate your changes across a range or all of the Marker objects.

Supported Services

  • Amazon Transcribe (boto3)

Planned Support For

  • Google Speech-To-Text

Sponsors

This and much of the work that I do is made possible by those that sponsor me on github.

Sponsors at the $20/month and higher Level

Thank you to them and all of those that continue to support this project!

Sponsor this Project

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

transcriptor-2020.8.8.tar.gz (8.4 kB view details)

Uploaded Source

File details

Details for the file transcriptor-2020.8.8.tar.gz.

File metadata

  • Download URL: transcriptor-2020.8.8.tar.gz
  • Upload date:
  • Size: 8.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.23.0 setuptools/47.1.0 requests-toolbelt/0.9.1 tqdm/4.48.2 CPython/3.8.5

File hashes

Hashes for transcriptor-2020.8.8.tar.gz
Algorithm Hash digest
SHA256 3e943e2231e5b0a42ec0d64059a75e75259075da06e6e32ee600d97e68b4be0b
MD5 58fffb8f10050e18ba178b8ed35a393b
BLAKE2b-256 a69a717491cec24b3b8ea607ec91db6ce91ec474fec9d8a75513f2dfef6a7266

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page