A wrapper for transcription results.
Project description
Transcriptor
A transcription service wrapper that makes it easier to work with transcriptions.
Transcription services provide a cost-effective way to add accessibility to your audio and video, but they are often a challenge to integrate into your system.
Transcriptor aims to make working with transcriptions easier.
Transcriptor looks at transcription objects as a class.
Features:
- An object-oriented approach to Transcriptions, Markers, and Speakers
- Nondestructive manipulation of text and references.
Installation
Install transcriptor using pip.
pip install transcriptor
Quickstart
Transcriptor currently supports transcription upload and creating a READABLE Transcription Job Object from their output.
Importing Transcriptions from AWS Transcribe
from transcriptor import amazon, AmazonJob
job = AmazonJob(filepath='<filepath>', bucket='<bucket_name>', )
job.start()
# After Some Time
job.status() # If it says COMPLETED
job.build()
Alternatively, you can load the object via the TranscriptFileUri
(AmazonJob.from_uri()
) or the JSON object directly (AmazonJob.from_json()
).
A loaded Transcription Job from AWS Transcribe will give you access to Markers
, Speakers
(if included), the provided Alternatives
. You also have the original job
object that you can interact with.
Importing from an SubRip Subtitle (SRT)
Text Transcriptions from the Web can be very unique in style, but the most common format is that of at srt file.
You can load an srt file into transcriptor and use that to interact with the individual markers.
from transcriptor import Job
amazon.from_srt('FILENAME.srt')
A loaded Transcription Job from srt files will make Markers
.
These markers are given in order as a list. This gives you the ability to modify a single Marker or iterate your changes across a range or all of the Marker objects.
Supported Services
- Amazon Transcribe (boto3)
Planned Support For
- Google Speech-To-Text
Sponsors
This and much of the work that I do is made possible by those that sponsor me on github.
Sponsors at the $20/month and higher Level
Thank you to them and all of those that continue to support this project!
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
File details
Details for the file transcriptor-2020.8.8.tar.gz
.
File metadata
- Download URL: transcriptor-2020.8.8.tar.gz
- Upload date:
- Size: 8.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.23.0 setuptools/47.1.0 requests-toolbelt/0.9.1 tqdm/4.48.2 CPython/3.8.5
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 3e943e2231e5b0a42ec0d64059a75e75259075da06e6e32ee600d97e68b4be0b |
|
MD5 | 58fffb8f10050e18ba178b8ed35a393b |
|
BLAKE2b-256 | a69a717491cec24b3b8ea607ec91db6ce91ec474fec9d8a75513f2dfef6a7266 |