framework for synchronous batch speech-to-text transcription using backends like AWS, Watson, etc.
Project description
py-transcribe-aws
AWS Transcribe implementation of py-transcribe
Python Installation
pip install --user -e git+https://github.com/ictlearningsciences/py-transcribe-aws.git@{release-tag}#egg=transcribe
With pip
pip install git+https://github.com/ictlearningsciences/py-transcribe-aws.git@{release-tag}#egg=transcribe_aws
...in a pip requirements.txt
use this syntax:
transcribe_aws @ git+https://github.com/ictlearningsciences/py-transcribe-aws.git@{release-tag}#egg=transcribe_aws
Usage
Setting the implementation module path
Set ENV var TRANSCRIBE_MODULE_PATH
, e.g.
export TRANSCRIBE_MODULE_PATH=transcribe_aws
or pass the module path at service-creation time, e.g.
from transcribe import init_transcription_service
service = init_transcription_service(
module_path="transcribe_aws"
)
Basic usage
Your code generally should not need to access any of the implementations in this module directly. See py-transcribe for docs on usage of the framework.
ENV/config vars
The following config vars can be set in ENV or passed in code, e.g. init_transcription_service(config={})
AWS_REGION
(required)
The region hosting the S3 bucket to which source audio (or video) files will be uploaded for transcription
AWS_ACCESS_KEY_ID
(required)
AWS_SECRET_ACCESS_KEY
(required)
TRANSCRIBE_AWS_S3_BUCKET_SOURCE
(required)
Bucket where source will be uploaded and then passed to AWS Transcribe
AWS Permissions
The AWS IAM used must have permissions to read/write/delete from the configured source bucket and also use AWS Transcribe
TODO: give exact details on minimum permissions/policies.
Development
Run tests during development with
make test-all
Once ready to release, create a release tag, currently using semver-ish numbering, e.g. 1.0.0(-alpha.1)
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for py_transcribe_aws-1.3.0a3.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | f078126fc35357e88b1fda9d89dd3836a6917683605ea81f35c67e5efe2ac848 |
|
MD5 | 6176285f9c82771faab624473cdea4e0 |
|
BLAKE2b-256 | a5c2a100bbb16a6bf4d951b810d9027f5ed5ca51b96851c9e97d92b6cb85f933 |
Hashes for py_transcribe_aws-1.3.0a3-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | abaa1504e4af3ec591826608963d171049b6bec6e1396702e3368cf68c72d719 |
|
MD5 | 13a9d93fa73433b2d9febd7c7f627127 |
|
BLAKE2b-256 | ee1cad76ee103bc5ea7b6645a049fc04936721a67ae70fb4b6a787eb5b18f025 |