Rev AI makes speech applications easy to build!

These details have not been verified by PyPI

Project links

Homepage

Intended Audience
- Developers
License
- OSI Approved :: MIT License
Natural Language
- English
Programming Language

Project description

Rev AI Python SDK

Documentation

See the API docs for more information about the API and more Python examples.

Installation

You don't need this source code unless you want to modify the package. If you only want to use the package, run:

pip install --upgrade rev_ai

Install from source with:

python setup.py install

Requirements

Python 3.8+

Usage

All you need to get started is your Access Token, which can be generated on your Access Token Settings Page. Create a client with the generated Access Token:

from rev_ai import apiclient, RevAiApiDeploymentConfigMap, RevAiApiDeployment

# create your client
# optionally configure the Rev AI deployment to use
client = apiclient.RevAiAPIClient("ACCESS TOKEN", url=RevAiApiDeploymentConfigMap[RevAiApiDeployment.US]['base_url'])
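
To keep the token out of source control, it can be read from the environment before the client is created. The following is a minimal sketch, not part of the SDK: the variable name REVAI_ACCESS_TOKEN and the helper load_access_token are hypothetical names chosen for illustration.

```python
import os

# Hypothetical variable name; the SDK itself does not read it.
TOKEN_ENV_VAR = "REVAI_ACCESS_TOKEN"


def load_access_token(environ=os.environ, name=TOKEN_ENV_VAR):
    """Return the access token stored under `name`, or raise a clear error."""
    token = environ.get(name, "").strip()
    if not token:
        raise RuntimeError(
            f"Set the {name} environment variable to your Rev AI access token"
        )
    return token
```

The returned string can then be passed to the client constructor in place of the "ACCESS TOKEN" placeholder.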

Sending a file

Once you've set up your client with your Access Token, sending a file is easy!

# you can send a local file
job = client.submit_job_local_file("FILE PATH")

# or send a link to the file you want transcribed
job = client.submit_job_url("https://example.com/file-to-transcribe.mp3")

job will contain all the information normally found in a successful response from our Submit Job endpoint.

If you want to get fancy, both submit job methods take metadata, notification_config, skip_diarization, skip_punctuation, speaker_channels_count, custom_vocabularies, filter_profanity, remove_disfluencies, delete_after_seconds, language, and custom_vocabulary_id as optional parameters.

The url submission option also supports authentication headers by using the source_config option.

You can request a transcript summary.

# submitting a job with a bulleted transcript summary
# (SummarizationOptions and SummarizationFormattingOptions are SDK model classes and must be imported first)
job = client.submit_job_url("https://example.com/file-to-transcribe.mp3",
    language='en',
    summarization_config=SummarizationOptions(
        formatting_type=SummarizationFormattingOptions.BULLETS
    ))

You can request transcript translation into up to five languages.

# TranslationOptions, TranslationLanguageOptions, and TranslationModel are SDK model classes and must be imported first
job = client.submit_job_url("https://example.com/file-to-transcribe.mp3",
    language='en',
    translation_config=TranslationOptions(
        target_languages=[
            TranslationLanguageOptions("es", TranslationModel.PREMIUM),
            TranslationLanguageOptions("de")
        ]
    ))

All options are described in the request body of the Submit Job endpoint.

Human Transcription

If you want transcription to be performed by a human, both methods allow you to submit human transcription jobs using transcriber=human with verbatim, rush, segments_to_transcribe and test_mode as optional parameters. Check out our documentation for Human Transcription for more details.

# submitting a human transcription job
job = client.submit_job_url("https://example.com/file-to-transcribe.mp3",
    transcriber='human',
    verbatim=False,
    rush=False,
    test_mode=True,
    segments_to_transcribe=[{
        'start': 2.0,
        'end': 4.5
    }])
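
When there are several segments, the segments_to_transcribe list can be built from plain (start, end) pairs. This is only a sketch: build_segments is a hypothetical helper, and its checks (non-negative start, end after start) are assumptions made for illustration rather than rules stated by the API.

```python
def build_segments(ranges):
    """Turn (start, end) pairs in seconds into a list of start/end dicts."""
    segments = []
    for start, end in ranges:
        # Assumed sanity checks; the API may apply different rules.
        if start < 0 or end <= start:
            raise ValueError(f"invalid segment: start={start}, end={end}")
        segments.append({"start": float(start), "end": float(end)})
    return segments


segments = build_segments([(2.0, 4.5), (10, 12.25)])
```

The resulting list has the same shape as the value passed to segments_to_transcribe above.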

Checking your file's status

You can check the status of your transcription job using its id:

job_details = client.get_job_details(job.id)

job_details will contain all information normally found in a successful response from our Get Job endpoint.
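
Because transcription takes time, the status usually has to be checked more than once. One way to do that is a small polling loop. The sketch below is not part of the SDK: wait_for_job, its parameters, and its default intervals are hypothetical, and it takes the lookup function and the "finished" test as arguments so that it does not assume any particular status values.

```python
import time


def wait_for_job(get_details, job_id, is_finished,
                 poll_seconds=5.0, timeout_seconds=600.0, sleep=time.sleep):
    """Call get_details(job_id) until is_finished(details) is true.

    Raises TimeoutError once roughly timeout_seconds have been spent waiting.
    """
    waited = 0.0
    while True:
        details = get_details(job_id)
        if is_finished(details):
            return details
        if waited >= timeout_seconds:
            raise TimeoutError(f"job {job_id} not finished after {waited:.0f}s")
        sleep(poll_seconds)
        waited += poll_seconds
```

With the SDK, get_details could be client.get_job_details, and is_finished a function that inspects the status reported in the returned job details. Which statuses count as finished should be taken from the Get Job documentation.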

Checking multiple files

You can retrieve a list of transcription jobs with optional parameters:

jobs = client.get_list_of_jobs()

# limit the number of retrieved jobs
jobs = client.get_list_of_jobs(limit=3)

# get jobs starting after a certain job id
jobs = client.get_list_of_jobs(starting_after='Umx5c6F7pH7r')

jobs will contain a list of job details having all information normally found in a successful response from our Get List of Jobs endpoint.
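
The page size and starting_after options can be combined to walk through all jobs one page at a time. The following sketch assumes that each returned job exposes an id attribute (as job.id is used elsewhere in this document) and that the last id on a page is a valid cursor for the next page. iter_all_jobs and fetch_page are hypothetical names, not SDK functions.

```python
def iter_all_jobs(fetch_page, page_size=100):
    """Yield every job, fetching one page after another.

    fetch_page(page_size, starting_after) must return a list of jobs;
    starting_after is None for the first page.
    """
    last_id = None
    while True:
        page = fetch_page(page_size, last_id)
        if not page:
            return
        yield from page
        if len(page) < page_size:
            # A short page means there is nothing left to fetch.
            return
        last_id = page[-1].id
```

Here fetch_page would be a small wrapper around client.get_list_of_jobs that forwards the page size and the starting_after id.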

Deleting a job

You can delete a transcription job using its id:

client.delete_job(job.id)

All data related to the job, such as input media and transcript, will be permanently deleted. A job can only be deleted once it has completed (either with success or failure).

Getting your transcript

Once your file is transcribed, you can get your transcript in a few different forms:

# as text
transcript_text = client.get_transcript_text(job.id)

# as json
transcript_json = client.get_transcript_json(job.id)

# or as a python object
transcript_object = client.get_transcript_object(job.id)

# or if you requested transcript translation(s)
transcript_object = client.get_translated_transcript_object(job.id,'es')

Both the json and object forms contain all the information outlined in the response of the Get Transcript endpoint when using the json response schema, while the text output is a string containing just the text of your transcript.
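
As an illustration of working with the json form, the sketch below rebuilds one line of text per speaker turn. It assumes the transcript is a dictionary with a "monologues" list, where each monologue has a "speaker" and a list of "elements" carrying a "value". That layout is an assumption about the Get Transcript json schema and should be checked against the API docs. transcript_to_lines and the sample data are hypothetical.

```python
def transcript_to_lines(transcript):
    """Return one 'Speaker N: text' line per monologue in the transcript."""
    lines = []
    for monologue in transcript.get("monologues", []):
        # Text and punctuation elements are joined in order.
        text = "".join(
            element.get("value", "") for element in monologue.get("elements", [])
        )
        lines.append(f"Speaker {monologue.get('speaker')}: {text.strip()}")
    return lines


# Hand-written sample in the assumed layout, not real API output.
sample = {
    "monologues": [
        {
            "speaker": 0,
            "elements": [
                {"type": "text", "value": "Hello"},
                {"type": "punct", "value": ", "},
                {"type": "text", "value": "world"},
                {"type": "punct", "value": "."},
            ],
        }
    ]
}
lines = transcript_to_lines(sample)
```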

Getting transcript summary

If you requested a transcript summary, you can retrieve it as plain text or as a structured object:

# as text
summary = client.get_transcript_summary_text(job.id)

# as json
summary = client.get_transcript_summary_json(job.id)

# or as a python object
summary = client.get_transcript_summary_object(job.id)

Getting captions output

You can also get captions output from the SDK. We offer both SRT and VTT caption formats. If you submitted your job as speaker channel audio then you must also provide a channel_id to be captioned:

captions = client.get_captions(job.id, content_type=CaptionType.SRT, channel_id=None)

# or if you requested transcript translation(s)
captions = client.get_translated_captions(job.id, 'es')

Streamed outputs

Any output format can be retrieved as a stream. In these cases we return the raw HTTP response to you. The output can be retrieved via response.content, response.iter_lines() or response.iter_content().

text_stream = client.get_transcript_text_as_stream(job.id)

json_stream = client.get_transcript_json_as_stream(job.id)

captions_stream = client.get_captions_as_stream(job.id)
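
For large outputs, the streamed response can be written out chunk by chunk instead of being held in memory. The sketch below relies only on the response.iter_content() method mentioned above. copy_stream and its chunk size are hypothetical choices for illustration.

```python
def copy_stream(response, out_file, chunk_size=8192):
    """Write a streamed response to a binary file object; return bytes written."""
    written = 0
    for chunk in response.iter_content(chunk_size=chunk_size):
        if chunk:  # skip empty chunks
            out_file.write(chunk)
            written += len(chunk)
    return written
```

For example, out_file could be a file opened with open("transcript.txt", "wb") and response the value returned by client.get_transcript_text_as_stream(job.id).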

Streaming audio

In order to stream audio, you will need to set up a streaming client and a media configuration for the audio you will be sending.

from rev_ai.streamingclient import RevAiStreamingClient
from rev_ai.models import MediaConfig, RevAiApiDeploymentConfigMap, RevAiApiDeployment

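# expected callback signatures:
#   on_error(error)
#   on_close(code, reason)
#   on_connected(id)

# ERRORFUNC, CLOSEFUNC, and CONNECTEDFUNC below are placeholders for your own callbacks
# the url argument optionally configures the Rev AI deployment to use
config = MediaConfig()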
streaming_client = RevAiStreamingClient("ACCESS TOKEN",
                                        config,
                                        on_error=ERRORFUNC,
                                        on_close=CLOSEFUNC,
                                        on_connected=CONNECTEDFUNC,
                                        url=RevAiApiDeploymentConfigMap[RevAiApiDeployment.US]['base_websocket_url'])

on_error, on_close, and on_connected are optional parameters that are functions to be called when the websocket errors, closes, and connects respectively. The default on_error raises the error, on_close prints out the code and reason for closing, and on_connected prints out the job ID. If passing in custom functions, make sure you provide the right parameters. See the sample code for the parameters.
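
As a sketch of what custom callbacks might look like, the functions below take the parameters listed above (error; code and reason; id) and write to a logger instead of raising or printing. The logger name and the message formats are hypothetical.

```python
import logging

logger = logging.getLogger("revai.streaming")  # hypothetical logger name


def on_error(error):
    """Log the websocket error instead of raising it."""
    logger.error("streaming error: %s", error)


def on_close(code, reason):
    """Log the close code and reason."""
    logger.info("connection closed: code=%s reason=%s", code, reason)


def on_connected(job_id):
    """Log the job id reported when the connection is established."""
    logger.info("connected, job id %s", job_id)
```

These would be passed as the on_error, on_close, and on_connected arguments when constructing the streaming client. Note that, unlike the default on_error, this version does not raise, so errors have to be noticed through the log.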

Once you have a streaming client set up with a MediaConfig and access token, you can obtain a transcription generator of your audio. You can also use a custom vocabulary with your streaming job by supplying the optional custom_vocabulary_id when starting a connection.

More optional parameters can be supplied when starting a connection: metadata, filter_profanity, remove_disfluencies, delete_after_seconds, and detailed_partials. For a description of these optional parameters, see our streaming documentation.

response_generator = streaming_client.start(AUDIO_GENERATOR, custom_vocabulary_id="CUSTOM VOCAB ID")

response_generator is a generator object that yields the transcription results of the audio including partial and final transcriptions. The start method creates a thread sending audio pieces from the AUDIO_GENERATOR to our streaming endpoint.
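
AUDIO_GENERATOR is a placeholder for your own source of audio pieces. One possible shape for it is a generator that reads a binary stream in fixed-size chunks. This is a sketch under the assumption that each piece is a bytes object whose format matches your MediaConfig. audio_chunks and the chunk size are hypothetical.

```python
def audio_chunks(stream, chunk_size=8000):
    """Yield successive chunks of bytes from a binary file-like object."""
    while True:
        chunk = stream.read(chunk_size)
        if not chunk:
            return
        yield chunk
```

For instance, stream could be a file opened with open("audio.raw", "rb"), and audio_chunks(stream) passed as the first argument to streaming_client.start.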

If you want to end the connection early, you can!

streaming_client.end()

Otherwise, the connection will end when the server obtains an "EOS" message.

Submitting custom vocabularies

In addition to passing custom vocabularies as parameters in the async API client, you can create and submit your custom vocabularies independently and directly to the custom vocabularies API, as well as check on their progress.

Primarily, the custom vocabularies client allows you to submit and preprocess vocabularies for use with the streaming client, in order to have streaming jobs with custom vocabularies!

In this example you see how to construct custom vocabulary objects, submit them to the API, and check on their progress and metadata!

from rev_ai import custom_vocabularies_client
from rev_ai.models import CustomVocabulary

# Create a client
client = custom_vocabularies_client.RevAiCustomVocabulariesClient("ACCESS TOKEN")

# Construct a CustomVocabulary object using your desired phrases
custom_vocabulary = CustomVocabulary(["Patrick Henry Winston", "Robert C Berwick", "Noam Chomsky"])

# Submit the CustomVocabulary
custom_vocabularies_job = client.submit_custom_vocabularies([custom_vocabulary])

# View the job's progress
job_state = client.get_custom_vocabularies_information(custom_vocabularies_job['id'])

# Get list of previously submitted custom vocabularies
custom_vocabularies_jobs = client.get_list_of_custom_vocabularies()

# Delete the CustomVocabulary
client.delete_custom_vocabulary(custom_vocabularies_job['id'])

For more details, check out the custom vocabularies example in our examples.

For Rev AI Python SDK Developers

Remember in your development to follow the PEP8 style guide. Your code editor likely has Python PEP8 linting packages which can assist you in your development.

Local testing instructions

Prerequisites: virtualenv, tox

To test locally use the following commands from the repo root

virtualenv ./sdk-test
. ./sdk-test/bin/activate
tox

This runs the test suite locally and saves significant dev time over waiting for the CI tool to pick it up.

History

0.0.0 (2018-09-28)

Initial alpha release

2.1.0

Revamped official release

2.1.1

File upload bug fixes

2.2.1

Better Documentation

2.2.2

Fix pypi readme formatting

2.3.0

Add get_list_of_jobs

2.4.0

Add support for custom vocabularies

2.5.0

Add examples
Improve error handling
Add streaming client

2.6.0

Support skip_punctuation
Support .vtt captions output
Support speaker channel jobs

2.6.1

Add metadata to streaming client

2.7.0

Add custom vocabularies to streaming client

2.7.1

Use v1 of the streaming api
Add custom vocabulary to async example
Add filter_profanity to async and streaming clients, examples, and documentation
Add remove_disfluencies to async client

2.11.0

Add language selection option for multi-lingual ASR jobs to async client

2.12.0

Add custom_vocabulary_id to async client

2.13.0

Add detailed_partials to streaming client
Switch to Github Actions for automated testing

2.14.0

Add transcriber to async client
Add verbatim, rush, segments_to_transcribe, test_mode to async client for human transcription
Add start_ts and transcriber to streaming client

2.15.0

Add topic extraction client
Add speaker_names to async client for human transcription

2.16.0

Add sentiment analysis client
Add source_config and notification_config job options to support customer provided urls with authentication headers
Deprecate media_url option, replace with source_config
Deprecate callback_url option, replace with notification_config

2.17.0

Add language to the streaming client

2.18.0

Add atmospherics and speaker_count support
Deprecated support for Python versions up to 3.8

2.19.0

Add async translation and summarization

Release history

This version

2.21.0

Nov 27, 2024

2.20.0

Oct 29, 2024

2.19.5

Jun 17, 2024

2.19.4

Jan 4, 2024

2.19.3

Jan 4, 2024

2.19.2

Jan 3, 2024

2.18.0

Aug 22, 2023

2.17.1

Aug 2, 2022

2.17.0

May 19, 2022

2.16.0

May 13, 2022

2.15.0

Apr 22, 2022

2.14.0

Jan 31, 2022

2.13.0

Oct 12, 2021

2.12.0

Mar 9, 2021

2.11.0

Jan 19, 2021

2.10.1

Jan 19, 2021

2.10.0

Sep 15, 2020

2.9.0

Jul 22, 2020

2.8.0

Mar 17, 2020

2.7.0

Nov 18, 2019

2.6.1

Oct 7, 2019

2.6.0

Aug 27, 2019

2.5.0

Jun 25, 2019

2.4.0

Mar 14, 2019

2.3.0

Mar 5, 2019

2.2.2

Feb 25, 2019

2.2.1

Feb 13, 2019

2.1.1

Feb 5, 2019

2.1.0

Feb 1, 2019

2.0.0

Jan 31, 2019

0.0.3

Oct 20, 2018

0.0.2

Oct 9, 2018

0.0.1

Oct 9, 2018

0.0.0

Oct 9, 2018

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

rev_ai-2.21.0.tar.gz (45.8 kB view details)

Uploaded Nov 27, 2024 Source

Built Distribution

rev_ai-2.21.0-py2.py3-none-any.whl (44.7 kB view details)

Uploaded Nov 27, 2024 Python 2, Python 3

File details

Details for the file rev_ai-2.21.0.tar.gz.

File metadata

Download URL: rev_ai-2.21.0.tar.gz
Upload date: Nov 27, 2024
Size: 45.8 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/5.1.1 CPython/3.13.0

File hashes

Hashes for rev_ai-2.21.0.tar.gz
Algorithm	Hash digest
SHA256	`fd30cc4d7047d959e66de627f288363daee71372ea3d4177e317f029a223228d`
MD5	`8b543d5caf3be0fc34c407428a0df76d`
BLAKE2b-256	`0cc4bd180849f736cd105e350af90706daf1af90b1cea2e85b49adfbde38c85a`

See more details on using hashes here.

File details

Details for the file rev_ai-2.21.0-py2.py3-none-any.whl.

File metadata

Download URL: rev_ai-2.21.0-py2.py3-none-any.whl
Upload date: Nov 27, 2024
Size: 44.7 kB
Tags: Python 2, Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/5.1.1 CPython/3.13.0

File hashes

Hashes for rev_ai-2.21.0-py2.py3-none-any.whl
Algorithm	Hash digest
SHA256	`2d191698ade897b50655aa4b03393e51af63948b92f096da4208f4556cc894d3`
MD5	`7e8bc51eb3a1b79f264ee82edbc8eec9`
BLAKE2b-256	`53c224783b6ad3a1cac186a26d0e98916f1855c841799e5ac231bd5400e2d27b`

See more details on using hashes here.
