The SaladCloud Transcription SDK. Please refer to the [SaladCloud Transcription API Documentation](https://docs.salad.com/transcription/) for more details.
Project description
SaladCloud Transcription SDK for Python
Welcome to the SaladCloud Transcription SDK documentation. This guide will help you get started with integrating and using the SaladCloud Transcription SDK in your project.
Versions
- API version:
1.0.0-alpha.1 - SDK version:
1.0.0-alpha.1
About the API
The SaladCloud REST API. Please refer to the SaladCloud API Documentation for more details.
Table of Contents
- Installation
- Quick Start
- Authentication
- Transcription Engines
- Sample Usage
- Development and Testing
- License
- Support
Installation
Install the package using pip:
pip install salad-cloud-transcription-sdk
Quick Start
from salad_cloud_transcription_sdk import SaladCloudTranscriptionSdk
from salad_cloud_transcription_sdk.models.transcription_engine import TranscriptionEngine
from salad_cloud_transcription_sdk.models.transcription_request import TranscriptionRequest
from salad_cloud_transcription_sdk.models.transcription_job_input import TranscriptionJobInput
# Initialize the SDK
sdk = SaladCloudTranscriptionSdk(api_key="your_api_key")
# Setup the request
request_object = TranscriptionRequest(
options=TranscriptionJobInput(
language_code="en",
return_as_file=False,
sentence_level_timestamps=True,
word_level_timestamps=True,
diarization=True,
srt=True
),
metadata={"project": "example_project"}
)
# Transcribe a video file using the Full Transcription engine
result = sdk.transcribe(
"path/to/video.mp4",
organization_name="your_organization_name",
request=request_object,
engine=TranscriptionEngine.Full,
auto_poll=True
)
# Print the transcription
print(result.output.text)
Authentication
API Key Authentication
The Salad Cloud Transcription API uses API keys as a form of authentication. An API key is a unique identifier used to authenticate a user, developer, or a program that is calling the API.
Setting the API Key
When you initialize the SDK, you can set the API key as follows:
sdk = SaladCloudTranscriptionSdk(api_key="YOUR_API_KEY")
If you need to set or update the API key after initializing the SDK, you can use:
sdk.set_api_key("YOUR_API_KEY")
Transcription Engines
The SDK supports two transcription modes: Full and Lite. The desired mode can be specified via the engine parameter of the transcribe method. When omitted it defaults to Full.
When using the Lite engine, the request object has to specify explicit defaults for a few of the properties:
request = TranscriptionRequest(
options=TranscriptionJobInput(
language_code="en",
return_as_file=True,
translate="to_eng",
sentence_level_timestamps=True,
word_level_timestamps=True,
diarization=True,
sentence_diarization=True,
srt=True,
# Adding required parameters with null/empty values
summarize=0,
custom_vocabulary="",
llm_translation=[],
srt_translation=[],
),
metadata={"test_id": "integration_test", "environment": "testing"},
)
Sample Usage
The source parameter
All transcription methods of the SaladCloudTranscriptionSdk take a source parameter - the input file for the transcription job.
The source can be a path pointing to a local file, like /usr/share/my_video_project.mp4. Or, it can be an URL, like https://myserver.net/files/my_video_project.mp4.
When a local file is specified, the SDK will take care to upload that to the Saldad Simple Storage Service (S4) behind the scenes. When the upload is completed, the transcription job is run using as input an S4 URL.
When a remote file is specified, that URL is passed as-is to the transcription engine. Make sure the file is publicly accessible.
Start a Transcription Job and wait for it to complete
from salad_cloud_transcription_sdk import SaladCloudTranscriptionSdk
from salad_cloud_transcription_sdk.models.transcription_engine import TranscriptionEngine
from salad_cloud_transcription_sdk.models.transcription_request import TranscriptionRequest
from salad_cloud_transcription_sdk.models.transcription_job_input import TranscriptionJobInput
# Initialize the SDK
sdk = SaladCloudTranscriptionSdk(api_key="your_api_key")
# Setup the request
request_object = TranscriptionRequest(
options=TranscriptionJobInput(
language_code="en",
return_as_file=False,
sentence_level_timestamps=True,
word_level_timestamps=True,
diarization=True,
srt=True
),
metadata={"project": "example_project"}
)
# Start a transcription job and wait for the result
result = sdk.transcribe(
source="path/to/audio.mp3",
organization_name="your_organization_name",
request=request_object,
auto_poll=True
)
# Print the transcription job output
print(result.output)
Start a Transcription Job and poll for status
from salad_cloud_transcription import SaladCloudTranscriptionSdk
from salad_cloud_sdk.models import Status
# Initialize the SDK
sdk = SaladCloudTranscriptionSdk(api_key="your_api_key")
# Setup the request
request_object = TranscriptionRequest(
options=TranscriptionJobInput(
language_code="en",
return_as_file=False,
sentence_level_timestamps=True,
word_level_timestamps=True,
diarization=True,
srt=True
),
metadata={"project": "example_project"}
)
# Start a transcription job. auto_poll = False
job = sdk.transcribe(
source = "path/to/audio.mp3",
request = request_object,
auto_poll = False)
# Poll for the job status
while True:
job = sdk.get_transcription_job(organization_name, job.id_)
if job.status in [
Status.SUCCEEDED.value,
Status.FAILED.value,
Status.CANCELLED.value,
]:
break
time.sleep(5)
if job.status == Status.SUCCEEDED.value:
print(job.output)
Start a Transcription Job and Get Updates via a Webhook
from salad_cloud_transcription_sdk import SaladCloudTranscriptionSdk
from salad_cloud_transcription_sdk.models.transcription_request import TranscriptionRequest
from salad_cloud_transcription_sdk.models.transcription_job_input import TranscriptionJobInput
# Initialize the SDK
sdk = SaladCloudTranscriptionSdk(api_key="your_api_key")
# Setup the request
request_object = TranscriptionRequest(
options=TranscriptionJobInput(
language_code="en",
return_as_file=False,
sentence_level_timestamps=True,
word_level_timestamps=True,
diarization=True,
srt=True
),
metadata={"project": "example_project"},
webhook_url="https://your-webhook-endpoint.com"
)
# Start a transcription job with a webhook URL
job = sdk.transcribe(
source="path/to/audio.mp3",
request=request_object
)
print(f"Job started with ID: {job.id}")
In your webhook handler you need to validate the payload being sent to you:
from salad_cloud_transcription_sdk import SaladCloudTranscriptionSdk
def webhook_handler(request):
# Initialize the SDK
sdk = SaladCloudTranscriptionSdk(api_key="your_api_key")
# Extract the signing parameters from the request headers
payload = request.json()
webhook_signature = request.headers.get("webhook-signature")
webhook_timestamp = request.headers.get("webhook-timestamp")
webhook_message_id = request.headers.get("webhook-id")
# Retrieve the webhook signing secret for your Salad organization
sdk = SaladCloudTranscriptionSdk(api_key="your_api_key")
secret_key_service = sdk.webhook_secret_key
secret_key_response = secret_key_service.get_webhook_secret_key(
"your_organization_name")
signing_secret = f"whsec_{secret_key_response.secret_key}"
# Process the webhook payload
var job = sdk.transcription_client.process_webhook_request(
payload=payload,
signing_secret = signing_secret,
webhook_id=webhook_message_id,
webhook_timestamp=webhook_timestamp,
webhook_signature=webhook_signature,
)
# The payload verification result is a TranscriptionWebhookPayload.
# Its data field is a InferenceEndpointJob and it contains the transcription job output.
print(job.data)
License
This SDK is licensed under the MIT License.
See the LICENSE file for more details.
Support
For support, please email support@salad.com or visit our documentation at https://docs.salad.com.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file salad_cloud_transcription_sdk-1.0.0a1.tar.gz.
File metadata
- Download URL: salad_cloud_transcription_sdk-1.0.0a1.tar.gz
- Upload date:
- Size: 40.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.9.23
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
830db7f187af14a5189a140a8be084e7bd35d395d91272d67d6c3867580d48b3
|
|
| MD5 |
9a8dda89f6371123686e9dc401a4138a
|
|
| BLAKE2b-256 |
a64bbb2c64287d5552ba0b48384f9b96faba51243f379cd4dc982668263cc2ca
|
File details
Details for the file salad_cloud_transcription_sdk-1.0.0a1-py3-none-any.whl.
File metadata
- Download URL: salad_cloud_transcription_sdk-1.0.0a1-py3-none-any.whl
- Upload date:
- Size: 52.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.9.23
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
75349a3a7db37df32a62ea91632823af856d6dbf08dbc6f4fb7f7dafb434e64d
|
|
| MD5 |
36f4bed62769e71f1d563c8dc226eca0
|
|
| BLAKE2b-256 |
ca82514da71171557fe48e220959caa6c64a9be9a13152f8efb9ab7b4b10f086
|