The official Python SDK for the Deepgram automated speech recognition platform.
Project description
Deepgram Python SDK
Official Python SDK for Deepgram. Power your apps with world-class speech and Language AI models.
This SDK only supports hosted usage of api.deepgram.com.
- Deepgram Python SDK
- Documentation
- Getting an API Key
- Requirements
- Installation
- Quickstarts
- Examples
- Development and Contributing
- Getting Help
Documentation
You can learn more about the Deepgram API at developers.deepgram.com.
Getting an API Key
🔑 To access the Deepgram API you will need a free Deepgram API Key.
Requirements
Python (version ^3.10)
Installation
To install the latest version available (which will guarantee change over time):
pip install deepgram-sdk
If you are going to write an application to consume this SDK, it's highly recommended and a programming staple to pin to at least a major version of an SDK (ie ==2.*
) or with due diligence, to a minor and/or specific version (ie ==2.1.*
or ==2.12.0
, respectively). If you are unfamiliar with semantic versioning or semver, it's a must-read.
In a requirements.txt
file, pinning to a major (or minor) version, like if you want to stick to using the SDK v2.12.0
release, that can be done like this:
deepgram-sdk==2.*
Or using pip:
pip install deepgram-sdk==2.*
Pinning to a specific version can be done like this in a requirements.txt
file:
deepgram-sdk==2.12.0
Or using pip:
pip install deepgram-sdk==2.12.0
We guarantee that major interfaces will not break in a given major semver (ie 2.*
release). However, all bets are off moving from a 2.*
to 3.*
major release. This follows standard semver best-practices.
Quickstarts
This SDK aims to reduce complexity and abtract/hide some internal Deepgram details that clients shouldn't need to know about. However you can still tweak options and settings if you need.
PreRecorded Audio Transcription Quickstart
You can find a walkthrough on our documentation site. Transcribing Pre-Recorded Audio can be done using the following sample code:
AUDIO_URL = {
"url": "https://static.deepgram.com/examples/Bueller-Life-moves-pretty-fast.wav"
}
# STEP 1 Create a Deepgram client using the API key from environment variables
deepgram = DeepgramClient()
# STEP 2 Call the transcribe_url method on the prerecorded class
options = PrerecordedOptions(
model="nova",
smart_format=True,
summarize="v2",
)
url_response = deepgram.listen.prerecorded.v("1").transcribe_url(AUDIO_URL, options)
print(url_response)
Live Audio Transcription Quickstart
You can find a walkthrough on our documentation site. Transcribing Live Audio can be done using the following sample code:
deepgram = DeepgramClient()
# Create a websocket connection to Deepgram
options = LiveOptions(
punctuate=True,
language="en-US",
encoding="linear16",
channels=1,
sample_rate=16000,
)
def on_message(result=None):
if result is None:
return
sentence = result.channel.alternatives[0].transcript
if len(sentence) == 0:
return
print(f"speaker: {sentence}")
def on_metadata(metadata=None):
if metadata is None:
return
print(f"\n{metadata}\n")
def on_error(error=None):
if error is None:
return
print(f"\n{error}\n")
dg_connection = deepgram.listen.live.v("1")
dg_connection.start(options)
dg_connection.on(LiveTranscriptionEvents.Transcript, on_message)
dg_connection.on(LiveTranscriptionEvents.Metadata, on_metadata)
dg_connection.on(LiveTranscriptionEvents.Error, on_error)
# create microphone
microphone = Microphone(dg_connection.send)
# start microphone
microphone.start()
# wait until finished
input("Press Enter to stop recording...\n\n")
# Wait for the microphone to close
microphone.finish()
# Indicate that we've finished
dg_connection.finish()
print("Finished")
Examples
There are examples for every API call in this SDK. You can find all of these examples in the examples folder at the root of this repo.
These examples provide:
-
PreRecorded Audio Transcription:
- From an Audio File - examples/prerecorded/file
- From an URL - examples/prerecorded/url
-
Live Audio Transcription:
- From a Microphone - examples/streaming/microphone
- From an HTTP Endpoint - examples/streaming/http
-
Management API exercise the full CRUD operations for:
- Balances - examples/manage/balances
- Invitations - examples/manage/invitations
- Keys - examples/manage/keys
- Members - examples/manage/members
- Projects - examples/manage/projects
- Scopes - examples/manage/scopes
- Usage - examples/manage/usage
To run each example set the DEEPGRAM_API_KEY
as an environment variable, then cd
into each example folder and execute the example: go run main.py
.
Development and Contributing
Interested in contributing? We ❤️ pull requests!
To make sure our community is safe for all, be sure to review and agree to our Code of Conduct. Then see the Contribution guidelines for more information.
Prerequisites
In order to develop new features for the SDK itself, you first need to uninstall any previous installation of the deepgram-sdk
and then install/pip the dependencies contained in the requirements.txt
then instruct python (via pip) to use the SDK by installing it locally.
From the root of the repo, that would entail:
pip uninstall deepgram-sdk
pip install -r requirements.txt
pip install -e .
Testing
If you are looking to contribute or modify pytest code, then you need to install the following dependencies:
pip install -r requirements-dev.txt
Getting Help
We love to hear from you so if you have questions, comments or find a bug in the project, let us know! You can either:
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for deepgram_sdk-3.1.2-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 9a71b726eed88d0fa4e24da25e5a70d75014ece6fb531ddfbb6a773f30a858b7 |
|
MD5 | 7bcd8490e64617d2186d59c66f7dc077 |
|
BLAKE2b-256 | ce809fb262c4e477e49da5c086ecec1b629173d501baffbebac0c011fd11fad1 |