The official Python SDK for the Deepgram automated speech recognition platform.

Deepgram Python SDK

Official Python SDK for Deepgram. Power your apps with world-class speech and Language AI models.

Deepgram Python SDK
Documentation
Getting an API Key
Requirements
Installation
Quickstarts
- PreRecorded Audio Transcription Quickstart
- Live Audio Transcription Quickstart
Examples
Logging
Backwards Compatibility
Development and Contributing
Getting Help

Documentation

You can learn more about the Deepgram API at developers.deepgram.com.

Getting an API Key

🔑 To access the Deepgram API you will need a free Deepgram API Key.
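
The examples later in this README read the key from the DEEPGRAM_API_KEY environment variable. A minimal sketch of loading it and failing early when it is missing might look like this; the helper name load_api_key is hypothetical and is not part of the SDK.

```python
import os


def load_api_key(env=os.environ) -> str:
    """Return the Deepgram API key from the environment, or raise if it is missing."""
    # DEEPGRAM_API_KEY is the variable name used by the examples in this README.
    key = env.get("DEEPGRAM_API_KEY", "").strip()
    if not key:
        raise RuntimeError("DEEPGRAM_API_KEY is not set")
    return key
```

Checking the key up front tends to give a clearer error than a failed request later on.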

Requirements

Python (version ^3.10)
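
The caret in ^3.10 is the notation used by tools such as Poetry, where it is read as >=3.10,<4.0. As a rough sketch, an application could check its interpreter against that range at startup; the function name below is hypothetical.

```python
import sys


def satisfies_caret_3_10(version_info=sys.version_info) -> bool:
    """Return True when the interpreter version falls within >=3.10,<4.0."""
    major, minor = version_info[0], version_info[1]
    return major == 3 and minor >= 10


if not satisfies_caret_3_10():
    print("Warning: this SDK expects Python >=3.10,<4.0")
```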

Installation

To install the latest version available (which will change over time):

pip install deepgram-sdk

If you are writing an application that consumes this SDK, it is highly recommended to pin to at least a major version of the SDK (i.e. ==2.*) or, with due diligence, to a minor or specific version (i.e. ==2.1.* or ==2.12.0, respectively). If you are unfamiliar with semantic versioning (semver), it's a must-read.

In a requirements.txt file, you can pin to a major version, for example to stay on the 2.x releases of the SDK, like this:

deepgram-sdk==2.*

Or using pip:

pip install deepgram-sdk==2.*

Pinning to a specific version can be done like this in a requirements.txt file:

deepgram-sdk==2.12.0

Or using pip:

pip install deepgram-sdk==2.12.0

We guarantee that major interfaces will not break within a given major semver release (i.e. a 2.* release). However, all bets are off when moving from a 2.* to a 3.* major release. This follows standard semver best practices.
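
To make the pinning patterns concrete, here is a simplified sketch of how ==2.*, ==2.1.* and ==2.12.0 style pins match version strings. It is an illustration only: it ignores pre-release suffixes such as dev3 and does not implement the full version-specifier rules that pip applies. The function names are hypothetical.

```python
def parse_version(version: str) -> tuple[int, ...]:
    """Return the leading numeric components of a version string."""
    parts = []
    for piece in version.split("."):
        if not piece.isdigit():
            break  # stop at suffixes such as "dev3" or "rc1"
        parts.append(int(piece))
    return tuple(parts)


def matches_pin(installed: str, pin: str) -> bool:
    """Check a version against a pin such as "2.*", "2.1.*" or "2.12.0"."""
    if pin.endswith(".*"):
        # Wildcard pin: only the components before ".*" have to match.
        prefix = parse_version(pin[:-2])
        return parse_version(installed)[: len(prefix)] == prefix
    # Exact pin: every numeric component has to match.
    return parse_version(installed) == parse_version(pin)


for candidate in ("2.12.0", "2.1.5", "3.0.0"):
    print(candidate, matches_pin(candidate, "2.*"))
```

Under this reading, a 2.* pin accepts any 2.x release and rejects 3.0.0, which is the boundary where interface changes are allowed.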

Quickstarts

This SDK aims to reduce complexity and abstract/hide some internal Deepgram details that clients shouldn't need to know about. However, you can still tweak options and settings if you need to.

PreRecorded Audio Transcription Quickstart

You can find a walkthrough on our documentation site. Transcribing Pre-Recorded Audio can be done using the following sample code:

from deepgram import ClientOptionsFromEnv, DeepgramClient, PrerecordedOptions

AUDIO_URL = {
    "url": "https://static.deepgram.com/examples/Bueller-Life-moves-pretty-fast.wav"
}

## STEP 1 Create a Deepgram client using the API key from environment variables
deepgram: DeepgramClient = DeepgramClient("", ClientOptionsFromEnv())

## STEP 2 Call the transcribe_url method on the prerecorded class
options: PrerecordedOptions = PrerecordedOptions(
    model="nova-2",
    smart_format=True,
)
response = deepgram.listen.rest.v("1").transcribe_url(AUDIO_URL, options)
print(f"response: {response}\n\n")
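
The printed response is a nested structure. As a hedged sketch, the helper below pulls the first transcript out of a response that has already been converted to a plain dict. It assumes the results → channels → alternatives → transcript layout (mirroring the channel.alternatives[0].transcript access used in the live quickstart); the helper name first_transcript and the sample payload are illustrative, not SDK output.

```python
def first_transcript(response: dict) -> str:
    """Return the first alternative's transcript, or "" if it is absent."""
    channels = response.get("results", {}).get("channels", [])
    if not channels:
        return ""
    alternatives = channels[0].get("alternatives", [])
    if not alternatives:
        return ""
    return alternatives[0].get("transcript", "")


# Hand-written sample payload, assumed to match the response layout.
sample = {
    "results": {
        "channels": [
            {"alternatives": [{"transcript": "Life moves pretty fast."}]}
        ]
    }
}
print(first_transcript(sample))
```

Returning an empty string instead of raising keeps the helper safe to call on responses that contain no speech.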

Live Audio Transcription Quickstart

You can find a walkthrough on our documentation site. Transcribing Live Audio can be done using the following sample code:

from deepgram import DeepgramClient, LiveOptions, LiveTranscriptionEvents, Microphone

deepgram: DeepgramClient = DeepgramClient()

dg_connection = deepgram.listen.websocket.v("1")

def on_open(self, open, **kwargs):
    print(f"\n\n{open}\n\n")

def on_message(self, result, **kwargs):
    sentence = result.channel.alternatives[0].transcript
    if len(sentence) == 0:
        return
    print(f"speaker: {sentence}")

def on_metadata(self, metadata, **kwargs):
    print(f"\n\n{metadata}\n\n")

def on_speech_started(self, speech_started, **kwargs):
    print(f"\n\n{speech_started}\n\n")

def on_utterance_end(self, utterance_end, **kwargs):
    print(f"\n\n{utterance_end}\n\n")

def on_error(self, error, **kwargs):
    print(f"\n\n{error}\n\n")

def on_close(self, close, **kwargs):
    print(f"\n\n{close}\n\n")

dg_connection.on(LiveTranscriptionEvents.Open, on_open)
dg_connection.on(LiveTranscriptionEvents.Transcript, on_message)
dg_connection.on(LiveTranscriptionEvents.Metadata, on_metadata)
dg_connection.on(LiveTranscriptionEvents.SpeechStarted, on_speech_started)
dg_connection.on(LiveTranscriptionEvents.UtteranceEnd, on_utterance_end)
dg_connection.on(LiveTranscriptionEvents.Error, on_error)
dg_connection.on(LiveTranscriptionEvents.Close, on_close)

options: LiveOptions = LiveOptions(
    model="nova-2",
    punctuate=True,
    language="en-US",
    encoding="linear16",
    channels=1,
    sample_rate=16000,
    ## To get UtteranceEnd, the following must be set:
    interim_results=True,
    utterance_end_ms="1000",
    vad_events=True,
)
dg_connection.start(options)

## create microphone
microphone = Microphone(dg_connection.send)

## start microphone
microphone.start()

## wait until finished
input("Press Enter to stop recording...\n\n")

## Wait for the microphone to close
microphone.finish()

## Indicate that we've finished
dg_connection.finish()

print("Finished")
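
With interim_results=True, the Transcript handler above is called for partial results as well as final ones. The following sketch collects only finalized, non-empty sentences. It assumes each result exposes an is_final flag alongside channel.alternatives[0].transcript; the TranscriptCollector class and the fake_result stand-in are hypothetical and exist only so the sketch runs without a live connection.

```python
from types import SimpleNamespace


class TranscriptCollector:
    """Collects finalized, non-empty transcripts from Transcript events."""

    def __init__(self):
        self.sentences = []

    def on_message(self, client, result, **kwargs):
        # Assumption: interim results carry is_final=False.
        # A result without the flag is treated as final.
        if not getattr(result, "is_final", True):
            return
        sentence = result.channel.alternatives[0].transcript
        if sentence:
            self.sentences.append(sentence)

    def text(self) -> str:
        """Join the collected sentences into one string."""
        return " ".join(self.sentences)


def fake_result(transcript, is_final):
    """Stand-in for an SDK result object, used only to exercise the collector."""
    alternative = SimpleNamespace(transcript=transcript)
    channel = SimpleNamespace(alternatives=[alternative])
    return SimpleNamespace(is_final=is_final, channel=channel)


collector = TranscriptCollector()
collector.on_message(None, fake_result("life moves", False))  # interim, skipped
collector.on_message(None, fake_result("life moves pretty fast", True))
collector.on_message(None, fake_result("", True))  # empty, skipped
print(collector.text())
```

In a real session, the bound method collector.on_message would presumably be registered in place of on_message in the dg_connection.on(...) call for LiveTranscriptionEvents.Transcript.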

Examples

There are examples for every API call in this SDK. You can find all of these examples in the examples folder at the root of this repo.

Before running any of these examples, take a look at the README and install the following dependencies:

pip install -r examples/requirements-examples.txt

Text to Speech:

Asynchronous - examples/text-to-speech
Synchronous - examples/text-to-speech

Analyze Text:

Intent Recognition - examples/analyze/intent
Sentiment Analysis - examples/analyze/sentiment
Summarization - examples/analyze/summary
Topic Detection - examples/analyze/topic

PreRecorded Audio:

Transcription From an Audio File - examples/prerecorded/file
Transcription From an URL - examples/prerecorded/url
Intent Recognition - examples/speech-to-text/rest/intent
Sentiment Analysis - examples/speech-to-text/rest/sentiment
Summarization - examples/speech-to-text/rest/summary
Topic Detection - examples/speech-to-text/rest/topic

Live Audio Transcription:

From a Microphone - examples/streaming/microphone
From an HTTP Endpoint - examples/streaming/http

Management API examples that exercise the full CRUD operations for:

Balances - examples/manage/balances
Invitations - examples/manage/invitations
Keys - examples/manage/keys
Members - examples/manage/members
Projects - examples/manage/projects
Scopes - examples/manage/scopes
Usage - examples/manage/usage

To run each example, set DEEPGRAM_API_KEY as an environment variable, then cd into each example folder and execute the example: python main.py.

Logging

This SDK provides logging as a means to troubleshoot and debug issues you encounter. By default, this SDK enables Information level messages and higher (i.e. Warning, Error, etc.) when you initialize the library as follows:

deepgram: DeepgramClient = DeepgramClient()

To increase the logging output/verbosity for debugging or troubleshooting purposes, you can set the DEBUG level by using this code:

import logging

from deepgram import DeepgramClient, DeepgramClientOptions

config: DeepgramClientOptions = DeepgramClientOptions(
    verbose=logging.DEBUG,
)
deepgram: DeepgramClient = DeepgramClient("", config)
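
If you prefer to switch verbosity without editing code, one option is to derive the level from an environment variable and pass the result as the verbose value. This is a sketch under assumptions: the variable name DEEPGRAM_LOG_LEVEL and the helper verbosity_from_env are hypothetical and are not read by the SDK itself.

```python
import logging
import os

# Level names accepted by this sketch, mapped to standard logging levels.
_LEVELS = {
    "DEBUG": logging.DEBUG,
    "INFO": logging.INFO,
    "WARNING": logging.WARNING,
    "ERROR": logging.ERROR,
}


def verbosity_from_env(env=os.environ, default=logging.INFO) -> int:
    """Map a level name from the environment to a logging level."""
    # DEEPGRAM_LOG_LEVEL is a hypothetical variable name chosen for this example.
    name = env.get("DEEPGRAM_LOG_LEVEL", "").strip().upper()
    return _LEVELS.get(name, default)


print(logging.getLevelName(verbosity_from_env({"DEEPGRAM_LOG_LEVEL": "debug"})))
```

Unknown or missing values fall back to the Information level, which matches the default described above.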

Backwards Compatibility

Older SDK versions will receive Priority 1 (P1) bug support only. Security issues, both in our code and dependencies, are promptly addressed. Significant bugs without clear workarounds are also given priority attention.

Development and Contributing

Interested in contributing? We ❤️ pull requests!

To make sure our community is safe for all, be sure to review and agree to our Code of Conduct. Then see the Contribution guidelines for more information.

Prerequisites

To develop new features for the SDK itself, first uninstall any previous installation of deepgram-sdk, then install the dependencies listed in requirements.txt, and finally instruct Python (via pip) to use the SDK by installing it locally in editable mode.

From the root of the repo, that would entail:

pip uninstall deepgram-sdk
pip install -r requirements.txt
pip install -e .

Daily and Unit Tests

If you want to run, modify, or contribute to the daily/unit tests, you need to install the following dependencies:

pip install -r requirements-dev.txt

Daily Tests

The daily tests invoke a series of checks against the actual/real API endpoint and save the results in the tests/response_data folder. This response data is updated nightly to reflect the latest response from the server. Running the daily tests does require a DEEPGRAM_API_KEY set in your environment variables.

To run the Daily Tests:

make daily-test

Unit Tests

The unit tests invoke a series of checks against mock endpoints using the responses saved in tests/response_data from the daily tests. These tests are meant to simulate running against the endpoint without actually reaching out to it. Running the unit tests does require a DEEPGRAM_API_KEY set in your environment variables, but no request is sent to the server.

make unit-test
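
The record-then-replay idea behind the daily and unit tests can be sketched as follows. This is not the SDK's actual test harness: the file naming, the helper functions, and the mocked transcribe_url call are all assumptions made for illustration, and the payload is hand-written rather than a real server response.

```python
import json
import tempfile
from pathlib import Path
from unittest import mock


def save_response(folder: Path, name: str, payload: dict) -> Path:
    """Record a response as JSON, as a daily test might after a real API call."""
    folder.mkdir(parents=True, exist_ok=True)
    path = folder / f"{name}.json"
    path.write_text(json.dumps(payload, indent=2), encoding="utf-8")
    return path


def load_response(folder: Path, name: str) -> dict:
    """Load a previously recorded response for replay in a unit test."""
    return json.loads((folder / f"{name}.json").read_text(encoding="utf-8"))


def replay_demo() -> str:
    """Record a payload, then serve it from a mock client instead of the network."""
    with tempfile.TemporaryDirectory() as tmp:
        folder = Path(tmp) / "response_data"
        payload = {
            "results": {"channels": [{"alternatives": [{"transcript": "hello"}]}]}
        }
        save_response(folder, "example", payload)

        client = mock.Mock()  # stands in for a real client, no request is made
        client.transcribe_url.return_value = load_response(folder, "example")

        response = client.transcribe_url({"url": "https://example.invalid/audio.wav"})
        return response["results"]["channels"][0]["alternatives"][0]["transcript"]


print(replay_demo())
```

Keeping the recorded data on disk means the unit tests stay deterministic between nightly refreshes.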

Getting Help

We love to hear from you, so if you have questions, comments, or find a bug in the project, let us know!
