Skip to main content

ngChat Speech to Text and Text to Speech SDK

Project description

ngChat Speech to Text and Text to Speech SDK

Welcome to ngChat, Next Generation Chat, a revolutionary enterprise conversational platform that aims to:

  1. addressing issues from the first version of NLU/ChatFlow, and
  2. adding significant new features that support enterprise dialogue applications for the next 5 years.

To know more, please visit Seasalt.ai

Example to use ngChat Speech to Text SDK:

Prerequisites

You'll need a ngchat speech-to-text server url to run this example. Please contact Seasalt.ai to have one.

Install and import the Speech SDK

First you'll need to install the Speech SDK.

pip install ngchat-speech-sdk

After the Speech SDK is installed, import it inot your Python project with this.

import ngchat_speech.speech as speechsdk

Create a speech configuration

To call the Speech service using the Speech SDK, you need to create a SpeechConfig. You'll need a ngchat speech-to-text server url to run this example. Please contact Seasalt.ai to have one.

    speech_config = speechsdk.SpeechConfig(
        host="ws://NGCHAT_STT_SERVER/client/ws/speech"
    )

Recognize from a file

In this example, we'll show how to recognize speech from an audio file, if you want to recognize a stream, you'll need to use SpeechRecognizer.start_continuous_recognition_async() instead of SpeechRecognizer.recognize_once(). Create an AudioConfig and use the filename parameter.

    audio_stream = speechsdk.audio.PushAudioInputStream()
    audio_config = speechsdk.audio.AudioConfig(
        filename="test.wav"
    )

Initialize a recognizer

After you've created a SpeechConfig and an AudioConfig, the next step is to initialize a SpeechRecognizer.

    speech_recognizer = speechsdk.SpeechRecognizer(
        speech_config=speech_config,
        audio_config=audio_config
    )

Connect callbacks to recognizer

SpeechRecognizer has 5 kinds of callbacks.

  • Recognizing - called when some words were recognized, but not finished recognizing a single utterance.
  • Recognized - called when a single utterance was recognized.
  • Canceled - called when a continuous recognition was interrupted.
  • Session_started - called when a recognition sesstion was started.
  • Session_stopped - called when a recognition sesstion was stopped.
    speech_recognizer.recognizing.connect(
        lambda evt: print(f"Recognizing: {evt.result.text}"))
    speech_recognizer.recognized.connect(
        lambda evt: print(f'Recognized: {evt.result.text}'))
    speech_recognizer.canceled.connect(
        lambda evt: print(f'Canceled: {evt}'))
    speech_recognizer.session_started.connect(
        lambda evt: print(f'Session_started: {evt}'))
    speech_recognizer.session_stopped.connect(
        lambda evt: print(f'Session_stopped: {evt}'))

Recognize speech

Now you're ready to run SpeechRecognizer. SpeechRecognizer has two ways for speech recognition.

  • Single-shot recognition - Performs recognition once. This is to recognize a single audio file. Stop recognizing after a single utterance is recognized.
  • Continuous recognition (async) - Asynchronously initiates continuous recognition operation. Connect to Recognizing and Recognized callbacks to receive recognition results. To stop asynchronous continuous recognition, call stop_continuous_recognition_async().

speech_recognizer.recognize_once()

Put all together

We put all these steps together, the example code to test ngChat Speech SDK will look like this.

import speech as speechsdk
import audio as audio
import asyncio
import threading
import sys
import time

if __name__=="__main__":
    # this is an example to show how to use the ngChat Speech SDK to recognize once

    try:
        speech_config = speechsdk.SpeechConfig(
            host="ws://NGCHAT_STT_SERVER/client/ws/speech"
        )
        audio_stream = audio.PushAudioInputStream()
        audio_config = audio.AudioConfig(filename="test.wav")
        speech_recognizer = speechsdk.SpeechRecognizer(
            speech_config=speech_config,
            audio_config=audio_config
        )

        speech_recognizer.recognizing.connect(
            lambda evt: print(f"Recognizing: {evt.result.text}"))
        speech_recognizer.recognized.connect(
            lambda evt: print(f'Recognized: {evt.result.text}'))
        speech_recognizer.canceled.connect(
            lambda evt: print(f'Canceled: {evt}'))
        speech_recognizer.session_started.connect(
            lambda evt: print(f'Session_started: {evt}'))
        speech_recognizer.session_stopped.connect(
            lambda evt: print(f'Session_stopped: {evt}'))

        speech_recognizer.recognize_once()
        time.sleep(3)

    except KeyboardInterrupt:
        print("Caught keyboard interrupt. Canceling tasks...")
    except Exception as e:
        print(f"Exception: {e}")
    finally:
        sys.exit()

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ngchat-speech-sdk-0.1.1.tar.gz (13.8 kB view details)

Uploaded Source

Built Distribution

ngchat_speech_sdk-0.1.1-py3-none-any.whl (14.5 kB view details)

Uploaded Python 3

File details

Details for the file ngchat-speech-sdk-0.1.1.tar.gz.

File metadata

  • Download URL: ngchat-speech-sdk-0.1.1.tar.gz
  • Upload date:
  • Size: 13.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/49.2.0 requests-toolbelt/0.9.1 tqdm/4.31.1 CPython/3.6.9

File hashes

Hashes for ngchat-speech-sdk-0.1.1.tar.gz
Algorithm Hash digest
SHA256 24d9f472ffa21abd9da685a0d1ff15370ea5e4c47102240387e5e8ad53fe3dfc
MD5 bf3ac9883a86269211fe2cb4c122233b
BLAKE2b-256 0b80cfa9f887156e45ed10ad08f1dc65959911119ce09d5f0834a347529c2345

See more details on using hashes here.

File details

Details for the file ngchat_speech_sdk-0.1.1-py3-none-any.whl.

File metadata

  • Download URL: ngchat_speech_sdk-0.1.1-py3-none-any.whl
  • Upload date:
  • Size: 14.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/49.2.0 requests-toolbelt/0.9.1 tqdm/4.31.1 CPython/3.6.9

File hashes

Hashes for ngchat_speech_sdk-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 fda1141461a9d1931fc3f47979c509fc9308a2fcaba9f0848881c8ef9fc1e2c7
MD5 a437aa80425b315920e08ac79ddecbf2
BLAKE2b-256 0672d7f2e80857c4fbd00bce6f8e70c3c6d2c8f9b93275ddf6be2fb51e5ef8cc

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page