Skip to main content

llama-index tools azure_speech integration

Project description

Azure Speech Tool

This tool allows Agents to use Microsoft Azure speech services to transcribe audio files to text, and create audio files from text. To see more and get started, visit https://azure.microsoft.com/en-us/products/ai-services/ai-speech

Usage

This tool has a more extensive example usage documented in a Jupyter notebook here

from llama_index.tools.azure_speech import AzureSpeechToolSpec
from llama_index.agent import OpenAIAgent

speech_tool = AzureSpeechToolSpec(speech_key="your-key", region="eastus")

agent = OpenAIAgent.from_tools(
    speech_tool.to_tool_list(),
    verbose=True,
)
print(agent.chat('Say "hello world"'))
print(
    agent.chat("summarize the data/speech.wav audio file into a few sentences")
)

text_to_speech: Takes an input string and synthesizes audio to play on the users computer speech_to_text: Takes a .wav file and transcribes it into text

This loader is designed to be used as a way to load data as a Tool in a Agent. See here for examples.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

llama_index_tools_azure_speech-0.1.3.tar.gz (2.9 kB view hashes)

Uploaded Source

Built Distribution

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page