Skip to main content

llama-index tools azure_speech integration

Project description

Azure Speech Tool

This tool allows Agents to use Microsoft Azure speech services to transcribe audio files to text, and create audio files from text. To see more and get started, visit https://azure.microsoft.com/en-us/products/ai-services/ai-speech

Usage

This tool has a more extensive example usage documented in a Jupyter notebook here

from llama_index.tools.azure_speech import AzureSpeechToolSpec
from llama_index.core.agent.workflow import FunctionAgent
from llama_index.llms.openai import OpenAI

speech_tool = AzureSpeechToolSpec(speech_key="your-key", region="eastus")

agent = FunctionAgent(
    tools=speech_tool.to_tool_list(),
    llm=OpenAI(model="gpt-4.1"),
)
print(await agent.run('Say "hello world"'))
print(
    await agent.run(
        "summarize the data/speech.wav audio file into a few sentences"
    )
)

text_to_speech: Takes an input string and synthesizes audio to play on the users computer speech_to_text: Takes a .wav file and transcribes it into text

This loader is designed to be used as a way to load data as a Tool in a Agent.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

llama_index_tools_azure_speech-0.5.0.tar.gz (4.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

File details

Details for the file llama_index_tools_azure_speech-0.5.0.tar.gz.

File metadata

  • Download URL: llama_index_tools_azure_speech-0.5.0.tar.gz
  • Upload date:
  • Size: 4.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.10.9 {"installer":{"name":"uv","version":"0.10.9","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for llama_index_tools_azure_speech-0.5.0.tar.gz
Algorithm Hash digest
SHA256 dbf8c724f8220264ecb8a4a08e6d7c479d423666e9cc85a20873dc275385e288
MD5 bb16107b3923552ef2624aabc37c3fb4
BLAKE2b-256 00f11a0a65cf5cfaefbdcdd46a522c6909c18f1e431e532adfd34688fe16dbce

See more details on using hashes here.

File details

Details for the file llama_index_tools_azure_speech-0.5.0-py3-none-any.whl.

File metadata

  • Download URL: llama_index_tools_azure_speech-0.5.0-py3-none-any.whl
  • Upload date:
  • Size: 4.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.10.9 {"installer":{"name":"uv","version":"0.10.9","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for llama_index_tools_azure_speech-0.5.0-py3-none-any.whl
Algorithm Hash digest
SHA256 fe34671cfc091bd4e0fb13a6296c289446178dbb27e9bd3f933f828baa890795
MD5 74ac6ad08a99de2c65715febaeb8e2d2
BLAKE2b-256 a2036bbbba94bdeead6f8378b10aacdfaeb17321e7e0b17143d14f2fe21e1391

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page