This library provides an audio interface to OpenAI endpoints.

Wyn Voice: A Conversational AI and Audio Processing Library

Introduction and Motivation

🎙️ WYN-Voice is a Python library designed to simplify the process of creating conversational AI applications that leverage OpenAI's GPT models. 🤖 The library provides an easy-to-use interface for generating responses to user inputs and includes functionality for recording and processing audio, 🎧 making it suitable for building interactive voice-based applications. 🗣️

Directory Structure

The project directory is organized as follows:

.
├── pyproject.toml
├── README.md
└── wyn_voice
    └── chat.py
  • pyproject.toml: Contains the project's dependencies and other configuration settings.
  • README.md: This file, providing an overview and usage instructions.
  • wyn_voice: A folder containing the main library code.
    • chat.py: The script defining the ChatBot and AudioProcessor classes.

Example Usage

To get started with Wyn Voice, follow these steps:

Installation

First, install the necessary packages using pip:

pip install wyn-voice pyautogen pydub openai
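
After installing, it can help to confirm that the packages are importable before running the examples. The following is a minimal sketch, not part of Wyn Voice itself: the helper name `missing_modules` is hypothetical, and the import names are assumptions based on the packages listed above (the pyautogen distribution is imported as `autogen`).

```python
import importlib.util

# Import names assumed for the packages installed above; note that the
# pyautogen distribution is imported as "autogen".
REQUIRED_MODULES = ["wyn_voice", "autogen", "pydub", "openai"]


def missing_modules(module_names):
    """Return the top-level module names that cannot be found in this environment."""
    return [name for name in module_names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules(REQUIRED_MODULES)
    if missing:
        print("Missing modules:", ", ".join(missing))
    else:
        print("All required modules are importable.")
```

If anything is reported as missing, re-run the pip command above in the same environment that runs your script.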

Using the ChatBot Class

The ChatBot class allows you to interact with OpenAI's GPT models to generate responses based on user input.

from wyn_voice.chat import ChatBot

# Initialize the ChatBot with your OpenAI API key
api_key = 'your-openai-api-key'
chatbot = ChatBot(api_key)

# Generate a response from the chatbot
prompt = "Hello, how are you?"
response = chatbot.generate_response(prompt)
print("ChatBot:", response)

# Retrieve the conversation history
history = chatbot.get_history()
print("Conversation History:", history)
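
The example above hard-codes the API key for brevity. A common alternative is to read it from an environment variable so that it never ends up in source control. This is a sketch under assumptions: the helper `load_api_key` is hypothetical and not part of Wyn Voice, and `OPENAI_API_KEY` is just a conventional variable name.

```python
import os


def load_api_key(env_var="OPENAI_API_KEY"):
    """Read the API key from an environment variable, failing loudly if it is unset."""
    api_key = os.environ.get(env_var, "").strip()
    if not api_key:
        raise RuntimeError(f"Set the {env_var} environment variable first.")
    return api_key


# Usage with the ChatBot shown above (requires wyn-voice and a real key):
# from wyn_voice.chat import ChatBot
# chatbot = ChatBot(load_api_key())
```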

Using the AudioProcessor Class

The AudioProcessor class provides functionality to record audio, process it, and interact with the ChatBot.

from wyn_voice.chat import ChatBot, AudioProcessor

# Initialize the ChatBot with your OpenAI API key
api_key = 'your-openai-api-key'
chatbot = ChatBot(api_key)

# Initialize the AudioProcessor with the ChatBot
audio_processor = AudioProcessor(chatbot)

# Record audio and generate a response
transcript = audio_processor.process_audio_and_generate_response()
print("Transcript:", transcript)

# Record audio and get the transcribed text
text = audio_processor.voice_to_text()
print("Transcribed Text:", text)

# Convert text to speech and save it as an mp3 file
response_text = "This is a test response."
output_file = audio_processor.text_to_voice(response_text)
print("Saved audio response to:", output_file)

# Play the saved audio file
audio_processor.play_audio(output_file)
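
The individual calls above can be chained into one listen, respond, speak cycle. The sketch below is one possible way to do that, not the library's own implementation: `voice_round_trip` is a hypothetical helper that takes the three steps as plain callables, so the flow can be tried with stand-ins when no microphone or API key is available. The return values (text, text, file path) are assumed from the example above.

```python
def voice_round_trip(listen, respond, speak):
    """Run one listen -> respond -> speak cycle and return all three results."""
    heard = listen()            # e.g. audio_processor.voice_to_text
    reply = respond(heard)      # e.g. chatbot.generate_response
    audio_path = speak(reply)   # e.g. audio_processor.text_to_voice
    return heard, reply, audio_path


# Stand-ins so the flow can be exercised without a microphone or API key.
def fake_listen():
    return "Hello, how are you?"


def fake_respond(prompt):
    return f"You said: {prompt}"


def fake_speak(text):
    return "response.mp3"


print(voice_round_trip(fake_listen, fake_respond, fake_speak))
```

With the real objects, the call would presumably look like `voice_round_trip(audio_processor.voice_to_text, chatbot.generate_response, audio_processor.text_to_voice)`.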

Using the ChatEnvironment Class

The ChatEnvironment class creates a conversation environment in which you interact with the ChatBot through voice commands.

from wyn_voice.chat import ChatBot, AudioProcessor, ChatEnvironment
# This example reads the key from Google Colab's secrets; outside Colab, load it another way
from google.colab import userdata
OPENAI_API_KEY = userdata.get('OPENAI_API_KEY')

# Create instances of ChatBot and AudioProcessor
chatbot = ChatBot(
    api_key=OPENAI_API_KEY,
    # Adjacent string literals are concatenated, so each one needs a trailing space
    protocol="You are a live translator. "
    "When you hear Chinese, speak English. "
    "When you hear English, speak Chinese.")
audio_processor = AudioProcessor(chatbot)

# Create an instance of ChatEnvironment
chat_env = ChatEnvironment(chatbot, audio_processor)

# Start the chat loop
chat_env.start_chat(exit_command="Exit the program")
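
To illustrate what a voice chat loop with an exit command involves, here is a minimal sketch. It is not how ChatEnvironment is implemented; `normalize` and `run_chat_loop` are hypothetical helpers, and the case- and punctuation-insensitive matching is an assumption made because transcribed speech often comes back with varying capitalization and a trailing period.

```python
import string


def normalize(text):
    """Lowercase the text, drop ASCII punctuation, and collapse whitespace."""
    stripped = text.translate(str.maketrans("", "", string.punctuation))
    return " ".join(stripped.lower().split())


def run_chat_loop(listen, respond, exit_command="Exit the program", max_turns=100):
    """Answer each utterance until the exit command is heard; return the replies."""
    replies = []
    for _ in range(max_turns):
        heard = listen()
        if normalize(heard) == normalize(exit_command):
            break
        replies.append(respond(heard))
    return replies


# Stand-in inputs: two utterances, the second being the exit command.
utterances = iter(["Hello", "exit the program."])
print(run_chat_loop(lambda: next(utterances), lambda text: text.upper()))
```

The `max_turns` cap is a safety net so that the loop cannot run forever if the exit phrase is never recognized.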

Author

Yiqiao Yin

Site

https://www.y-yin.io/

Download files

Source Distribution

wyn_voice-0.2.0.tar.gz (5.0 kB)

Uploaded Source

Built Distribution

wyn_voice-0.2.0-py3-none-any.whl (5.4 kB)

Uploaded Python 3

File details

Details for the file wyn_voice-0.2.0.tar.gz.

File metadata

  • Download URL: wyn_voice-0.2.0.tar.gz
  • Upload date:
  • Size: 5.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.3 CPython/3.9.13 Windows/10

File hashes

Hashes for wyn_voice-0.2.0.tar.gz

Algorithm    Hash digest
SHA256       b6ba67ea4498e2016d9ff35560ec04d8a5370c4274165765887f424dca6d597f
MD5          5160cc3c38e19c267f4c7de7fc430376
BLAKE2b-256  aaf633799d06b450ff44497a984a7e72876d548fb0d9dbc19f1208624fe9afc5
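
A downloaded archive can be checked against the published SHA256 digest with the standard library. The sketch below is illustrative only: the helper names are hypothetical, and the file path in the commented usage line assumes the archive was saved to the current directory.

```python
import hashlib


def sha256_of_file(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected_hex):
    """Compare a file's SHA256 digest with a published value."""
    return sha256_of_file(path) == expected_hex.strip().lower()


# Published SHA256 for wyn_voice-0.2.0.tar.gz, copied from the hash listing above.
EXPECTED_SDIST_SHA256 = "b6ba67ea4498e2016d9ff35560ec04d8a5370c4274165765887f424dca6d597f"

# Example (assumes the archive was downloaded to the current directory):
# print(matches_expected("wyn_voice-0.2.0.tar.gz", EXPECTED_SDIST_SHA256))
```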

File details

Details for the file wyn_voice-0.2.0-py3-none-any.whl.

File metadata

  • Download URL: wyn_voice-0.2.0-py3-none-any.whl
  • Upload date:
  • Size: 5.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.3 CPython/3.9.13 Windows/10

File hashes

Hashes for wyn_voice-0.2.0-py3-none-any.whl

Algorithm    Hash digest
SHA256       188b4140257c10cebeb11c20c306e7235bf3090e958afda00ed613e05ef1ff1c
MD5          237631e8e6377a937dc93add23888d1b
BLAKE2b-256  c20862144383b7f53194d40b2846402c8c15b7d113e9d05f009859ad59a23d18
