Skip to main content

This library provides audio interface with OpenAI endpoint.

Project description

Wyn Voice: A Conversational AI and Audio Processing Library

Thumbnail Image

Introduction and Motivation

🎙️ WYN-Voice is a Python library designed to simplify the process of creating conversational AI applications that leverage OpenAI's GPT models. 🤖 The library provides an easy-to-use interface for generating responses to user inputs and includes functionality for recording and processing audio, 🎧 making it suitable for building interactive voice-based applications. 🗣️

Directory Structure

The project directory is organized as follows:

.
├── pyproject.toml
├── README.md
└── wyn_voice
    └── chat.py
  • pyproject.toml: Contains the project's dependencies and other configuration settings.
  • README.md: This file, providing an overview and usage instructions.
  • wyn_voice: A folder containing the main library code.
    • chat.py: The script defining the ChatBot and AudioProcessor classes.

Example Usage

To get started with Wyn Voice, follow these steps:

Installation

First, install the necessary packages using pip:

pip install wyn-voice pyautogen pydub openai

Using the ChatBot Class

The ChatBot class allows you to interact with OpenAI's GPT models to generate responses based on user input.

from wyn_voice.chat import ChatBot

# Initialize the ChatBot with your OpenAI API key
api_key = 'your-openai-api-key'
chatbot = ChatBot(api_key)

# Generate a response from the chatbot
prompt = "Hello, how are you?"
response = chatbot.generate_response(prompt)
print("ChatBot:", response)

# Retrieve the conversation history
history = chatbot.get_history()
print("Conversation History:", history)

Using the AudioProcessor Class

The AudioProcessor class provides functionality to record audio, process it, and interact with the ChatBot.

from wyn_voice.chat import ChatBot, AudioProcessor

# Initialize the ChatBot with your OpenAI API key
api_key = 'your-openai-api-key'
chatbot = ChatBot(api_key)

# Initialize the AudioProcessor with the ChatBot
audio_processor = AudioProcessor(chatbot)

# Record audio and generate a response
transcript = audio_processor.process_audio_and_generate_response()
print("Transcript:", transcript)

# Record audio and get the transcribed text
text = audio_processor.voice_to_text()
print("Transcribed Text:", text)

# Convert text to speech and save it as an mp3 file
response_text = "This is a test response."
output_file = audio_processor.text_to_voice(response_text)
print("Saved audio response to:", output_file)

# Play the saved audio file
audio_processor.play_audio(output_file)

Using the ChatEnvironment Class

The ChatEnvironment class allows you to create a conversation environment to interact with ChatBot using voice command.

from wyn_voice.chat import ChatBot, AudioProcessor, ChatEnvironment
from google.colab import userdata
OPENAI_API_KEY = userdata.get('OPENAI_API_KEY')

# Create instances of ChatBot and AudioProcessor
chatbot = ChatBot(
    api_key=OPENAI_API_KEY,
    protocol="You are a live translator."
    "When you hear Chinese, speak English."
    "When you hear English, speak Chinese.")
audio_processor = AudioProcessor(chatbot)

# Create an instance of ChatEnvironment
chat_env = ChatEnvironment(chatbot, audio_processor)

# Start the chat loop
chat_env.start_chat(exit_command="Exit the program")

Author

Yiqiao Yin

Site

https://www.y-yin.io/

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

wyn_voice-0.2.1.tar.gz (5.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

wyn_voice-0.2.1-py3-none-any.whl (5.4 kB view details)

Uploaded Python 3

File details

Details for the file wyn_voice-0.2.1.tar.gz.

File metadata

  • Download URL: wyn_voice-0.2.1.tar.gz
  • Upload date:
  • Size: 5.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.3 CPython/3.9.13 Windows/10

File hashes

Hashes for wyn_voice-0.2.1.tar.gz
Algorithm Hash digest
SHA256 2ce0639a62a8c9342b10cd5610712933bb5569e08de969c81df55b62fde625ec
MD5 a0fe1d36b1bf992b2e54fdb2100cdd19
BLAKE2b-256 389593e944680aea3f53c0702e2e5c439291aaa3e3d485613fba19d2f1df354c

See more details on using hashes here.

File details

Details for the file wyn_voice-0.2.1-py3-none-any.whl.

File metadata

  • Download URL: wyn_voice-0.2.1-py3-none-any.whl
  • Upload date:
  • Size: 5.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.3 CPython/3.9.13 Windows/10

File hashes

Hashes for wyn_voice-0.2.1-py3-none-any.whl
Algorithm Hash digest
SHA256 90980b747e8f93b99f0fb42c3add51fad3d615de96b4b790b084b08b54e8a436
MD5 7972512286bd87f03243bde862c81172
BLAKE2b-256 f8a144664a56d9da89478734d5763b9d169f885a62c454a19713ee2c3815c1bd

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page