Skip to main content

A Python library for speech-to-text integration using Selenium WebDriver.

Project description

WebSpeechRecognition Library

Introduction

WebSpeechRecognition is a Python library that helps turn speech into text in real time. It uses the Selenium WebDriver and the HTML5 Web Speech API. This tool is built to work with Chrome WebDriver and is easy to use for speech recognition tasks.

Installation

To install WebSpeechRecognition, you can use pip:

pip install WebSpeechRecognition

Features

  • Works in Many Languages: You can set the language you want using simple codes like "en-US" for English.
  • Change Language Easily: You can pick a language at any time while using it.
  • Simple Functions: It has easy-to-understand methods to start and stop speech recognition.

Usage

Initialization

To start using WebSpeechRecognition, initialize the SpeechRecognition class with the path to your ChromeDriver executable and the language code:

from WebSpeechRecognition import SpeechRecognition

recognizer = SpeechRecognition("path/to/chromedriver", language="en-US")
recognizer.Init()

Speech Recognition

To start listening and transcribing speech, use the Listen method. This method returns the transcribed text:

text = recognizer.Listen(print_allowed=True)
print(f"You said: {text}")

Close the Driver

Once you are done, use the Quit method to close the ChromeDriver instance:

recognizer.Quit()

Examples

Here is a complete example:

from WebSpeechRecognition import SpeechRecognition

recognizer = SpeechRecognition("path/to/chromedriver", language="en-US")
recognizer.Init()

while True:
    try:
        text = recognizer.Listen(print_allowed=True)
        print(f"You said: {text}")
    except KeyboardInterrupt:
        recognizer.Quit()
        break

Parameters

SpeechRecognition(driver_path, language)

  • driver_path: The path to the Chrome WebDriver file.
  • language: The language code for recognition (default: "en-US").

Init()

Sets up the WebDriver with options for speech recognition.

Listen(print_allowed=False)

Starts listening and returns the text it hears.

  • print_allowed: If True, prints recognized text to the console.

Quit()

Stops the WebDriver and closes everything.

Contributing

We welcome contributions! If you find a bug or have an idea for a new feature, feel free to open an issue or a pull request on our GitHub repository.

Want to help improve SpeechRecognition? Here's how:

  1. Fork the repository.
  2. Make a new branch (git checkout -b feature-branch).
  3. Add your changes (git commit -m 'Add feature').
  4. Push your branch (git push origin feature-branch).
  5. Open a pull request for review.

License

This project uses the MIT License. See the LICENSE file for details.

Contact

Have questions or suggestions? Contact us:

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

WebSpeechRecognition-0.1.4.tar.gz (4.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

WebSpeechRecognition-0.1.4-py3-none-any.whl (5.8 kB view details)

Uploaded Python 3

File details

Details for the file WebSpeechRecognition-0.1.4.tar.gz.

File metadata

  • Download URL: WebSpeechRecognition-0.1.4.tar.gz
  • Upload date:
  • Size: 4.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.0.1 CPython/3.8.0

File hashes

Hashes for WebSpeechRecognition-0.1.4.tar.gz
Algorithm Hash digest
SHA256 3d94b2cb0b086482ac5f088ef0cbb2bde0e21a6ee8b1800699d31c7ad311bef5
MD5 66321696812856ebb9d3607dbf5cc540
BLAKE2b-256 d4c3295d46683cd169df97eb461541b3aa897cc3210b4f2c5d4fe00bb4fc3511

See more details on using hashes here.

File details

Details for the file WebSpeechRecognition-0.1.4-py3-none-any.whl.

File metadata

File hashes

Hashes for WebSpeechRecognition-0.1.4-py3-none-any.whl
Algorithm Hash digest
SHA256 7f39b01d3c435fdcdb13711405ee3fb5371749d56d5b2ca55a92c6da8d41c126
MD5 43530af06fb3abcec51cbdeea70778c9
BLAKE2b-256 780294186fa083f70a51e32350ce3a3f3d272e6d85b725866a80461eded33b25

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page