A simple speech-to-text application using Wit.ai

bhashan2pathtak

Simple Python application to convert speech into text using Wit.ai

Prerequisites for macOS

  • brew install portaudio
  • brew install flac

Features

  • Continuous speech recognition
  • Improved accuracy using Wit.ai
  • Ambient noise adjustment
  • Error handling and user-friendly feedback
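
The features above can be sketched roughly as follows. This is a minimal illustration, not the package's actual source: it assumes the SpeechRecognition library (imported as speech_recognition), which this README does not name, and the function name listen_and_transcribe is hypothetical.

```python
def listen_and_transcribe(token: str) -> None:
    """Print a transcript for each phrase heard until interrupted with Ctrl+C."""
    # Assumption: the SpeechRecognition package provides the audio and Wit.ai
    # plumbing. It is imported here so this module loads without the audio stack.
    import speech_recognition as sr

    recognizer = sr.Recognizer()
    with sr.Microphone() as source:  # microphone access requires PyAudio
        # Ambient noise adjustment: sample background sound for one second
        # so the recognizer can calibrate its energy threshold.
        recognizer.adjust_for_ambient_noise(source, duration=1)
        print("Listening. Press Ctrl+C to stop.")

        # Continuous recognition: keep capturing phrases until interrupted.
        while True:
            try:
                audio = recognizer.listen(source)
                print(recognizer.recognize_wit(audio, key=token))
            except sr.UnknownValueError:
                # The speech could not be understood.
                print("Could not understand that, please try again.")
            except sr.RequestError as exc:
                # Network failure or a rejected token.
                print(f"Wit.ai request failed: {exc}")
            except KeyboardInterrupt:
                print("Stopped.")
                break
```

Calling listen_and_transcribe(os.environ["WIT_AI_TOKEN"]) would start the loop once a token is configured as described under Setup.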

Setup

Before using bhashan2pathtak, you need to set up a Wit.ai account and obtain a token:

  1. Go to https://wit.ai/ and create an account if you haven't already.
  2. Create a new Wit.ai app and copy your Client Access Token.
  3. Set your token using one of these methods:
    a. Set an environment variable:
    export WIT_AI_TOKEN=your_token_here
    
    b. Create a config.json file in the directory where you'll run the application, with the following content:
    {
        "WIT_AI_TOKEN": "your_token_here"
    }
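
As an illustration of how the two methods above could be read at startup, here is a minimal sketch. The function name load_wit_token is hypothetical, and checking the environment variable before config.json is an assumption; the package's real lookup order may differ.

```python
import json
import os
from pathlib import Path
from typing import Optional


def load_wit_token(config_path: str = "config.json") -> Optional[str]:
    """Return the Wit.ai token from the environment, else from config.json."""
    # Assumption: the environment variable takes precedence over the file.
    token = os.environ.get("WIT_AI_TOKEN")
    if token:
        return token

    # Fall back to a config.json file (by default in the current directory).
    path = Path(config_path)
    if path.is_file():
        with path.open(encoding="utf-8") as fh:
            config = json.load(fh)
        return config.get("WIT_AI_TOKEN") or None

    # Neither source provided a token.
    return None
```

A caller can treat a None result as "token not configured" and print a setup hint instead of contacting Wit.ai.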
    

Installation

To use bhashan2pathtak as a package:

  1. Install the package:

    pip install bhashan2pathtak
    
  2. Run the application:

    bhashan2pathtak
    

Note: You still need to configure your Wit.ai token as described in the Setup section.

Development Setup

  1. Clone this repository
  2. Install the required packages:
    pip install -r requirements.txt
    
    Note: This project uses PyAudio 0.2.14. If you run into installation problems, try installing this exact version.
  3. Sign up for a Wit.ai account and create a new app to get an access token

Development Configuration

To run this application, you need to provide your Wit.ai token. You have two options:

  1. Environment Variable: Set the WIT_AI_TOKEN environment variable:

    export WIT_AI_TOKEN=your_wit_ai_token_here
    
  2. Configuration File: Copy config.json.example to config.json and add your Wit.ai token:

    cp config.json.example config.json
    

    Then edit config.json and replace your_wit_ai_token_here with your actual token.

Important: Never commit your config.json file to the repository. It's listed in .gitignore to prevent accidental commits.

How to run

After setting up the configuration:

python3 speech_to_text.py

Troubleshooting

If you encounter any issues with PyAudio, make sure you have version 0.2.14 installed:

pip install PyAudio==0.2.14
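
To confirm which PyAudio version is actually installed in the current environment, a small check like the following may help. It is only a sketch using the standard library's importlib.metadata; the helper name installed_version is hypothetical.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(package: str) -> Optional[str]:
    """Return the installed version of a package, or None if it is missing."""
    try:
        return version(package)
    except PackageNotFoundError:
        return None


expected = "0.2.14"  # the version this README says the project uses
found = installed_version("PyAudio")
if found is None:
    print("PyAudio is not installed.")
elif found != expected:
    print(f"PyAudio {found} is installed; this project uses {expected}.")
else:
    print(f"PyAudio {expected} is installed.")
```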

For any other issues, please check the Wit.ai documentation or open an issue in this repository.
