A simple speech-to-text application using Wit.ai
Project description
bhashan2pathtak
Simple Python application to convert speech into text using Wit.ai
Prerequisites for MacOS
brew install portaudio
brew install flac
Features
- Continuous speech recognition
- Improved accuracy using Wit.ai
- Ambient noise adjustment
- Error handling and user-friendly feedback
Setup
Before using bhashan2pathtak, you need to set up a Wit.ai account and obtain a token:
- Go to https://wit.ai/ and create an account if you haven't already.
- Create a new Wit.ai app and copy your Client Access Token.
- Set your token using one of these methods:
a. Set an environment variable:
b. Create aexport WIT_AI_TOKEN=your_token_here
config.json
file in the directory where you'll run the application, with the following content:{ "WIT_AI_TOKEN": "your_token_here" }
Installation
To use bhashan2pathtak as a package:
-
Install the package:
pip install bhashan2pathtak
-
Run the application:
bhashan2pathtak
Note: You still need to configure your Wit.ai token as described in the Setup section.
Development Setup
- Clone this repository
- Install the required packages:
Note: This project uses PyAudio 0.2.14. If you encounter issues with installation, try upgrading to this version.pip install -r requirements.txt
- Sign up for a Wit.ai account and create a new app to get an access token
Development Configuration
To run this application, you need to provide your Wit.ai token. You have two options:
-
Environment Variable: Set the
WIT_AI_TOKEN
environment variable:export WIT_AI_TOKEN=your_wit_ai_token_here
-
Configuration File: Copy
config.json.example
toconfig.json
and add your Wit.ai token:cp config.json.example config.json
Then edit
config.json
and replaceyour_wit_ai_token_here
with your actual token.
Important: Never commit your config.json
file to the repository. It's listed in .gitignore
to prevent accidental commits.
How to run
After setting up the configuration:
python3 speech_to_text.py
Troubleshooting
If you encounter any issues with PyAudio, make sure you have version 0.2.14 installed:
pip install PyAudio==0.2.14
For any other issues, please check the Wit.ai documentation or open an issue in this repository.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file bhashan2pathtak-0.1.0.tar.gz
.
File metadata
- Download URL: bhashan2pathtak-0.1.0.tar.gz
- Upload date:
- Size: 19.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.12.4
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 2cc9e46eca6a9d5aee43e2c4e1c58d1785e8c945cb77b4f78a757e5c1a79b374 |
|
MD5 | b4e8f8218e9e878c928eb4a89de93b65 |
|
BLAKE2b-256 | 10aeca54f8a410a760246ea14037c7bee9762d1ea0a50c06f842db6ddc972bbb |
File details
Details for the file bhashan2pathtak-0.1.0-py3-none-any.whl
.
File metadata
- Download URL: bhashan2pathtak-0.1.0-py3-none-any.whl
- Upload date:
- Size: 17.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.12.4
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 9c5019610cbcf0868d4e475aa869d223c34c637f072677dd3deb20638645b402 |
|
MD5 | 181a93386f9cbb07573c1fff4e493de0 |
|
BLAKE2b-256 | 54e243930cd68528cee888e13de1fd425055fcf9c77dc675836923c0eb3bf2eb |