A simple speech-to-text application using Wit.ai
Project description
bhashan2pathtak
Simple Python application to convert speech into text using Wit.ai
Prerequisites for macOS
brew install portaudio
brew install flac
Features
- Continuous speech recognition
- Improved accuracy using Wit.ai
- Ambient noise adjustment
- Error handling and user-friendly feedback
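The features above could be sketched roughly as follows. This is not the project's actual source: it assumes the third-party SpeechRecognition package (imported as `speech_recognition`), which this README does not name, and the function name `listen_forever` and the printed messages are hypothetical.

```python
def listen_forever(token, calibration_seconds=1.0):
    """Transcribe microphone speech with Wit.ai until interrupted (Ctrl+C).

    Assumption: the SpeechRecognition package and PyAudio are installed.
    """
    # Imported lazily so the sketch can be loaded without the library present.
    import speech_recognition as sr

    recognizer = sr.Recognizer()
    with sr.Microphone() as source:
        # Ambient noise adjustment: calibrate the energy threshold once.
        recognizer.adjust_for_ambient_noise(source, duration=calibration_seconds)
        print("Listening. Press Ctrl+C to stop.")
        while True:  # continuous recognition
            try:
                audio = recognizer.listen(source)
                text = recognizer.recognize_wit(audio, key=token)
                print(f"You said: {text}")
            except sr.UnknownValueError:
                # The audio was captured but could not be transcribed.
                print("Sorry, I could not understand that. Please try again.")
            except sr.RequestError as exc:
                # Network failure, invalid token, or a Wit.ai service error.
                print(f"Could not reach Wit.ai: {exc}")
            except KeyboardInterrupt:
                print("Stopping.")
                break
```

Calling `listen_forever("your_token_here")` would start the loop. Catching the two recognizer exceptions separately lets the loop keep running after a single unintelligible phrase or a temporary network error.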
Setup
Before using bhashan2pathtak, you need to set up a Wit.ai account and obtain a token:
- Go to https://wit.ai/ and create an account if you haven't already.
- Create a new Wit.ai app and copy your Client Access Token.
- Set your token using one of these methods:
a. Set an environment variable:
export WIT_AI_TOKEN=your_token_here
b. Create a config.json file in the directory where you'll run the application, with the following content:
{ "WIT_AI_TOKEN": "your_token_here" }
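As an illustration of how an application might resolve the token from these two sources, here is a minimal sketch. The function name `load_wit_token` and the lookup order (environment variable first, then config.json) are assumptions made for this example, not documented behavior of bhashan2pathtak.

```python
import json
import os
from pathlib import Path


def load_wit_token(config_path="config.json", env=None):
    """Return the Wit.ai token from the environment or from config.json."""
    env = os.environ if env is None else env

    # Assumed precedence: the environment variable wins over the file.
    token = env.get("WIT_AI_TOKEN")
    if token:
        return token

    # Fall back to a config.json file in the given location.
    path = Path(config_path)
    if path.is_file():
        with path.open(encoding="utf-8") as fh:
            token = json.load(fh).get("WIT_AI_TOKEN")
        if token:
            return token

    raise RuntimeError(
        "No Wit.ai token found: set WIT_AI_TOKEN or create config.json"
    )
```

Failing early with a clear message when neither source is present avoids a less obvious authentication error later, at the first request to Wit.ai.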
Installation
To use bhashan2pathtak as a package:
- Install the package:
pip install bhashan2pathtak
- Run the application:
bhashan2pathtak
Note: You still need to configure your Wit.ai token as described in the Setup section.
Development Setup
- Clone this repository
- Install the required packages:
pip install -r requirements.txt
Note: This project uses PyAudio 0.2.14. If you encounter issues with installation, try upgrading to this version.
- Sign up for a Wit.ai account and create a new app to get an access token
Development Configuration
To run this application, you need to provide your Wit.ai token. You have two options:
- Environment Variable: Set the WIT_AI_TOKEN environment variable:
export WIT_AI_TOKEN=your_wit_ai_token_here
- Configuration File: Copy config.json.example to config.json and add your Wit.ai token:
cp config.json.example config.json
Then edit config.json and replace your_wit_ai_token_here with your actual token.
Important: Never commit your config.json file to the repository. It's listed in .gitignore to prevent accidental commits.
How to run
After setting up the configuration:
python3 speech_to_text.py
Troubleshooting
If you encounter any issues with PyAudio, make sure you have version 0.2.14 installed:
pip install PyAudio==0.2.14
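To see which PyAudio version is installed before reinstalling, a small check along these lines may help. It uses only the standard library; the helper names `installed_version` and `pyaudio_status` are hypothetical and not part of this project.

```python
from importlib.metadata import PackageNotFoundError, version

EXPECTED_PYAUDIO = "0.2.14"  # the version this README names


def installed_version(package):
    """Return the installed version string of a package, or None if absent."""
    try:
        return version(package)
    except PackageNotFoundError:
        return None


def pyaudio_status(found, expected=EXPECTED_PYAUDIO):
    """Describe whether the found PyAudio version matches the expected one."""
    if found is None:
        return f"PyAudio is not installed; run: pip install PyAudio=={expected}"
    if found != expected:
        return f"PyAudio {found} found; this project uses {expected}"
    return f"PyAudio {expected} is installed"
```

Running `print(pyaudio_status(installed_version("PyAudio")))` reports the current state without importing PyAudio itself, so it works even when the native PortAudio library is missing.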
For any other issues, please check the Wit.ai documentation or open an issue in this repository.
File details
Details for the file bhashan2pathtak-0.1.0.tar.gz.
File metadata
- Download URL: bhashan2pathtak-0.1.0.tar.gz
- Upload date:
- Size: 19.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.12.4
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 2cc9e46eca6a9d5aee43e2c4e1c58d1785e8c945cb77b4f78a757e5c1a79b374 |
| MD5 | b4e8f8218e9e878c928eb4a89de93b65 |
| BLAKE2b-256 | 10aeca54f8a410a760246ea14037c7bee9762d1ea0a50c06f842db6ddc972bbb |
File details
Details for the file bhashan2pathtak-0.1.0-py3-none-any.whl.
File metadata
- Download URL: bhashan2pathtak-0.1.0-py3-none-any.whl
- Upload date:
- Size: 17.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.12.4
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 9c5019610cbcf0868d4e475aa869d223c34c637f072677dd3deb20638645b402 |
| MD5 | 181a93386f9cbb07573c1fff4e493de0 |
| BLAKE2b-256 | 54e243930cd68528cee888e13de1fd425055fcf9c77dc675836923c0eb3bf2eb |
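A downloaded file can be compared against the SHA256 digests listed for each distribution. The following is a minimal sketch using Python's standard hashlib module; the helper names `sha256_of` and `matches_published_hash` are hypothetical, and the local file path is whatever location the file was saved to.

```python
import hashlib


def sha256_of(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        # Read fixed-size chunks so large files are not loaded into memory at once.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path, expected_hex):
    """Compare a file's SHA256 digest with a published hex digest."""
    return sha256_of(path) == expected_hex.strip().lower()
```

For example, `matches_published_hash("bhashan2pathtak-0.1.0.tar.gz", "2cc9e46eca6a9d5aee43e2c4e1c58d1785e8c945cb77b4f78a757e5c1a79b374")` should return True for an intact copy of the source distribution.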