Skip to main content

Whisper for your microphone

Project description

Whisper Mic

This repo is based on the work done here by OpenAI. This repo allows you use use a mic as demo. This repo copies some of the README from original project.

Video Tutorial

See the video tutorial for this repo here

Professional Assistance

If are in need of paid professional help, that is available through this email

Setup

  1. Create a venv of your choice.
  2. Run pip install -r requirements.txt

Available models and languages

There are five model sizes, four with English-only versions, offering speed and accuracy tradeoffs. Below are the names of the available models and their approximate memory requirements and relative speed.

Size Parameters English-only model Multilingual model Required VRAM Relative speed
tiny 39 M tiny.en tiny ~1 GB ~32x
base 74 M base.en base ~1 GB ~16x
small 244 M small.en small ~2 GB ~6x
medium 769 M medium.en medium ~5 GB ~2x
large 1550 M N/A large ~10 GB 1x

For English-only applications, the .en models tend to perform better, especially for the tiny.en and base.en models. We observed that the difference becomes less significant for the small.en and medium.en models.

Microphone Demo

You can use the model with a microphone using the mic.py program. Use -h to see flag options.

Some of the more important flags are the --model and --english flags.

Troubleshooting

If you are having issues with the mic.py not running try the following:

sudo apt install portaudio19-dev python3-pyaudio

License

The code and the model weights of Whisper are released under the MIT License. See their repo for more information.

The code under this repo is under the AGPL license. See LICENSE for further details.

Thanks

Until recently, access to high performing speech to text models was only available through paid serviecs. With this release, I am excited for the many applications that will come.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

whisper_mic-0.0.1.tar.gz (4.1 kB view details)

Uploaded Source

Built Distribution

whisper_mic-0.0.1-py3-none-any.whl (4.4 kB view details)

Uploaded Python 3

File details

Details for the file whisper_mic-0.0.1.tar.gz.

File metadata

  • Download URL: whisper_mic-0.0.1.tar.gz
  • Upload date:
  • Size: 4.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.16

File hashes

Hashes for whisper_mic-0.0.1.tar.gz
Algorithm Hash digest
SHA256 934d3e73abf1b3217d8e78da5e6f26ea1593e29ea16e0cd7a2d1d0e2b0a0a3f0
MD5 3490f859bce8c6fb423eb880b83cc109
BLAKE2b-256 41cdbab1e9b685332510392c98ea4a04c1321e728cf56fdf739774e669180a60

See more details on using hashes here.

File details

Details for the file whisper_mic-0.0.1-py3-none-any.whl.

File metadata

  • Download URL: whisper_mic-0.0.1-py3-none-any.whl
  • Upload date:
  • Size: 4.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.16

File hashes

Hashes for whisper_mic-0.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 cfc998552e72b83e19f3acc2e5e62dfff8db3131ed4d150dc8de0d39489c17f1
MD5 1422b336da19daa39eccffff751b6d4d
BLAKE2b-256 dcceb0e076d84e0b7d8164320448d466be555bc9decd7d58c773baef93197255

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page