Skip to main content

Voice commands for JupyterLab

Project description

jupyterlab-voice-control

Github Actions Status Binder PyPI version

⚠️ This extension is experimental ⚠️

Voice commands for JupyterLab extension relies on experimental Speech Recognition API and is not suitable for use in production environments. You can check the current support on caniuse.com/speech-recognition.

Please be aware that the Speech Recognition method is an implementation detail of the browser, and some browsers, including Google Chrome, may send recordings of your voice to cloud for recognition.

Text edition capabilities are severely limited (e.g. it is only possible to delete words typed using the voice commands), but many issues are expected to be resolved with upgrade to CodeMirror 6, anticipated for JupyterLab 4.

While it is currently just proof of concept, please do get in touch by opening a new issue if you found this extension useful/promising and would like to leave any feedback which would be greatly appreciated.

Usage

To use this extension look for a microphone (microphone icon) icon on the status bar.

Initial icon location in the status bar

Press the disabled microphone icon (disabled microphone icon) to start voice recognition (or use Alt + v shortcut). You will need to agree for the browser to use your microphone:

Microphone access dialog in Chrome

You can use any of the commands available in the command palette, for example try "Download" or "Run selected cells".

Sometimes the speech recognition may be inaccurate, or you may misremember the name of the command. In those cases the voice control assistant will prompt you with suggestions with words "Did you mean..." and one or more suggestions on the status bar:

Suggestion saying "Did you mean New Launcher?"

  • You can accept the first suggestion by saying "yes".
  • To choose the second suggestion say "second suggestion", and so on.
  • Speech synthesis option is available (in the settings) if you prefer it over checking the status bar continuously.
  • The threshold for inclusion of a similar-sounding phrase as a suggestion can be adjusted in the settings.

If there are no errors or suggestions, the recognised speech will be shown on the status bar together with a confidence estimate:

Recognised phrase "close all tabs" with 87% confidence

By default commands recognised with low confidence (<50%) will not be executed (as it is deemed safer in case of any potentially destructive actions), but you can change the threshold in the settings.

Creating custom commands

In the settings editor you can assign custom phrases to commands in the same way as you can configure shortcuts. We refer to the words you need to say in order to execute a command as trigger phrase.

The voice control extension provides additional commands for interaction with the interface such as typing text (vc:insert-text), accepting suggestions (vc:accept-suggestion), stopping recognition (vc:stop-listening), etc.

The trigger phrase can be a regular expression and include named capturing groups. The captured groups will be passed as arguments to the command (/merged with pre-specified arguments if any, with groups overriding pre-specified arguments in case of a collision). For example, to create a command for opening files we could specify:

{
  "phrase": "^open file (?<path>.*)",
  "command": "filebrowser:open-path"
}

^ ensures that only phrases starting with "open file" will be matched.

We provide command vc:open-notebook (by default under open notebook (?<path>.*)) which automatically appends notebook file extension (.ipynb) as it can be problematic for speech recognition software.

Requirements

  • JupyterLab >= 3.4 (may work with older versions of 3.x, but not tested)

Install

To install the extension, execute:

pip install jupyterlab-voice-control

Uninstall

To remove the extension, execute:

pip uninstall jupyterlab-voice-control

Contributing

Development install

Note: You will need NodeJS to build the extension package.

The jlpm command is JupyterLab's pinned version of yarn that is installed with JupyterLab. You may use yarn or npm in lieu of jlpm below.

# Clone the repo to your local environment
# Change directory to the jupyterlab-voice-control directory
# Install package in development mode
pip install -e .
# Link your development version of the extension with JupyterLab
jupyter labextension develop . --overwrite
# Rebuild extension Typescript source after making changes
jlpm build

You can watch the source directory and run JupyterLab at the same time in different terminals to watch for changes in the extension's source and automatically rebuild the extension.

# Watch the source directory in one terminal, automatically rebuilding when needed
jlpm watch
# Run JupyterLab in another terminal
jupyter lab

With the watch command running, every saved change will immediately be built locally and available in your running JupyterLab. Refresh JupyterLab to load the change in your browser (you may need to wait several seconds for the extension to be rebuilt).

By default, the jlpm build command generates the source maps for this extension to make it easier to debug using the browser dev tools. To also generate source maps for the JupyterLab core extensions, you can run the following command:

jupyter lab build --minimize=False

Development uninstall

pip uninstall jupyterlab-voice-control

In development mode, you will also need to remove the symlink created by jupyter labextension develop command. To find its location, you can run jupyter labextension list to figure out where the labextensions folder is located. Then you can remove the symlink named jupyterlab-voice-control within that folder.

Testing the extension

Frontend tests

This extension is using Jest for JavaScript code testing.

To execute them, execute:

jlpm
jlpm test

Integration tests

This extension uses Playwright for the integration tests (aka user level tests). More precisely, the JupyterLab helper Galata is used to handle testing the extension in JupyterLab.

More information are provided within the ui-tests README.

Packaging the extension

See RELEASE

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

jupyterlab-voice-control-0.2.0.tar.gz (344.8 kB view details)

Uploaded Source

Built Distribution

jupyterlab_voice_control-0.2.0-py3-none-any.whl (453.8 kB view details)

Uploaded Python 3

File details

Details for the file jupyterlab-voice-control-0.2.0.tar.gz.

File metadata

File hashes

Hashes for jupyterlab-voice-control-0.2.0.tar.gz
Algorithm Hash digest
SHA256 5aa296534153b6cb382015fdf093389206ce01e70a6356c5ac79814bdc0ead75
MD5 f5dbc1f4c584ae25303191fe9e259bcc
BLAKE2b-256 76050f92127b8eacb9e2d9546433ffa5437dd8dbd2ab558e54d17d435bc5fbe4

See more details on using hashes here.

File details

Details for the file jupyterlab_voice_control-0.2.0-py3-none-any.whl.

File metadata

File hashes

Hashes for jupyterlab_voice_control-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 6c8bf8982f1e9bbea4784b9875c4466ee8562d36e49f978b991f0bba697d0a3d
MD5 ac4e5b5d5579d449f15475bfb24ebac8
BLAKE2b-256 e5137b7be9bd9be9f603e81c92c1f031e0b7e65a2f094fc3cda37bc6c4374695

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page