Skip to main content

senselab is a Python package that simplifies building pipelines for speech and voice analysis.

Project description

Build codecov Ruff

PyPI Python Version License

pages Ask DeepWiki

senselab

This Python package streamlines, optimizes, and enforces best open-science practices for processing and analyzing behavioral data (primarily voice and speech, but also text and video) using robust reproducible pipelines and utilities.

Quick start

from senselab.audio.data_structures import Audio
from senselab.audio.tasks.preprocessing import resample_audios
from senselab.audio.tasks.features_extraction import extract_features_from_audios
from senselab.audio.tasks.speech_to_text import transcribe_audios

audio = Audio(filepath='path_to_audio_file.wav')
print(audio.sampling_rate)
# ➡️ 44100

[resampled_audio] = resample_audios([audio], resample_rate=16000)
print(resampled_audio.sampling_rate)
# ➡️ 16000

audio_features = extract_features_from_audios([audio])
print(audio_features[0].keys())
# ➡️ dict_keys(['opensmile', 'praat_parselmouth', 'torchaudio', 'torchaudio_squim', ...])

transcript = transcribe_audios([audio])
print(transcript)
# ➡️ "The quick brown fox jumps over the lazy dog."

For more detailed information, check out our Documentation and our Tutorials.

💡 Tip: Many tutorials include Google Colab badges and you can try them instantly without installing anything on your local machine.

Why should you use senselab?

  • Modular design: Easily integrate or use standalone transformations for flexible data manipulation.
  • Pre-built pipelines: Access pre-configured pipelines to reduce setup time and effort.
  • Reproducibility: Ensure consistent and verifiable results with fixed seeds and version-controlled steps.
  • Easy integration: Seamlessly fit into existing workflows with minimal configuration.
  • Extensible: Modify and contribute custom transformations and pipelines to meet specific research needs.
  • Comprehensive documentation: Detailed guides, examples, and documentation for all features and modules.
  • Performance optimized: Efficiently process large datasets with optimized code and algorithms.
  • Interactive examples: Jupyter notebooks provide practical examples for deriving insights from real-world datasets.
  • senselab AI: Interact with your data through an AI-based chatbot. The AI agent generates and runs senselab-based code for you, making exploration easier and giving you both the results and the code used to produce them (perfect for quick experiments or for users who prefer not to code).

⚠️ System Requirements

  1. If on macOS, this package requires an ARM64 architecture due to PyTorch 2.2.2+ dropping support for x86-64 on macOS.

    ❌ Unsupported systems include:

    • macOS (Intel x86-64)
    • Other platforms where dependencies are unavailable

    To check your system compatibility, please run this command:

    python -c "import platform; print(platform.machine())"
    

    If the output is:

    • arm64 → ✅ Your system is compatible.
    • x86_64 → ❌ Your system is not supported.

    If you attempt to install this package on an unsupported system, the installation or execution will fail.

  2. FFmpeg is required by some audio and video dependencies (e.g., torchaudio). Please make sure you have FFmpeg properly installed on your machine before installing and using senselab (see here for detailed platform-dependent instructions).

  3. CUDA libraries matching the CUDA version expected by the PyTorch wheels (e.g., the latest pytorch 2.8 expects cuda-12.8). To install those with conda, please do:

  • conda config --add channels nvidia

  • conda install -y nvidia/label/cuda-12.8.1::cuda-libraries-dev

    Hosts with newer system CUDA (e.g., CUDA 12.9): the subprocess-venv backends (nemo-canary-qwen, nemo, qwen-asr) auto-detect the host's CUDA version via nvidia-smi and route their torch/torchaudio installs through the matching PyTorch wheel index (cu128 / cu126 / cu124 / cu121 / cpu). No manual configuration needed.

    Internal mirrors / unsupported CUDA / CPU fallback: set the SENSELAB_TORCH_INDEX_URL environment variable to override the chosen index. Common values:

    # Force CPU wheels (e.g. testing CPU path on a GPU host, or unsupported CUDA)
    export SENSELAB_TORCH_INDEX_URL=https://download.pytorch.org/whl/cpu
    
    # Internal PyPI mirror that proxies PyTorch wheels
    export SENSELAB_TORCH_INDEX_URL=https://pypi.internal.example.com/pytorch/cu128
    

    When no compatible torch+torchaudio binary pair exists for your host (rare; happens in the days after a CUDA major release), installation fails with a named SenselabCudaCompatibilityError that lists the detected host CUDA, the attempted index URL, and the recommended action — no opaque stack traces from inside torchaudio.

  1. Docker is required and must be running for some video models (e.g., MediaPipe-based estimators). Please follow the official installation instructions for your platform: Install Docker.
  2. Some functionalities rely on HuggingFace models, and increasingly, models require authentication and signed license agreements. Instructions on how to generate a Hugging Face access token can be found here: https://huggingface.co/docs/hub/security-tokens
  • You can provide your HuggingFace token either by exporting it in your shell:
    export HF_TOKEN=your_token_here
    
  • or by adding it to your .env file (see .env.example for reference).

Installation

Install this package via:

pip install 'senselab[all]'

Or get the newest development version via:

pip install 'git+https://github.com/sensein/senselab.git#egg=senselab[all]'

If you want to install only audio dependencies, you do:

pip install 'senselab'

To install articulatory, video, text, and senselab-ai extras, please do:

pip install 'senselab[articulatory,video,text,senselab-ai]'

senselab AI (our AI-based chatbot)

Development (with uv)

uv sync --extra senselab-ai
uv run senselab-ai

Production (with pip)

pip install 'senselab[senselab-ai]'
senselab-ai

Once started, you can open the provided JupyterLab interface, setup the agent and chat with it, and let it create and execute code for you.

Example of how senselab-ai works

For a walkthrough, see: tutorials/senselab-ai/senselab_ai_intro.ipynb.


Contributing

We welcome contributions from the community! Before proceeding with that, please review our CONTRIBUTING.md.


Funding

senselab is mostly supported by the following organizations and initiatives:

  • McGovern Institute ICON Fellowship
  • NIH Bridge2AI Precision Public Health (OT2OD032720)
  • Child Mind Institute
  • ReadNet Project
  • Chris and Lann Woehrle Psychiatric Fund

Acknowledgments

senselab builds on the work of many open-source projects. We gratefully acknowledge the developers and maintainers of the following key dependencies:

We are thankful to the open-source community for enabling this project! 🙏

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

senselab-1.3.1a31.tar.gz (34.8 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

senselab-1.3.1a31-py3-none-any.whl (445.2 kB view details)

Uploaded Python 3

File details

Details for the file senselab-1.3.1a31.tar.gz.

File metadata

  • Download URL: senselab-1.3.1a31.tar.gz
  • Upload date:
  • Size: 34.8 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.11.14 {"installer":{"name":"uv","version":"0.11.14","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for senselab-1.3.1a31.tar.gz
Algorithm Hash digest
SHA256 d99c6d2c7232924e121f76570f7223e0bb81a4d47f59f92b6e6dd7acd6e75f56
MD5 cdbe41055272ad9426c64e229b503e67
BLAKE2b-256 58b502e5706e43b3410c170fe5f904938067d52600382e60a283c68efb99aaad

See more details on using hashes here.

File details

Details for the file senselab-1.3.1a31-py3-none-any.whl.

File metadata

  • Download URL: senselab-1.3.1a31-py3-none-any.whl
  • Upload date:
  • Size: 445.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.11.14 {"installer":{"name":"uv","version":"0.11.14","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for senselab-1.3.1a31-py3-none-any.whl
Algorithm Hash digest
SHA256 fcaa55288098c9bced82d8a0b4475dc770c899a5fba71fef8fd0db09acf83278
MD5 10c927a1bb34d4510a1835330f1ed38c
BLAKE2b-256 7abb4fb1547787f09588785d4ea0ba6ed042ba43e5a8d4f82236b9c40afea29b

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page