Skip to main content

Pyannote ONNX

Project description

Speaker Diarization

pyannote-audio is an open-source toolkit written in Python for speaker diarization.

pyannote-onnx is used to convert the pretrained model defined in PyTorch into the ONNX format and then run it with ONNX Runtime (in C++ or Python).

Only Python 3.8+ is supported.

Usage

  1. Download the pretrained model from Hugging Face pyannote/segmentation-3.0.
  2. Export the pretrained model to ONNX model.
  3. Run the ONNX model with ONNX Runtime in C++ or Python.
$ pip install torch onnx https://github.com/pyannote/pyannote-audio/archive/refs/heads/develop.zip
$ python export_onnx.py pytorch_model.bin segmentation-3.0.onnx

$ pip install -r requirements.txt
$ python main.py data/test_16k.wav

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pyannote-onnx-0.0.9.tar.gz (5.5 MB view hashes)

Uploaded Source

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page