Pytorch implementation of CREPE pitch tracker

These details have not been verified by PyPI

Project links

Homepage

Project description

torchcrepe

Pytorch implementation of the CREPE pitch tracker. The original Tensorflow implementation can be found here. The provided model weights were obtained by converting the "tiny" and "full" models using MMdnn, an open-source model management framework.

Installation

Perform the system-dependent PyTorch install using the instructions found here.

pip install torchcrepe

Usage

Computing pitch and harmonicity from audio

import torchcrepe


# Load audio
audio, sr = torchcrepe.load.audio( ... )

# Place the audio on the device you want CREPE to run on
audio = audio.to( ... )

# Here we'll use a 5 millisecond hop length
hop_length = int(sr / 200.)

# Provide a sensible frequency range for your domain (upper limit is 2006 Hz)
# This would be a reasonable range for speech
fmin = 50
fmax = 550

# Select a model capacity--one of "tiny" or "full"
model = 'tiny'

# Compute pitch and harmonicity
pitch = torchcrepe.predict(audio, sr, hop_length, fmin, fmax, model)

A harmonicity metric similar to the Crepe confidence score can also be extracted by passing return_harmonicity=True to torchcrepe.predict.

By default, torchcrepe uses Viterbi decoding on the softmax of the network output. This is different than the original implementation, which uses a weighted average near the argmax of binary cross-entropy probabilities. The argmax operation can cause double/half frequency errors. These can be removed by penalizing large pitch jumps via Viterbi decoding. The decode submodule provides some options for decoding.

# Decode using viterbi decoding (default)
torchcrepe.predict(..., decoder=torchcrepe.decode.viterbi)

# Decode using weighted argmax (as in the original implementation)
torchcrepe.predict(..., decoder=torchcrepe.decode.weighted_argmax)

# Decode using argmax
torchcrepe.predict(..., decoder=torchcrepe.decode.argmax)

When harmonicity is low, the pitch is less reliable. For some problems, it makes sense to mask these less reliable pitch values. However, the harmonicity can be noisy and the pitch has quantization artifacts. torchcrepe provides submodules filter and threshold for this purpose. The filter and threshold parameters should be tuned to your data. For clean speech, a 10-20 millisecond window with a threshold of 0.21 has worked.

# We'll use a 15 millisecond window assuming a hop length of 5 milliseconds
win_length = 3

# Median filter noisy confidence value
harmonicity = torchcrepe.filter.median(harmonicity, win_length)

# Remove inharmonic regions
pitch = torchcrepe.threshold.At(.21)(pitch, harmonicity)

# Optionally smooth pitch to remove quantization artifacts
pitch = torchcrepe.filter.mean(pitch, win_length)

For more fine-grained control over pitch thresholding, see torchcrepe.threshold.Hysteresis. This is especially useful for removing spurious voiced regions caused by noise in the harmonicity values, but has more parameters and may require more manual tuning to your data.

Computing the CREPE model output activations

probabilities = torchcrepe.infer(torchcrepe.preprocess(audio, sr, hop_length))

Computing the CREPE embedding space

As in Differentiable Digital Signal Processing, this uses the output of the fifth max-pooling layer as a pretrained pitch embedding

embeddings = torchcrepe.embed(audio, sr, hop_length)

Computing from files

torchcrepe defines the following functions convenient for predicting directly from audio files on disk. Each of these functions also takes a device argument that can be used for device placement (e.g., device='gpu:0').

torchcrepe.predict_from_file(audio_file, ...)
torchcrepe.predict_from_file_to_file(
    audio_file, ..., output_pitch_file, output_harmonicity_file, ...)

torchcrepe.embed_from_file(audio_file, ...)
torchcrepe.embed_from_file_to_file(audio_file, ..., output_file, ...)

Command-line interface

usage: python -m torchcrepe
    [-h] [--output_harmonicity_file OUTPUT_HARMONICITY_FILE]
    [--embed] [--fmin FMIN] [--fmax FMAX] [--model MODEL]
    [--decoder DECODER] [--device DEVICE]
    audio_file output_file hop_length

positional arguments:
  audio_file            The audio file to process
  output_file           The file to save pitch or embedding
  hop_length            The hop length of the analysis window

optional arguments:
  -h, --help            show this help message and exit
  --output_harmonicity_file OUTPUT_HARMONICITY_FILE
                        The file to save harmonicity
  --embed               Performs embedding instead of pitch prediction
  --fmin FMIN           The minimum frequency allowed
  --fmax FMAX           The maximum frequency allowed
  --model MODEL         The model capacity. One of "tiny" or "full"
  --decoder DECODER     The decoder to use. One of "argmax", "viterbi", or
                        "weighted_argmax"
  --device DEVICE       The device to perform inference on

Tests

The module tests can be run as follows.

pip install pytest
pytest

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

0.0.24

May 16, 2025

0.0.23

Jun 17, 2024

0.0.22

Oct 9, 2023

0.0.21

Jul 28, 2023

0.0.20

Jun 3, 2023

0.0.19

May 9, 2023

0.0.18

Mar 22, 2023

0.0.17

Sep 12, 2022

0.0.16

Apr 3, 2022

0.0.15

Jun 25, 2021

0.0.14

Apr 29, 2021

0.0.13

Feb 23, 2021

0.0.12

Dec 15, 2020

0.0.11

Sep 23, 2020

0.0.10

Sep 23, 2020

0.0.9

Aug 26, 2020

0.0.8

Aug 22, 2020

0.0.7

Aug 12, 2020

0.0.6

Aug 10, 2020

0.0.5

Aug 9, 2020

This version

0.0.4

Aug 9, 2020

0.0.3

Aug 9, 2020

0.0.2

Aug 7, 2020

0.0.1

Aug 1, 2020

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

torchcrepe-0.0.4.tar.gz (72.3 MB view details)

Uploaded Aug 9, 2020 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

torchcrepe-0.0.4-py3-none-any.whl (72.3 MB view details)

Uploaded Aug 9, 2020 Python 3

File details

Details for the file torchcrepe-0.0.4.tar.gz.

File metadata

Download URL: torchcrepe-0.0.4.tar.gz
Upload date: Aug 9, 2020
Size: 72.3 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.2.0 pkginfo/1.4.2 requests/2.21.0 setuptools/40.6.3 requests-toolbelt/0.9.1 tqdm/4.28.1 CPython/3.7.1

File hashes

Hashes for torchcrepe-0.0.4.tar.gz
Algorithm	Hash digest
SHA256	`1dd1ad0ba8e63815a633273d799fc347e305c65658018c8d8a1bc4a9916760eb`
MD5	`46a12f531c594cf1245a904982dc444d`
BLAKE2b-256	`c1ae91206dda98abd09d4e50c5d440f435c966774bc353093d7636e5fda3c663`

See more details on using hashes here.

File details

Details for the file torchcrepe-0.0.4-py3-none-any.whl.

File metadata

Download URL: torchcrepe-0.0.4-py3-none-any.whl
Upload date: Aug 9, 2020
Size: 72.3 MB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.2.0 pkginfo/1.4.2 requests/2.21.0 setuptools/40.6.3 requests-toolbelt/0.9.1 tqdm/4.28.1 CPython/3.7.1

File hashes

Hashes for torchcrepe-0.0.4-py3-none-any.whl
Algorithm	Hash digest
SHA256	`d26c6ef0f61f786e7e0a35d065d231985c982989894453d7a1cee33f778e398d`
MD5	`f402f7e4227e5a7578e4b76134f72a1e`
BLAKE2b-256	`16e75faf658e22b795315966e7d9676ab25f6cfb48ad1d2bb900f0df25b95e88`

See more details on using hashes here.

torchcrepe 0.0.4

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

torchcrepe

Installation

Usage

Computing pitch and harmonicity from audio

Computing the CREPE model output activations

Computing the CREPE embedding space

Computing from files

Command-line interface

Tests

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes