Praat-compatible acoustic analysis in Python (Rust implementation)
Project description
praatfan-gpl
Praat-compatible acoustic analysis in Python, powered by Rust.
This package provides exact reimplementations of Praat's acoustic analysis algorithms, designed to produce bit-accurate output matching Praat/parselmouth.
Installation
pip install praatfan-gpl
Building from source
Requires Rust and maturin:
cd python
pip install maturin
maturin develop --release
Quick Start
import praatfan_gpl as pc
# Load an audio file
sound = pc.Sound.from_file("speech.wav")
# Or create from numpy array
import numpy as np
samples = np.sin(2 * np.pi * 440 * np.arange(44100) / 44100)
sound = pc.Sound(samples, 44100.0)
# Compute pitch (F0)
pitch = sound.to_pitch(0.01, 75.0, 600.0)
print(f"Pitch at 0.5s: {pitch.get_value_at_time(0.5, 'hertz', 'linear')}")
# Get all pitch values as numpy array
f0_values = pitch.values() # NaN for unvoiced frames
times = pitch.times()
# Compute formants
formant = sound.to_formant_burg(
time_step=0.01,
max_num_formants=5,
max_formant_hz=5500.0, # Use 5000 for female speakers
window_length=0.025,
pre_emphasis_from=50.0
)
# Get F1, F2 at specific time
f1 = formant.get_value_at_time(1, 0.5, "hertz", "linear")
f2 = formant.get_value_at_time(2, 0.5, "hertz", "linear")
# Get all F1 values
f1_values = formant.formant_values(1) # numpy array
# Compute intensity
intensity = sound.to_intensity(100.0, 0.01)
db_values = intensity.values()
# Compute spectrum (single-frame FFT)
spectrum = sound.to_spectrum(fast=True)
cog = spectrum.get_center_of_gravity(2.0)
std = spectrum.get_standard_deviation(2.0)
# Compute spectrogram
spectrogram = sound.to_spectrogram(
effective_analysis_width=0.005,
max_frequency=5000.0,
time_step=0.002,
frequency_step=20.0,
window_shape="gaussian"
)
spec_data = spectrogram.values() # 2D numpy array [freq, time]
# Compute harmonicity (HNR)
hnr = sound.to_harmonicity_ac(0.01, 75.0, 0.1, 1.0)
# Or cross-correlation method:
hnr_cc = sound.to_harmonicity_cc(0.01, 75.0, 0.1, 1.0)
API Reference
Sound
# Loading
Sound.from_file(path) # Load from WAV, MP3, FLAC, OGG
Sound(samples, sample_rate) # From numpy array
# Properties
sound.sample_rate # Sample rate in Hz
sound.duration # Duration in seconds
sound.num_samples # Number of samples
sound.start_time, sound.end_time # Time bounds
# Methods
sound.samples() # Get samples as numpy array
sound.get_value_at_time(time) # Linear interpolation
sound.extract_part(start, end, ...) # Extract segment
sound.pre_emphasis(from_freq) # High-pass filter
sound.rms(), sound.peak() # Amplitude measures
# Analysis
sound.to_pitch(dt, floor, ceil) # Pitch extraction
sound.to_formant_burg(...) # Formant analysis
sound.to_intensity(min_pitch, dt) # Intensity contour
sound.to_spectrum(fast) # Single-frame FFT
sound.to_spectrogram(...) # Time-frequency analysis
sound.to_harmonicity_ac/cc(...) # HNR analysis
# Speech-referenced variants (robust on long recordings — see below)
sound.to_pitch_ac_referenced(dt, floor, ceil, *, reference_peak=None)
sound.to_pitch_cc_referenced(dt, floor, ceil, *, reference_peak=None)
sound.to_harmonicity_ac_referenced(dt, min_pitch, silence, ppw, *, reference_peak=None)
sound.to_harmonicity_cc_referenced(dt, min_pitch, silence, ppw, *, reference_peak=None)
Speech-referenced normalization (long recordings)
Praat references each frame's amplitude against the whole-file peak. On utterance-length audio that peak is the speech peak, which is correct. On long conversational recordings (e.g. 10-minute Buckeye files) the peak is set by whatever is loudest anywhere — a click, laugh, or interviewer burst — so every quiet voiced frame's relative intensity drops and gets misclassified as silent, which then NaN-gates downstream formants and HNR.
The *_referenced methods substitute a robust amplitude reference for the
whole-file peak. The plain to_pitch / to_harmonicity_* methods are
unchanged and remain bit-exact with Praat.
import praatfan_gpl as pg
# Estimate one reference per recording, then share it across pitch and HNR.
# Standards are frame-level robust statistics over speech-active frames, so a
# short loud burst cannot dominate them:
ref = pg.estimate_speech_reference(samples, sample_rate)
ref.reference_peak # 75th-pct of per-frame peak |x| — a typical-speech peak
ref.std # median per-frame RMS (z-norm scale)
ref.mean # median per-frame mean (DC reference)
ref.speech_mask # per-frame bool, hop_s=0.01 grid
ref.frame_times # frame centers (s)
ref.speech_fraction # fraction of frames in the speech mask
pitch = sound.to_pitch_ac_referenced(0.01, 75.0, 600.0,
reference_peak=ref.reference_peak)
hnr = sound.to_harmonicity_cc_referenced(0.01, 75.0, 0.1, 1.0,
reference_peak=ref.reference_peak)
# Or pass reference_peak=None (the default) to have each call estimate the
# reference internally.
The estimator definition and constants are shared across the praatfan package
family (praatfan, praatfan-rust, praatfan-gpl), so a reference_peak
computed by one package is interchangeable with the others.
Pitch
pitch.get_value_at_time(time, unit, interp) # Query pitch
pitch.values() # All values (NaN=unvoiced)
pitch.times() # Frame times
pitch.num_frames, pitch.time_step # Frame info
pitch.pitch_floor, pitch.pitch_ceiling # Analysis parameters
Formant
formant.get_value_at_time(n, time, unit, interp) # Fn frequency
formant.get_bandwidth_at_time(n, time, unit, interp) # Fn bandwidth
formant.formant_values(n) # All Fn values
formant.bandwidth_values(n) # All Bn values
formant.times() # Frame times
Intensity
intensity.get_value_at_time(time, interp) # Query dB
intensity.values() # All values
intensity.times() # Frame times
intensity.min(), intensity.max(), intensity.mean()
Spectrum
spectrum.get_band_energy(f_min, f_max) # Energy in band (Pa² s)
spectrum.get_center_of_gravity(power) # Spectral centroid
spectrum.get_standard_deviation(power) # Spectral spread
spectrum.get_skewness(power) # Spectral skewness
spectrum.get_kurtosis(power) # Spectral kurtosis
spectrum.num_bins, spectrum.df # Frequency info
Spectrogram
spectrogram.values() # 2D array [freq, time]
spectrogram.get_time_from_frame(i) # Frame time
spectrogram.get_frequency_from_bin(i) # Bin frequency
spectrogram.num_frames, spectrogram.num_freq_bins
spectrogram.time_step, spectrogram.freq_step
Harmonicity
harmonicity.get_value_at_time(time, interp) # Query HNR (dB)
harmonicity.values() # All values
harmonicity.times() # Frame times
harmonicity.min(), harmonicity.max(), harmonicity.mean()
Unit Options
- Pitch units:
"hertz","mel","semitones","erb" - Frequency units:
"hertz","bark","mel","erb" - Interpolation:
"nearest","linear","cubic" - Window shapes:
"gaussian","hanning","hamming","rectangular"
Comparison with parselmouth
praatfan-gpl aims for bit-accurate compatibility with Praat/parselmouth:
import parselmouth
import praatfan_gpl as pc
import numpy as np
sound_pm = parselmouth.Sound("speech.wav")
sound_pc = pc.Sound.from_file("speech.wav")
# Formant comparison
formant_pm = sound_pm.to_formant_burg(0.01, 5, 5500, 0.025, 50)
formant_pc = sound_pc.to_formant_burg(0.01, 5, 5500.0, 0.025, 50.0)
for t in np.arange(0, sound_pm.duration, 0.01):
f1_pm = formant_pm.get_value_at_time(1, t)
f1_pc = formant_pc.get_value_at_time(1, t, "hertz", "linear")
if f1_pm is not None and f1_pc is not None:
assert abs(f1_pm - f1_pc) < 1.0 # Within 1 Hz
License
GPL-3.0 (same as Praat).
Citing
If you use praatfan-gpl in published work, please cite Praat, Parselmouth,
and — if you use FormantPath — Weenink (2015):
- Boersma, P. & Weenink, D. (2024). Praat: doing phonetics by computer. https://www.fon.hum.uva.nl/praat/
- Jadoul, Y., Thompson, B., & de Boer, B. (2018). "Introducing Parselmouth: A Python interface to Praat." Journal of Phonetics, 71, 1–15.
- Weenink, D. (2015). "Improved formant frequency measurements of short segments." Proceedings of ICPhS 2015, Glasgow.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
Built Distributions
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file praatfan_gpl-0.1.9-cp39-abi3-win_arm64.whl.
File metadata
- Download URL: praatfan_gpl-0.1.9-cp39-abi3-win_arm64.whl
- Upload date:
- Size: 1.3 MB
- Tags: CPython 3.9+, Windows ARM64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
ae5b44dec0d170c044b906cb32331f583489ea2066e09a469e76974d71eb2b5b
|
|
| MD5 |
221983813d5eba67460dfefd35e7cbac
|
|
| BLAKE2b-256 |
6d779922778492290b449730d414d27f53ac79086e7b6e57d33359b08e41b9f6
|
File details
Details for the file praatfan_gpl-0.1.9-cp39-abi3-win_amd64.whl.
File metadata
- Download URL: praatfan_gpl-0.1.9-cp39-abi3-win_amd64.whl
- Upload date:
- Size: 1.7 MB
- Tags: CPython 3.9+, Windows x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
99c7fd4a8094c109f6fe1e10c3484a62e5963af31798813880ff4769b894df44
|
|
| MD5 |
db0cdc32dfa53b2c391ed2476110eb38
|
|
| BLAKE2b-256 |
e7c3ec7e3f488be3723bf58893c6c926e5b0c35ff5b14be8ff6b2e2298452fd6
|
File details
Details for the file praatfan_gpl-0.1.9-cp39-abi3-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.
File metadata
- Download URL: praatfan_gpl-0.1.9-cp39-abi3-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
- Upload date:
- Size: 2.1 MB
- Tags: CPython 3.9+, manylinux: glibc 2.17+ x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
b4f5d27d6d015032cde321c96709f4c5b85c2e209ac963a6f56b5f496dddc918
|
|
| MD5 |
c5b07bc3dffca751e74027d975014a2e
|
|
| BLAKE2b-256 |
558b6152b03cfd29879f377639564a89b8f46693cd4af199706c1ab74337c056
|
File details
Details for the file praatfan_gpl-0.1.9-cp39-abi3-manylinux_2_17_aarch64.manylinux2014_aarch64.whl.
File metadata
- Download URL: praatfan_gpl-0.1.9-cp39-abi3-manylinux_2_17_aarch64.manylinux2014_aarch64.whl
- Upload date:
- Size: 1.7 MB
- Tags: CPython 3.9+, manylinux: glibc 2.17+ ARM64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
e67c4a276abc81b0c63d19e9459ccf08c39a2ea0cf369af6810d305861a0e3c1
|
|
| MD5 |
82e27c382e45a3d11e4de7d61d9e7e38
|
|
| BLAKE2b-256 |
2ab819cfc357986b05b263a8967aafdb5f6af1723bc6abaab5969a0d376fa872
|
File details
Details for the file praatfan_gpl-0.1.9-cp39-abi3-macosx_11_0_arm64.whl.
File metadata
- Download URL: praatfan_gpl-0.1.9-cp39-abi3-macosx_11_0_arm64.whl
- Upload date:
- Size: 1.5 MB
- Tags: CPython 3.9+, macOS 11.0+ ARM64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
757ac8ca73cd9e090a64d78d3c4ba389a46e9883ac2ed5252abc6cbed36a7db8
|
|
| MD5 |
08af544f8ab1fca34460d567627b8932
|
|
| BLAKE2b-256 |
78e0638623591b7576a75a4c183224b038c37638f8f7fb8e361bed9d0d56d45f
|
File details
Details for the file praatfan_gpl-0.1.9-cp39-abi3-macosx_10_12_x86_64.whl.
File metadata
- Download URL: praatfan_gpl-0.1.9-cp39-abi3-macosx_10_12_x86_64.whl
- Upload date:
- Size: 1.9 MB
- Tags: CPython 3.9+, macOS 10.12+ x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
203ace25ae21d6545cf204b0d6d438834bd328ec565b99b6359ebae918fc7086
|
|
| MD5 |
a114369115fc96b295e94ba5a782a039
|
|
| BLAKE2b-256 |
1d9070aef417498729e2bc8f2c3ee035ce1249faedc0c528428628dd36894a3e
|