Skip to main content

Spectra at your fingertips

Project description

SignalSnap: Signal Analysis In Python Made Easy

by M. Sifft and D. Hägele

We present a fast Python toolbox for the higher-order spectral analysis of time series. The usual second-order power spectrum and its higher-order generalization - so-called bi- and trispectra - are efficiently calculated on any platform. The toolbox supports GPU acceleration using the ArrayFire library. The treatment of large datasets is not limited by available RAM. We were able to process 11.6 GB of data (given in the hdf5 format) within just one minute to obtain a power spectrum with a one-million-point resolution using a five-year-old Nvidia GeForce GTX 1080. Similarly, 1000x1000 points of a trispectrum were obtained from processing 3.3 GB of data per minute.

Here, a few outstanding features of SignalSnap:

  • Errors of spectral values are automatically calculated (beginners example, example)
  • Support for just-in-time import from hdf data (dataset does not have to fit in RAM) (example)
  • Functions for conversion of Numpy array to hdf data is also provided (example)
  • Functions for storing and loading calculated data together with metadata (example)
  • Correlations between two time series can be calculated (example)
  • All calculation can be performed on GPU (NVidia and AMD) (see Arrayfire) (example)
  • Advanced plotting options for two-dimensional higher-order spectra
  • Usage of unbiased estimators for higher-order cumulants (see Literature)
  • Efficient implementation of the confined Gaussian window for an optimal RMS time-bandwidth product (see Literature)
  • Special functions for the verification of the stationarity of a signal (example)

Installation

SignalSnap is available on pip and can be installed with

pip install signalsnap

Documentation

A comprehensive documentation of SignalSnap will follow soon.

Examples

Examples for every function of the package are currently added to the folder Examples. Here are a few lines to get you started. We will generate some white noise as signal/dataset store it as Numpy array called y.

import SignalSnap as snp
import numpy as np
rng = np.random.default_rng()

# ------ Generation of white noise -----
f_unit = 'kHz'
fs = 10e3 # sampling rate in kHz
N = 1e5 # number of points
t = np.arange(N) / fs # unit is automatically chosen to be 1/f_unit = ms
y = rng.normal(scale=1, size=t.shape)

Now we creat a spectrum object and feed it with the data. This object will store the dataset, later the spectra and errors, all freely chosen variables and contains the methods for calculating the spectra, plotting and storing.

spec = snp.Spectrum(data=y, delta_t=1/fs, f_unit=f_unit)
T_window = 0.02 # in ms
f_max = 5e3 # in kHz
f, s, serr = spec.calc_spec(order_in=[2], T_window=T_window, f_max=f_max, backend='cpu')
Actual T_window: 0.02
Maximum frequency: 5000.0
Number of points: 101

data in first window

The output will show you the actual length of a window (in case your T_window is not a multiple of 1/fs), the maximum frequency (Nyquist frequency) and the number of point of the calculated spectrum. The data points in the first window are plotted, so you can verify the window length (which is also given in points by chunk shape). The function will return f the frequencies at which the spectrum has been calculated, s the spectral values, and serr the error of the spectra value (1 sigma).

Visualization of the results is just as easy as the calculation.

fig = spec.plot(order_in=[2], f_max=f_max/2)

power spectrum of the data

Besides the power spectrum (blue line) the error bands (1 to 5 sigma) are shown as grey lines in the plot. Now, we can even verify that we are dealing with true Gaussian noise by calculating the higher-order spectra of the time series.

Why higher-order spectra?

Higher-order spectra contain additional information that is not contained within a power spectrum. The toolbox is capable of calculating the third- and four-order spectrum (also called bi- and trispectrum, respectively). These have the following properties:

  • Spectra beyond second order are not sensitive to Gaussian noise.
  • Bispectrum: shows contributions whenever the phase of two frequencies and their sum are phase correlated (e.g. by mixing two signals)
  • Trispectrum: can be interpreted as intensity correlation between two frequencies

Let's calculate all spectra up to fourth order of the dataset above and verify that the signal does not deviate significantly from Gaussian noise using the first property (has no significant higher-order contributions). We only have to change the order_in argument:

f, s, serr = spec.calc_spec(order_in=[2, 3, 4], T_window=T_window, f_max=f_max, backend='cpu')

Plotting can also be done as before by changing the order_in argument:

fig = spec.plot(order_in=[2, 3, 4], f_max=f_max/2, green_alpha=0)

polyspectra of the data

Now, the third-and fourth-order spectra (S3 and S4) are visible. Just like the power spectrum they are noisy. To decide which of the fluctuations are significant we need a way of displaying errors in the two-dimensional plots. Here, errors are visualized be overlaying a green color on the spectral contributions which deviate from zero less than a certain number of standard deviations.

fig = spec.plot(order_in=[2, 3, 4], f_max=f_max/2, sigma=3)

polyspectra of the data

Clearly, all higher-order contributions are nothing but noise and we have, therefore, verifed that our original dataset was Gaussian noise (and even white noise due to the flat power spectrum).

Support

The development of the SignalSnap package is supported by the working group Spectroscopy of Condensed Matter of the Faculty of Physics and Astronomy at the Ruhr University Bochum.

Dependencies

For the package multiple libraries are used for the numerics and displaying the results:

  • NumPy
  • SciPy
  • MatPlotLib
  • tqdm
  • Numba
  • h5py
  • ArrayFire

Literature

Unbiased estimators are used for the calculation of higher-order cumulants. Their derivation can be found in this paper. An explanation for how the spectra are calculated can be found in Appendix B of this paper.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

signalsnap-0.1.3.tar.gz (26.5 MB view hashes)

Uploaded Source

Built Distribution

signalsnap-0.1.3-py3-none-any.whl (22.2 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page