spectralCrest

Spectral crest for audio signals and auditory spectrograms

Syntax

crest = spectralCrest(x,f)

crest = spectralCrest(x,f,Name,Value)

[crest,spectralPeak,spectralMean] = spectralCrest(___)

Description

crest = spectralCrest(x,f) returns the spectral crest of the signal, x, over time. How the function interprets x depends on the shape of f.

example

crest = spectralCrest(x,f,Name,Value) specifies options using one or more Name,Value pair arguments.

[crest,spectralPeak,spectralMean] = spectralCrest(___) returns the spectral peak and spectral mean.

Examples

collapse all

Spectral Crest of Time-Domain Audio

Open Live Script

Read in an audio file, calculate the crest using default parameters, and then plot the results.

[audioIn,fs] = audioread('Counting-16-44p1-mono-15secs.wav');
crest = spectralCrest(audioIn,fs);

t = linspace(0,size(audioIn,1)/fs,size(crest,1));
plot(t,crest)
xlabel('Time (s)')
ylabel('Crest')

Spectral Crest of Frequency-Domain Audio Data

Open Live Script

Read in an audio file and then calculate the mel spectrogram using the melSpectrogram function.

[audioIn,fs] = audioread('Counting-16-44p1-mono-15secs.wav');
[s,cf] = melSpectrogram(audioIn,fs);

Calculate the crest of the mel spectrogram over time. Plot the results.

crest = spectralCrest(s,cf);

t = linspace(0,size(audioIn,1)/fs,size(crest,1));
plot(t,crest)
xlabel('Time (s)')
ylabel('Crest')

Specify Nondefault Parameters

Open Live Script

Read in an audio file.

[audioIn,fs] = audioread('Counting-16-44p1-mono-15secs.wav');

Calculate the crest of the power spectrum over time. Calculate the crest for 50 ms Hamming windows of data with 25 ms overlap. Use the range from 62.5 Hz to fs/2 for the crest calculation. Plot the results.

crest = spectralCrest(audioIn,fs, ...
                      'Window',hamming(round(0.05*fs)), ...
                      'OverlapLength',round(0.025*fs), ...
                      'Range',[62.5,fs/2]);
                        
t = linspace(0,size(audioIn,1)/fs,size(crest,1));
plot(t,crest)
xlabel('Time (s)')
ylabel('Crest')

Calculate Spectral Crest of Streaming Audio

Open Live Script

Create a dsp.AudioFileReader object to read in audio data frame-by-frame. Create a dsp.SignalSink to log the spectral crest calculation.

fileReader = dsp.AudioFileReader('Counting-16-44p1-mono-15secs.wav');
logger = dsp.SignalSink;

In an audio stream loop:

Read in a frame of audio data.
Calculate the spectral crest for the frame of audio.
Log the spectral crest for later plotting.

To calculate the spectral crest for only a given input frame, specify a window with the same number of samples as the input, and set the overlap length to zero.

Plot the logged data.

while ~isDone(fileReader)
    audioIn = fileReader();
    crest = spectralCrest(audioIn,fileReader.SampleRate, ...
                          'Window',hamming(size(audioIn,1)), ...
                          'OverlapLength',0);
    logger(crest)
end

plot(logger.Buffer)
ylabel('Crest')

Use dsp.AsyncBuffer if

The input to your audio stream loop has a variable samples-per-frame.
The input to your audio stream loop has an inconsistent samples-per-frame with the analysis window of spectralCrest.
You want to calculate the spectral crest for overlapped data.

Create a dsp.AsyncBuffer object, reset the logger, and release the file reader.

buff = dsp.AsyncBuffer;
reset(logger)
release(fileReader)

Specify that the spectral crest is calculated for 50 ms frames with a 25 ms overlap.

fs = fileReader.SampleRate;

samplesPerFrame = round(fs*0.05);
samplesOverlap = round(fs*0.025);

samplesPerHop = samplesPerFrame - samplesOverlap;

win = hamming(samplesPerFrame);

while ~isDone(fileReader)
    audioIn = fileReader();
    write(buff,audioIn);
    
    while buff.NumUnreadSamples >= samplesPerHop
        audioBuffered = read(buff,samplesPerFrame,samplesOverlap);
        
        crest = spectralCrest(audioBuffered,fs, ...
                              'Window',win, ...
                              'OverlapLength',0);
        logger(crest)
    end
    
end
release(fileReader)

Plot the logged data.

plot(logger.Buffer)
ylabel('Crest (Hz)')

Input Arguments

collapse all

`x` — Input signal
column vector | matrix | 3-D array

Input signal, specified as a vector, matrix, or 3-D array. How the function interprets x depends on the shape of f.

Data Types: single | double

`f` — Sample rate or frequency vector (Hz)
scalar | vector

Sample rate or frequency vector in Hz, specified as a scalar or vector, respectively. How the function interprets x depends on the shape of f:

If f is a scalar, x is interpreted as a time-domain signal, and f is interpreted as the sample rate. In this case, x must be a real vector or matrix. If x is specified as a matrix, the columns are interpreted as individual channels.
If f is a vector, x is interpreted as a frequency-domain signal, and f is interpreted as the frequencies, in Hz, corresponding to the rows of x. In this case, x must be a real L-by-M-by-N array, where L is the number of spectral values at given frequencies of f, M is the number of individual spectrums, and N is the number of channels.
The number of rows of x, L, must be equal to the number of elements of f.

Data Types: single | double

Name-Value Pair Arguments

Specify optional comma-separated pairs of Name,Value arguments. Name is the argument name and Value is the corresponding value. Name must appear inside quotes. You can specify several name and value pair arguments in any order as Name1,Value1,...,NameN,ValueN.

Example: 'Window',hamming(256)

Note

The following name-value pair arguments apply if x is a time-domain signal. If x is a frequency-domain signal, name-value pair arguments are ignored.

`'Window'` — Window applied in time domain
`rectwin(round(f*0.03))` (default) | vector

Window applied in the time domain, specified as the comma-separated pair consisting of 'Window' and a real vector. The number of elements in the vector must be in the range [1, size(x,1)]. The number of elements in the vector must also be greater than OverlapLength.

Data Types: single | double

`'OverlapLength'` — Number of samples overlapped between adjacent windows
`round(f*0.02)` (default) | non-negative scalar

Number of samples overlapped between adjacent windows, specified as the comma-separated pair consisting of 'OverlapLength' and an integer in the range [0, size(Window,1)).

Data Types: single | double

`'FFTLength'` — Number of bins in DFT
`numel(Window)` (default) | positive scalar integer

Number of bins used to calculate the DFT of windowed input samples, specified as the comma-separated pair consisting of 'FFTLength' and a positive scalar integer. If unspecified, FFTLength defaults to the number of elements in the Window.

Data Types: single | double

`'Range'` — Frequency range (Hz)
`[0,f/2]` (default) | two-element row vector

Frequency range in Hz, specified as the comma-separated pair consisting of 'Range' and a two-element row vector of increasing real values in the range [0, f/2].

Data Types: single | double

`'SpectrumType'` — Spectrum type
`'power'` (default) | `'magnitude'`

Spectrum type, specified as the comma-separated pair consisting of 'SpectrumType' and 'power' or 'magnitude':

'power' –– The spectral crest is calculated for the one-sided power spectrum.
'magnitude' –– The spectral crest is calculated for the one-sided magnitude spectrum.

Data Types: char | string

Output Arguments

collapse all

`crest` — Spectral crest
scalar | vector | matrix

Spectral crest, returned as a scalar, vector, or matrix. Each row of crest corresponds to the spectral crest of a window of x. Each column of crest corresponds to an independent channel.

`spectralPeak` — Spectral peak
scalar | vector | matrix

Spectral peak, returned as a scalar, vector, or matrix. Each row of spectralPeak corresponds to the spectral crest of a window of x. Each column of spectralPeak corresponds to an independent channel.

`spectralMean` — Spectral mean
scalar | vector | matrix

Spectral mean, returned as a scalar, vector, or matrix. Each row of spectralMean corresponds to the spectral crest of a window of x. Each column of spectralMean corresponds to an independent channel.

Algorithms

The spectral crest is calculated as described in [1]:

$crest = \frac{\max (s_{k \in [b_{1}, b_{2}]})}{\frac{1}{b_{2} - b_{1}} \sum_{k = b_{1}}^{b_{2}} s_{k}}$

where

s_k is the spectral value at bin k.
b₁ and b₂ are the band edges, in bins, over which to calculate the spectral crest.

References

[1] Peeters, G. "A Large Set of Audio Features for Sound Description (Similarity and Classification) in the CUIDADO Project." Technical Report; IRCAM: Paris, France, 2004.

Documentation

spectralCrest

Syntax

Description

Examples

Spectral Crest of Time-Domain Audio

Spectral Crest of Frequency-Domain Audio Data

Specify Nondefault Parameters

Calculate Spectral Crest of Streaming Audio

Input Arguments

`x` — Input signal
column vector | matrix | 3-D array

`f` — Sample rate or frequency vector (Hz)
scalar | vector

Name-Value Pair Arguments

`'Window'` — Window applied in time domain
`rectwin(round(f*0.03))` (default) | vector

`'OverlapLength'` — Number of samples overlapped between adjacent windows
`round(f*0.02)` (default) | non-negative scalar

`'FFTLength'` — Number of bins in DFT
`numel(Window)` (default) | positive scalar integer

`'Range'` — Frequency range (Hz)
`[0,f/2]` (default) | two-element row vector

`'SpectrumType'` — Spectrum type
`'power'` (default) | `'magnitude'`

Output Arguments

`crest` — Spectral crest
scalar | vector | matrix

`spectralPeak` — Spectral peak
scalar | vector | matrix

`spectralMean` — Spectral mean
scalar | vector | matrix

Algorithms

References

Extended Capabilities

C/C++ Code Generation
Generate C and C++ code using MATLAB® Coder™.

GPU Arrays
Accelerate code by running on a graphics processing unit (GPU) using Parallel Computing Toolbox™.

See Also

Topics

Audio Toolbox Documentation

Support

Documentation

spectralCrest

Syntax

Description

Examples

Spectral Crest of Time-Domain Audio

Spectral Crest of Frequency-Domain Audio Data

Specify Nondefault Parameters

Calculate Spectral Crest of Streaming Audio

Input Arguments

x — Input signal column vector | matrix | 3-D array

f — Sample rate or frequency vector (Hz) scalar | vector

Name-Value Pair Arguments

'Window' — Window applied in time domain rectwin(round(f*0.03)) (default) | vector

'OverlapLength' — Number of samples overlapped between adjacent windows round(f*0.02) (default) | non-negative scalar

'FFTLength' — Number of bins in DFT numel(Window) (default) | positive scalar integer

'Range' — Frequency range (Hz) [0,f/2] (default) | two-element row vector

'SpectrumType' — Spectrum type 'power' (default) | 'magnitude'

Output Arguments

crest — Spectral crest scalar | vector | matrix

spectralPeak — Spectral peak scalar | vector | matrix

spectralMean — Spectral mean scalar | vector | matrix

Algorithms

References

Extended Capabilities

C/C++ Code Generation Generate C and C++ code using MATLAB® Coder™.

GPU Arrays Accelerate code by running on a graphics processing unit (GPU) using Parallel Computing Toolbox™.

See Also

Topics

Audio Toolbox Documentation

Support

`x` — Input signal
column vector | matrix | 3-D array

`f` — Sample rate or frequency vector (Hz)
scalar | vector

`'Window'` — Window applied in time domain
`rectwin(round(f*0.03))` (default) | vector

`'OverlapLength'` — Number of samples overlapped between adjacent windows
`round(f*0.02)` (default) | non-negative scalar

`'FFTLength'` — Number of bins in DFT
`numel(Window)` (default) | positive scalar integer

`'Range'` — Frequency range (Hz)
`[0,f/2]` (default) | two-element row vector

`'SpectrumType'` — Spectrum type
`'power'` (default) | `'magnitude'`

`crest` — Spectral crest
scalar | vector | matrix

`spectralPeak` — Spectral peak
scalar | vector | matrix

`spectralMean` — Spectral mean
scalar | vector | matrix

C/C++ Code Generation
Generate C and C++ code using MATLAB® Coder™.

GPU Arrays
Accelerate code by running on a graphics processing unit (GPU) using Parallel Computing Toolbox™.