Main Content

Feature Extraction

Mel spectrogram, MFCC, pitch, spectral descriptors

Extract features from audio signals for use as input to machine learning or deep learning systems. Use individual functions, such as melSpectrogram, mfcc, pitch, and spectralCentroid, or use the audioFeatureExtractor object to create a feature extraction pipeline that minimizes redundant calculations. In live scripts, use Extract Audio Features to graphically select the features to extract.

Objects

audioFeatureExtractorStreamline audio feature extraction
cepstralFeatureExtractorExtract cepstral features from audio segment
ivectorSystemCreate i-vector system

Live Editor Tasks

Extract Audio FeaturesStreamline audio feature extraction in the Live Editor

Functions

expand all

audioDeltaCompute delta features
designAuditoryFilterBankDesign auditory filter bank
melSpectrogramMel spectrogram
audioDeltaCompute delta features
cepstralCoefficientsExtract cepstral coefficients
gtccExtract gammatone cepstral coefficients, log-energy, delta, and delta-delta
mfccExtract MFCC, log energy, delta, and delta-delta of audio signal
openl3EmbeddingsExtract OpenL3 feature embeddings
vggishEmbeddingsExtract VGGish feature embeddings
audioDeltaCompute delta features
harmonicRatioHarmonic ratio
pitchEstimate fundamental frequency of audio signal
pitchnnEstimate pitch with deep learning neural network
audioDeltaCompute delta features
spectralCentroidSpectral centroid for audio signals and auditory spectrograms
spectralCrestSpectral crest for audio signals and auditory spectrograms
spectralDecreaseSpectral decrease for audio signals and auditory spectrograms
spectralEntropySpectral entropy for audio signals and auditory spectrograms
spectralFlatnessSpectral flatness for audio signals and auditory spectrograms
spectralFluxSpectral flux for audio signals and auditory spectrograms
spectralKurtosisSpectral kurtosis for audio signals and auditory spectrograms
spectralRolloffPointSpectral rolloff point for audio signals and auditory spectrograms
spectralSkewnessSpectral skewness for audio signals and auditory spectrograms
spectralSlopeSpectral slope for audio signals and auditory spectrograms
spectralSpreadSpectral spread for audio signals and auditory spectrograms
erb2hzConvert from equivalent rectangular bandwidth (ERB) scale to hertz
bark2hzConvert from Bark scale to hertz
mel2hzConvert from mel scale to hertz
hz2erbConvert from hertz to equivalent rectangular bandwidth (ERB) scale
hz2barkConvert from hertz to Bark scale
hz2melConvert from hertz to mel scale
phon2soneConvert from phon to sone
sone2phonConvert from sone to phon

Blocks

Auditory SpectrogramExtract mel, Bark, or ERB spectrogram from audio
Cepstral Feature ExtractorExtract cepstral features from audio segment
Design Auditory Filter BankDesign frequency-domain auditory filter bank
Design Mel Filter BankDesign frequency-domain mel filter bank
Mel SpectrogramExtract mel spectrogram from audio

Topics