dcase_util.features.MfccStaticExtractor
- class dcase_util.features.MfccStaticExtractor(fs=44100, win_length_samples=None, hop_length_samples=None, win_length_seconds=0.04, hop_length_seconds=0.02, spectrogram_type='magnitude', n_fft=2048, window_type='hamming_asymmetric', n_mels=40, fmin=0, fmax=None, normalize_mel_bands=False, htk=False, n_mfcc=20, omit_zeroth=False, **kwargs)
Feature extractor class to extract static MFCC features
Constructor
- Parameters
- fs : int
Sampling rate of the incoming signal. Default value 44100
- win_length_samples : int
Window length in samples. Default value None
- hop_length_samples : int
Hop length in samples. Default value None
- win_length_seconds : float
Window length in seconds. Default value 0.04
- hop_length_seconds : float
Hop length in seconds. Default value 0.02
- spectrogram_type : str
Spectrogram type, either 'magnitude' or 'power'. Default value 'magnitude'
- n_fft : int
Length of the FFT window. Default value 2048
- window_type : str
Window function type. Default value 'hamming_asymmetric'
- n_mels : int
Number of mel bands to generate. Default value 40
- fmin : int
Lowest frequency in mel bands (in Hz). Default value 0
- fmax : int
Highest frequency in mel bands (in Hz). If None, fmax = fs/2.0. Default value None
- normalize_mel_bands : bool
Normalize mel bands to have a peak at 1.0. Default value False
- htk : bool
Use the HTK formula for mel band creation instead of Slaney's. Default value False
- n_mfcc : int
Number of MFCC coefficients. Default value 20
- omit_zeroth : bool
Omit the 0th coefficient. Default value False
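The defaults above interact: window and hop lengths can be given in seconds or in samples, fmax falls back to fs/2.0, and htk selects between two mel formulas. The sketch below illustrates how these values might resolve. The helper names (resolve_parameters, hz_to_mel_htk, hz_to_mel_slaney) are hypothetical and not part of dcase_util. That sample values take precedence over second values, and that omit_zeroth leaves n_mfcc - 1 coefficients per frame, are assumptions rather than documented behavior.

```python
import math


def resolve_parameters(fs=44100, win_length_samples=None, hop_length_samples=None,
                       win_length_seconds=0.04, hop_length_seconds=0.02,
                       fmax=None, n_mfcc=20, omit_zeroth=False):
    """Hypothetical helper (not a dcase_util function) that resolves derived values."""
    # Assumption: lengths in samples, when given, take precedence over seconds.
    if win_length_samples is None:
        win_length_samples = int(round(fs * win_length_seconds))
    if hop_length_samples is None:
        hop_length_samples = int(round(fs * hop_length_seconds))
    # Documented above: if fmax is None, fmax = fs/2.0 (the Nyquist frequency).
    if fmax is None:
        fmax = fs / 2.0
    # Assumption: dropping the 0th coefficient leaves n_mfcc - 1 values per frame.
    n_coefficients = n_mfcc - 1 if omit_zeroth else n_mfcc
    return {
        'win_length_samples': win_length_samples,
        'hop_length_samples': hop_length_samples,
        'fmax': fmax,
        'n_coefficients': n_coefficients,
    }


def hz_to_mel_htk(frequency_hz):
    """HTK mel formula (the htk=True case)."""
    return 2595.0 * math.log10(1.0 + frequency_hz / 700.0)


def hz_to_mel_slaney(frequency_hz):
    """Slaney mel formula: linear below 1 kHz, logarithmic above (the htk=False case)."""
    f_sp = 200.0 / 3.0            # Hz per mel in the linear region
    min_log_hz = 1000.0           # start of the logarithmic region
    if frequency_hz < min_log_hz:
        return frequency_hz / f_sp
    min_log_mel = min_log_hz / f_sp
    logstep = math.log(6.4) / 27.0
    return min_log_mel + math.log(frequency_hz / min_log_hz) / logstep


params = resolve_parameters()
print(params['win_length_samples'], params['hop_length_samples'], params['fmax'])
print(hz_to_mel_htk(1000.0), hz_to_mel_slaney(1000.0))
```

With the default settings, the 0.04 s window corresponds to 1764 samples at 44100 Hz, which is shorter than the default n_fft of 2048.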
Methods
- __init__([fs, win_length_samples, ...]): Constructor
- extract(y): Extract features for the audio signal.
- get_spectrogram(y[, n_fft, ...]): Spectrogram
- get_window_function(n[, window_type]): Window function
- log([level]): Log container content
- show([mode, indent, visualize]): Print container content
- to_html([indent]): Get container information in an HTML-formatted string
- to_string([ui, indent]): Get container information in a string
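The default window_type, 'hamming_asymmetric', suggests a periodic (asymmetric) Hamming window as opposed to a symmetric one. The NumPy-only sketch below shows the difference between the two forms. The function hamming_window is hypothetical and is not the library's get_window_function; mapping the name 'hamming_asymmetric' to the periodic form is an assumption.

```python
import numpy as np


def hamming_window(n, symmetric=True):
    """Hypothetical helper: return a Hamming window of length n.

    The asymmetric (periodic) form is a symmetric window of length n + 1
    with its last sample dropped, which is the usual choice for FFT analysis.
    """
    if symmetric:
        return np.hamming(n)
    return np.hamming(n + 1)[:-1]


symmetric = hamming_window(8, symmetric=True)
asymmetric = hamming_window(8, symmetric=False)

# The symmetric window starts and ends at the same value.
print(symmetric[0], symmetric[-1])
# The asymmetric window does not repeat its first value at the end.
print(asymmetric[0], asymmetric[-1])
```

The periodic form avoids duplicating the endpoint value when consecutive frames are treated as one period of a repeating signal.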
Attributes
- description: Extractor description
- label: Extractor label
- logger: Logger instance