dcase_util.processors.OpenL3ExtractorProcessor

class dcase_util.processors.OpenL3ExtractorProcessor(fs=44100, hop_length_samples=None, hop_length_seconds=0.02, model=None, input_repr='mel256', content_type='music', embedding_size=6144, center=True, batch_size=32, verbose=False, **kwargs)[source]

Constructor

Parameters

fsint: Sampling rate of the incoming signal.
hop_length_samplesint: Hop length in samples. Default value None
hop_length_secondsfloat: Hop length in seconds. Default value 0.02
modelkeras.models.Model or None: Loaded model object. If a model is provided, then input_repr, content_type, and embedding_size will be ignored. If None is provided, the model will be loaded using the provided values of input_repr, content_type and embedding_size. Default value None
input_repr“linear”, “mel128”, or “mel256”: Spectrogram representation used for model. Ignored if model is a valid Keras model. Default value “mel256”
content_type“music” or “env”: Type of content used to train the embedding model. Ignored if model is a valid Keras model. Default value “music”
embedding_size6144 or 512: Embedding dimensionality. Ignored if model is a valid Keras model. Default value 6144
centerbool: If True, pads beginning of signal so timestamps correspond to center of window. Default value True
batch_sizeint: Batch size used for input to embedding model Default value 32
verbosebool: If True, prints verbose messages. Default value False

__init__(fs=44100, hop_length_samples=None, hop_length_seconds=0.02, model=None, input_repr='mel256', content_type='music', embedding_size=6144, center=True, batch_size=32, verbose=False, **kwargs)[source]

Constructor

Parameters

fsint: Sampling rate of the incoming signal.
hop_length_samplesint: Hop length in samples. Default value None
hop_length_secondsfloat: Hop length in seconds. Default value 0.02
modelkeras.models.Model or None: Loaded model object. If a model is provided, then input_repr, content_type, and embedding_size will be ignored. If None is provided, the model will be loaded using the provided values of input_repr, content_type and embedding_size. Default value None
input_repr“linear”, “mel128”, or “mel256”: Spectrogram representation used for model. Ignored if model is a valid Keras model. Default value “mel256”
content_type“music” or “env”: Type of content used to train the embedding model. Ignored if model is a valid Keras model. Default value “music”
embedding_size6144 or 512: Embedding dimensionality. Ignored if model is a valid Keras model. Default value 6144
centerbool: If True, pads beginning of signal so timestamps correspond to center of window. Default value True
batch_sizeint: Batch size used for input to embedding model Default value 32
verbosebool: If True, prints verbose messages. Default value False

Methods

`__init__`([fs, hop_length_samples, ...])	Constructor
`delimiter`([exclude_delimiters])	Use csv.sniffer to guess delimiter for CSV file
`detect_file_format`([filename])	Detect file format from extension
`empty`()	Check if file is empty
`exists`()	Checks that file exists
`extract`(y)	Extract features for the audio signal.
`get_file_information`()	Get file information, filename
`get_processing_chain_item`()	Get processing chain item with current Processor data.
`is_package`([filename])	Determine if the file is compressed package.
`load`([filename])	Load file
`log`([level])	Log container content
`process`([data, store_processing_chain])	Extract features
`save`([filename])	Save file
`show`([mode, indent, visualize])	Print container content
`to_html`([indent])	Get container information in a HTML formatted string
`to_string`([ui, indent])	Get container information in a string
`validate_format`()	Validate file format

Attributes

`bytes`	File size in bytes
`description`	Extractor description
`input_type`	Input data type
`label`	Extractor label
`logger`	Logger instance
`md5`	Checksum for file.
`output_type`	Output data type
`valid_formats`	Valid file formats