dcase_util.processors.AudioReadingProcessor.process

AudioReadingProcessor.process(data=None, filename=None, focus_start_samples=None, focus_stop_samples=None, focus_duration_samples=None, focus_start_seconds=None, focus_stop_seconds=None, focus_duration_seconds=None, focus_channel=None, store_processing_chain=False, **kwargs)[source]

Audio reading

Parameters
data
filenamestr

Filename

focus_start_samplesint

Sample index of focus segment start

focus_stop_samplesint

Sample index of focus segment stop

focus_duration_samplesint

Sample count of focus segment

focus_start_secondsfloat

Time stamp (in seconds) of focus segment start

focus_stop_secondsfloat

Time stamp (in seconds) of focus segment stop

focus_duration_secondsfloat

Duration (in seconds) of focus segment

focus_channelint or str

Audio channel id or name to focus. In case of stereo signal, valid channel labels to select single channel are ‘L’, ‘R’, ‘left’, and ‘right’ or 0, 1, and to get mixed down version of all channels ‘mixdown’.

store_processing_chainbool

Store processing chain to data container returned Default value False

Returns
AudioContainer