Release notes
v0.2.20
Updates
Add MKV as valid file extension in
AudioContainerAdd
overlaymethod inAudioContainerAdd balancing_mode parameter to
validation_files_balancedmethod inSoundEventDatasetAdd
datasetfield toMetaDataItemand adddatasetfield related properties toMetaDataContainerAdd identifier count reporting for tags in
MetaDataContainerAdd
map_tagsmethod inMetaDataContainerAdd
TAUUrbanAcousticScenes_2022_Mobile_EvaluationSetdatasetUpdate
unique_source_labelsmethod inMetaDataContainerto be more efficientUpdate
model_summary_stringfunction to work with latest Keras versions
Bug fixes
Fix field override in
non_hashable_fieldsmethod inAppParameterContainerFix
process_meta_containerinAcousticSceneDatasetto retain container filenameFix
is_jupyterfunction to work when IPython is not installedFix
plotinAudioContainermethod to use figsize parameter for dual plots
v0.2.19
Updates
Add
TAUUrbanAcousticScenes_2022_Mobile_DevelopmentSetAdd
active_scenesandactive_eventsparameter for Datasets class to select scene and event classes.Add event activity and inactivity calculation for
MetaDataContainerAdd
TorchOpenL3ExtractorandTorchOpenL3ExtractorProcessorAdd
float5andfloat6value types toformatted_valuemethod inFancyStringifier.Add valid content type ‘all’ for
RemoteFile
Bug fixes
Fix
unique_filesinMetaDataContainerto be more efficient with large number of files.Fix
check_metadatainDatasetto be more efficient with large number of files.Fix field ‘event_label’ processing to be robust for non-string values in
MetaDataItemFix data loading to be robust for empty files in
DictContainerFix ‘float1_percentage+ci’, ‘float2_percentage+ci’, ‘float3_percentage+ci’, and ‘float4_percentage+ci’ value types in
FancyStringifierto have fallback data types if values do not have full data.
v0.2.18
Updates
Add
TAUUrbanAudioVisualScenes_2021_EvaluationSetclassTAUUrbanAcousticScenes_2021_Mobile_EvaluationSet
Bug fixes
Fix
normalizemethod inAudioContainerto work with multi and single channel audio.Fix
packmethod inDatasetPackercorrectly identify changed files and trigger package regenerationFix
TAUUrbanAcousticScenes_2021_Mobile_EvaluationSetto correctly extract zip-packages
v0.2.17
Updates
Add TensorFlow keras utilities (tf.keras)
Add
get_media_durationandmerge_media_filesfunctionsAdd
filename_audioandfilename_videoproperties toMetaDataItemclassAdd
TAUUrbanAudioVisualScenes_2021_DevelopmentSetclassUpdate travis tests: Python 2.7 tests are dropped, only Python 3.X tests are used
Bug fixes
Fix base_path removal in
DatasetPacker
v0.2.16
Updates
Add scene_labels for
TAUUrbanAcousticScenes_2020_Mobile_EvaluationSetandTAUUrbanAcousticScenes_2020_3Class_EvaluationSet.Update
SubmissionChecker
v0.2.15
Bug fixes
Fix all_data fold handling in
TAUUrbanAcousticScenes_2020_Mobile_EvaluationSetandTAUUrbanAcousticScenes_2020_3Class_EvaluationSetto be uniform.
v0.2.14
Updates
Update PIP package
v0.2.13
New features
TAUUrbanAcousticScenes_2020_Mobile_EvaluationSetandTAUUrbanAcousticScenes_2020_3Class_EvaluationSetdatasets
v0.2.12
Updates
Update
TAUUrbanAcousticScenes_2020_Mobile_DevelopmentSetto use dataset version 2.0
v0.2.11
New features
Add
OpenL3Extractor,EdgeL3Extractor, andEmbeddingExtractorfeature extractor classesAdd
OpenL3ExtractorProcessorandEdgeL3ExtractorProcessorprocessorsAdd
TAUUrbanAcousticScenes_2020_Mobile_DevelopmentSetandTAUUrbanAcousticScenes_2020_3Class_DevelopmentSetdatasetsAdd
get_audio_infofunction to allow fetching audio file information without reading full file.Add MP3 audio example file
Updates
Update
AudioContainerconstructor to allow initialization with multi-channel audio data in form of list of audio data vectors.Update
loadmethod inAudioContainerto have parameterauto_trimmingto automatically trim stop parameter to audio file lengthUpdate
loadmethod inAudioContainerto check start and stop parameters against actual audio file durationUpdate
AudioContainerto storechannel_labels, updateplot_waveandplot_specmethods withchannel_labelsas well.Update
plot_wavemethod inAudioContainerto supportmax_srparameter and different color per channel.Update
plotmethod inAudioContainerto plot both waveform and spectrogram at the same time (dualplotting mode)Update
segmentsmethod inAudioContainerto supportactive_segmentsUpdate
FancyLogger,FancyHTMLPrinter, andFancyStringifierto accumulate row value when usingrowmethod, and addrow_sumandrow_averagemethodsUpdate
setup_kerasto suppress TensorFlow warningsUpdate
debug_packagesmethod ofDatasetclass to show more information about local filesUpdate
FileMixinto allow overridingvalid_formatthrough constructor parameterUpdate code to support Librosa 0.7.0
Bug fixes
Fix
MetaDataContainersorting to work with numeric filenamesFix
get_byte_stringto work with small valuesFix filename handling in FeatureRepository when dict of filenames is used
Fix
collapse_probabilities_windowedmethod inProbabilityEncoderto accept arrays of probabilitiesFix example system
sed_gmm.pyto work with current version
v0.2.10
Bug fixes
Fix cross-validation data loading for datasets without reference meta data in
load_crossvalidation_datamethod ofDatasetclass
v0.2.9
New features
Add
TAUUrbanAcousticScenes_2019_EvaluationSet,TAUUrbanAcousticScenes_2019_Mobile_EvaluationSet, andTAUUrbanAcousticScenes_2019_Openset_EvaluationSetdatasets.
v0.2.8
New features
Add
TAUUrbanAcousticScenes_2019_LeaderboardSet,TAUUrbanAcousticScenes_2019_Mobile_LeaderboardSet, andTAUUrbanAcousticScenes_2019_Openset_LeaderboardSetdatasets.Add
is_jupyterfunction to detect if code is running inside jupyterAdd
shortenmethod inPathto shorten long paths for visualization purposeAdd
FancyHTMLStringifier,FancyHTMLPrinterclasses for HTML outputAdd
plotmethod inDataArrayContainerAdd
plotmethod inNormalizer
Updates
Update YAML serialization to use
yaml.FullLoaderformatted_valuemethod inFancyStringifierto be staticRefactor printing methods in containers to allow automatic output mode switching between HTML (Jypyter) and string (console)
Update data printing mechanism for containers
Update
plotmethods API to includefigsizeparameterUpdate default parameters in
plotmethod inAudioContainer(color bar is hidden by default)Update error messages in
AudioContainerto be more informativeUpdate
loadmethod inMetaDataContainerto support additional row formatsUpdate
feature_extractor_listmethod to have option to return string or display (print to console or print as HTML output in Jupyter)Update
dataset_listto use table layout and add option to return string or display (print to console or print as HTML output in Jupyter)Update
to_stringmethod inMetaDataContainerwith optionshow_infoto control what data is printUpdate API for methods
showandloginDatasetto includeshow_metaparameter andmodeparameter to control output formatUpdate printing
validation_files_balancedmethod inAcousticSceneDatasetto support different output modes (print to console or print as HTML output in Jupyter)Update
ProgressLoggerCallbackto includeshow_timingparameter andnotebookoutput typeUpdate
StasherCallbackwithto_stringandshowUpdate printing inside
setup_kerasfunctionUpdate
model_summary_stringfunction with new parameters (show_parametersanddisplay)Update
plotmethod inDataMatrix2DContainerwithxlabelandylabelparametersUpdate
plotmethod inBinaryMatrix2DContainerwithpanel_title_positionparametersUpdate usage of
tqdmlibrary inDatasetto allow locally progress bar disable/enable
Bug fixes
Fix single channel audio plotting in
AudioContainer
v0.2.7
Updates
Update
TAUUrbanAcousticScenes_2019_Mobile_DevelopmentSet, andTAUUrbanAcousticScenes_2019_Openset_DevelopmentSetdatasets.
v0.2.6
New features
Add
TAUUrbanAcousticScenes_2019_DevelopmentSet,TAUUrbanAcousticScenes_2019_Mobile_DevelopmentSet, andTAUUrbanAcousticScenes_2019_Openset_DevelopmentSetdatasets.Add
OneHotEncoderandOneHotEncodingProcessorto allow unknown labels.Add automatic meta data check ups in datasets classes, and parameter to control it.
Add
AudioSequencingProcessorAdd
feature_extractor_listto show all available feature extractors classes, and add description to all feature extraction classes.
Updates
Update
debug_packagesmethod to allow better control which part of package_list is checked: remote or local.Update
data_collectorto have generic data axis handling.Update
loadmethod inListDictContainerto skip empty rows in CSV files.Update
savemethod inListDictContainerfor TXT and CSV to avoid extra empty lines under Windows.Update
savemethod inMetaDataContainerfor TXT and CSV to avoid extra empty lines under Windows.Update
relative_to_absolute_pathandabsolute_to_relative_pathto give more informative error messages.Update
EventRollEncodingProcessorto supportpad_lengthparameter.Update unit tests to be cross-platform compatible (Linux / Windows)
Update
SuppressStdoutAndStderrto be more robustUpdate
MetaDataItemto keep filename field to be posix path when relative path is used.Update dtypes to be compatible with numpy v1.14
Update
setup_kerasto warn when GPU was not found.Update
model_summary_stringto show activation function of the output layer.Update all processors, encoders, and manipulators have __call__ magic class method.
Bug fixes
Fix delimiter detection in
loadmethod inMetaDataContainerFix
MetaDataItemto better handle empty fields (onset, offset, and event_label).Fix how
validation_splitandvalidation_files_datasetmethod usestraining_metaparameter.
v0.2.5
New features
Add
SoundDatasetbase class.Add
feature_extractor_factoryto get feature extractor class based on feature label.Add
OneHotLabelEncoderlabel based encoder.Add
OneHotLabelEncodingProcessorclass.Add
DBR_Datasetclass.Add
map_eventsmethod toMetaDataContainerto map multiple event labels into single target event label.Add
event_inactivitymethod toMetaDataContainerto get inactivity segments between events.Add
__version__variable to the module.Add
check_installationfunction to check module installation.Add
TUTAcousticScenes_2017_FeaturesSetdataset class.Add
check_metadatamethod to dataset classes to double check meta and cross-validation setups automatically during the dataset initialization.
Updates
Update
ProcessingChainto verify that all items in the chain are instances ofProcessorclass.Update
ProbabilityItemto have index property.Update
ProbabilityContainerto support pickle saving and loading.Update
ProbabilityContainerto haveas_matrixmethod.Update
majority_votemethod inDecisionEncoderto be more generic (works with both labels and class IDs).Move processor classes related to encoding into separate file.
Update
loadmethod inMetaDataContainerto translate between decimal comma and point.Update
data_collectorfunction to be more generic.Update
formatted_valuemethod inFancyStringifierto support fixed length strings (stf).Refactor
SubmissionCheckerto be more flexible.Update
DCASEAppParameterContainerto support secondary data processing chain.Update
create_sequential_modelfunction to return optionally functional API Keras model instead of default Keras sequential model.Update
ProgressLoggerCallbackto print estimate of the remaining model learning time.
Bug fixes
Fix dataset class when no
remote_fileis set
v0.2.4
New features
Add
TUTUrbanAcousticScenes_2018_EvaluationSetandTUTUrbanAcousticScenes_2018_Mobile_EvaluationSetdataset classes.Add
DCASE2018_Task5_EvaluationSetdataset class.
Updates
Update
formatted_valuemethod inFancyStringifierto have full coverage of float formats (float precision from 1 to 4).
Bug fixes
Fix
TUTRareSoundEvents_2017_EvaluationSetdataset class to have correct audio path.
v0.2.3
New features
Add
AudioWritingProcessorandMonoAudioWritingProcessorprocessor classes.Add
FeatureWritingProcessorandRepositoryFeatureWritingProcessorprocessor classes.
Bug fixes
Fix
DataRepositorynot to have internal variables in the__dict__after loading container from disk.
v0.2.2
In this version external dependencies of this module are minimized. External modules required for non-core functionality is not anymore included in the setup.py, and not automatically installed. Once user uses functionality requiring these rarely used external modules and module is not found, ImportError is raised with instructions to install correct module through pip. All module requirements are still available in requirements.txt.
New features
Add
unique_source_labelsproperty toMetaDataContainer.Add
file_formatparameter to load and save method forListContainerandDictContainerto force specific file format.Add
label_listparameter toManyHotEncodingProcessor.Add
DatasetPackerclass to make DCASE styled dataset packages.Add
dataset_existshelper function to check Dataset classes.Add multi-channel audio example
audio_container_ch4.Add
TUTUrbanAcousticScenes_2018_LeaderboardSetandTUTUrbanAcousticScenes_2018_Mobile_LeaderboardSetdataset classes.
Updates
Update
Datasetclass handle also non-text file meta files by introducing parameterevaluation_setup_file_extension.Update package list handling in
Datasetto support custom package extraction parameters by extra parameterpackage_extract_parameters.Update
padmethod inAudioContainerto work with multi-channel audio.Update
compressmethod to produce split packages only if size limit is met.Update
compressmethod to return package filenames.Update
DCASE2018_Task5_DevelopmentSetdataset.
v0.2.1
New features
Add
md5andbytesproperties to FileMixin.Add two level hierarchical balancing to
validation_files_balancedmethod inAcousticSceneDataset.Add
TUTUrbanAcousticScenes_2018_DevelopmentSetandTUTUrbanAcousticScenes_2018_Mobile_DevelopmentSetdatasets.Add
float1_ci,float2_ci,float1_ci_bracket,float2_ci_bracket,float1_percentage+ciandfloat2_percentage+civalue types toformatted_valuemethod inFancyStringifier.Add
get_setmethod toAppParameterContainer.Add
data_collectorfunction to collect data and meta.
Updates
Update
debug_packagesmethod inDatasetto provide more information.Update validation subset generation methods (
validation_split,validation_files_dataset,validation_files_random, andvalidation_files_balanced) method inDataset,AcousticSceneDataset,SoundEventDataset, andAudioTaggingDatasetto allow external processing of meta data before processing throughtraining_metaparameter.Update
filtermethod inListDictContainerto allow filtering based on list of values.Update
set_labelproperty toMetaDataItem.Update
filtermethod inMetaDataContainerto usefiltermethod from parent class.Update example applications to use current API.
Update random seed setting for TensorFlow in
setup_kerasfunction.Update
dataset_factoryto handle dataset classes defined outside dcase_util.
Bug fixes
Fix
load_from_youtubemethod inAudioContainer.Fix example applications to work on Windows (Python 3.6).
v0.2.0
New features
Add
row_resetandrow_sephelper methods toFancyStringifier,FancyLogger, andFancyPrinterclasses.
Updates
Update
downloadmethod inRemoteFileto be more robust when encounter SSL problems.Update
AppParameterContainerto handleFEATURE_PROCESSING_CHAIN,DATA_PROCESSING_CHAIN.Update
filtermethod inMetaDataContainerto acceptsource_labelandsource_label_listparameters.Update
DCASE2018_Task5_DevelopmentSet.
Bug fixes
Fix
construct_pathmethod inApplicationPathsto work in Windows as well.Fix path creation in
AppParameterContainer.
v0.1.9
New features
Add new processors
FeatureReadingProcessor,DataShapingProcessor,RepositoryAggregationProcessor,RepositorySequencingProcessor, andRepositoryToMatrixProcessor.Add extract method to
SpectralFeatureExtractor.Add automatic conversion of numeric fields when loading CSV data to
ListDictContainer.Add filter and get_field_unique methods to
ListDictContainer.Add MP4 to valid audio formats for
AudioContainer.Add general path modification method (
Path.modify).Add Keras profile
cuda0_fast.Add Keras utility to create optimizer instance (create_optimizer).
Add
DCASE2018_Task5_DevelopmentSetandDCASE2013_Scenes_EvaluationSetdatasets.Add
DataMatrix4DContainer.Add
plot` method to ``DataMatrix3DContainer.Add support for a new annotation format for tags [filename][tab][tags] in
MetaDataContainer.Add zero padding to
Sequencer.Add header field override in load method of
MetaDataContainer.Add support for new compressed audio formats (OGG, MP3) in
AudioContainer.Add
segmentsmethod inAudioContainerto split signal into non-overlapping segments with optionally skipped regions.Add
padmethod inAudioContainerto pad signal into given length.Add
compressmethod inPackageMixin.Add
Packageclass to handle local compressed file packages.Add
change_axismethod toDataMatrix2DContainer,DataMatrix3DContainer, andDataMatrix4DContainer.Add
KerasDataSequenceclass for data generation through processing chain.Add support for data and meta processing chains to
DCASEAppParameterContainer.Add
many_hotmethod inDecisionEncoder.
Updates
Update
TUTRareSoundEvents_2017_DevelopmentSetandTUTRareSoundEvents_2017_EvaluationSetdatasets.Update Keras utility
model_summary_stringto use by default standard method from Keras.Update
FeatureRepositoryAPI to be aligned with Container classes.Update
Sequencer,SequencingProcessor, andRepositorySequencingProcessorAPI.Update
AppParameterContainerto allow change of active set even afterprocessmethod has been called.Update mechanism to store meta information about chain item when data is processed using processing chain.
Bug fixes
Fix
savemethod inMetaDataContainerwhen saving with tags in CSV format.Fix many methods to give more appropriate error messages.
API changes and compatibility
Sequencer,SequencingProcessor, andRepositorySequencingProcessorAPI changes:frameschanged tosequence_lengthhop_length_framestohop_lengthpaddingparameter accepts now strings (zeroandrepeat)
v0.1.8
New features
Add new formats for
MetaDataContainer(cpickle, CSV).Add forced file formats when reading and saving containers.
Add Keras setup function.
Add frame splitting method into
AudioContainer.
Bug fixes
Fix unicode string support when printing container information.
Fix data contamination through data references while manipulating data.
Some minor bug fixes.
v0.1.7
New features
Add intersection method for
MetaDataContainer.
Updates
Update dataset class API (e.g. copy returned metadata prevent accidental manipulation, uniform method names).
Bug fixes
Fix data sequencing when overlapping sequencing is used.
Fix datasets
CHiMEHome_DomesticAudioTag_DevelopmentSet,TUTAcousticScenes_2017_EvaluationSet, andTUTSoundEvents_2017_EvaluationSet.
v0.1.6
New features
Add
CHiMEHome_DomesticAudioTag_EvaluationSetdataset.
Updates
Update example audio to be 16-bit audio file in wav-format instead of FLAC used earlier.
Update
ProbabilityContainerAPI to be more compatible withMetaDataContainer.Update
MetaDataItemto be compatible with field naming used previously in DCASE baseline systems.Update ui utilities.
Bug fixes
Fix audio reading when target sampling rate is not set.
Some minor bug fixes.
v0.1.5
Fixing PYPI package.
v0.1.4
Release first PYPI package.
v0.1.0
Initial public release.