Utilities for Detection and Classification of Acoustic Scenes

This document describes the collection of utilities created for Detection and Classification of Acoustic Scenes and Events (DCASE). These utilities were originally created for the DCASE challenge baseline systems (2016 & 2017) and are bundled into a standalone library to allow their re-usage in other research projects.

The main goal of the utilities is to streamline the research code, make it more readable, and easier to maintain. Most of the implemented utilities are related to audio datasets: handling meta data and various forms of other structured data, and provide standardized usage API to audio datasets from various sources.

Initial version written by Toni Heittola from Audio Research Group, Tampere University, you can contact him via personal website or github.

Getting started

Easiest to get started with the library is to use pip to install the latest stable release:

pip install dcase_util

Alternative installation methods can be found in the installation instructions.

See tutorials how to use dcase_util.


If you have questions about how to use dcase_util, please consult the discussion forum. For bug reports and other, more technical issues, consult the github issues.



Indices and tables