Datasets
Classes for dataset handling
Dataset
dcase_util.datasets.Dataset
This is the base class, and all the specialized datasets are inherited from it. One should never use base class itself.
Usage examples:
1# Create class
2dataset = TUTAcousticScenes_2017_DevelopmentSet(data_path='data')
3# Initialize dataset, this will make sure dataset is downloaded, packages are extracted,
4# and needed meta files are created
5dataset.initialize()
6# Show meta data
7dataset.meta.show()
8# Get all evaluation setup folds
9folds = dataset.folds()
10# Get all evaluation setup folds
11train_data_fold1 = dataset.train(fold=folds[0])
12test_data_fold1 = dataset.test(fold=folds[0])
|
Dataset base class |
Initialize the dataset, download, extract files and prepare the dataset for the usage. |
|
|
Download dataset packages over the internet to the local path |
|
Extract the dataset packages |
|
Debug remote packages associated to the dataset. |
Prepare dataset for the usage. |
|
|
Process single meta data item |
Generates hash from file list and check does it matches with one saved in filelist.hash. |
|
|
Show dataset information. |
|
Log dataset information. |
Load dataset meta data and cross-validation sets into the container. |
|
Load meta data into the container. |
|
Load cross-validation into the container. |
|
Get all audio files in the dataset |
|
Get number of audio files in dataset |
|
Get meta data for dataset. |
|
Number of meta data items. |
|
Get audio error meta data for dataset. |
|
Number of error meta data items. |
|
|
List of fold ids |
Number of fold in the evaluation setup. |
|
Evaluation setup filename generation. |
|
|
List of training items. |
|
List of testing items. |
|
List of evaluation items. |
|
List of training files. |
|
List of testing files. |
|
List of evaluation files. |
|
List of validation files. |
|
List of validation files delivered by the dataset. |
|
List of validation files selected randomly from the training material. |
|
List of validation files randomly selecting while maintaining data balance. |
List of unique scene labels in the meta data. |
|
Number of unique scene labels in the meta data. |
|
|
List of unique event labels in the meta data. |
|
Number of unique event labels in the meta data. |
List of unique audio tags in the meta data. |
|
Number of unique audio tags in the meta data. |
|
|
Meta data for given file |
|
Error meta data for given file |
|
Pre-calculated acoustic features for given file |
Converts relative path into absolute path. |
|
Converts absolute path into relative path. |
|
Total download size of the dataset in bytes. |
|
Total download size of the dataset in a string. |
|
Total size of the dataset currently stored locally. |
AcousticSceneDataset
dcase_util.datasets.AcousticSceneDataset
|
Acoustic scene dataset baseclass |
Specialized classes inherited AcousticSceneDataset:
TUT Acoustic scenes 2016 development dataset |
|
TUT Acoustic scenes 2016 evaluation dataset |
|
TUT Acoustic scenes 2017 development dataset |
|
TUT Acoustic scenes 2017 evaluation dataset |
|
TUT Acoustic scenes 2017 features dataset |
|
TUT Urban Acoustic Scenes 2018 Development dataset |
|
TUT Urban Acoustic Scenes 2018 Leaderboard dataset |
|
TUT Urban Acoustic Scenes 2018 Evaluation dataset |
|
TUT Urban Acoustic Scenes 2018 Mobile Development dataset |
|
TUT Urban Acoustic Scenes 2018 Mobile Leaderboard dataset |
|
TUT Urban Acoustic Scenes 2018 Mobile Evaluation dataset |
|
TAU Urban Acoustic Scenes 2019 Development dataset |
|
TAU Urban Acoustic Scenes 2019 Leaderboard dataset |
|
TAU Urban Acoustic Scenes 2019 Evaluation dataset |
|
TAU Urban Acoustic Scenes 2019 Mobile Development dataset |
|
TAU Urban Acoustic Scenes 2019 Mobile Leaderboard dataset |
|
TAU Urban Acoustic Scenes 2019 Mobile Evaluation dataset |
|
TAU Urban Acoustic Scenes 2019 Open set Development dataset |
|
TAU Urban Acoustic Scenes 2019 Open set Leaderboard dataset |
|
TAU Urban Acoustic Scenes 2019 Open set Evaluation dataset |
|
TAU Urban Acoustic Scenes 2020 Mobile Development dataset |
|
TAU Urban Acoustic Scenes 2020 Mobile Evaluation dataset |
|
TAU Urban Acoustic Scenes 2020 3Class Development dataset |
|
TAU Urban Acoustic Scenes 2020 3Class Evaluation dataset |
|
TAU Urban Audio-Visual Scenes 2021 Development dataset |
|
TAU Urban Audio-Visual Scenes 2021 Evaluation dataset |
|
TAU Urban Acoustic Scenes 2021 Mobile Evaluation dataset |
|
TAU Urban Acoustic Scenes 2022 Mobile Development dataset |
|
TAU Urban Acoustic Scenes 2022 Mobile Evaluation dataset |
|
Task 5, Monitoring of domestic activities based on multi-channel acoustics, development set |
|
Task 5, Monitoring of domestic activities based on multi-channel acoustics, evaluation set |
SoundEventDataset
dcase_util.datasets.SoundEventDataset
|
Sound event dataset baseclass |
Number of unique scene labels in the meta data. |
|
|
List of unique event labels in the meta data. |
|
List of training items. |
|
List of testing items. |
Specialized classes inherited SoundEventDataset:
TUT Acoustic scenes 2017 development dataset |
|
TUT Acoustic scenes 2017 evaluation dataset |
|
TUT Sound events 2017 development dataset |
|
TUT Sound events 2017 evaluation dataset |
|
TUT Sound events 2016 development dataset |
|
TUT Sound events 2016 evaluation dataset |
|
|
TUT SED Synthetic 2016 |
AudioTaggingDataset
dcase_util.datasets.AudioTaggingDataset
|
Audio tag dataset baseclass |
DCASE 2017 Large-scale weakly supervised sound event detection for smart cars |
|
DCASE 2017 Large-scale weakly supervised sound event detection for smart cars |
|
Constructor |
Helpers
dcase_util.datasets.
Helper functions to access Dataset classes.
|
List of datasets available |
|
Factory to get correct dataset class based on name |
|
Check dataset class based on name |