captions/audiocaps |
AudioCaps: Generating Captions for Audios in The Wild |
SNU |
2019 |
captions |
Download
|
|
|
|
captions/clotho_v2 |
Clotho dataset (v2) |
TAU |
2019 |
captions |
Download
|
|
|
|
captions/macs |
MACS: Multi-Annotator Captioned Soundscapes |
TAU |
2021 |
captions |
Download
|
|
|
|
captions/audiocaption |
AudioCaption: Listen and Tell |
SJTU |
2019 |
captions |
Download
|
|
|
|
scenes/uea_noise_db_s2 |
Noise DB / Series 2 |
UEA |
2006 |
scenes |
|
|
|
|
scenes/tut_asc_2018_lb |
TUT Urban Acoustic Scenes 2018, Leaderboard dataset |
TUT |
2018 |
scenes |
Download
|
|
|
|
scenes/tau_asc_2019_eval |
TAU Urban Acoustic Scenes 2019, Evaluation dataset |
TAU |
2019 |
scenes |
Download
|
|
|
|
scenes/tau_asc_2019_openset_eval |
TAU Urban Acoustic Scenes 2019 Openset, Evaluation dataset |
TAU |
2019 |
scenes |
Download
|
|
|
|
scenes/tau_asc_2019_dev |
TAU Urban Acoustic Scenes 2019, Development dataset |
TAU |
2019 |
scenes |
Download
|
|
|
|
scenes/tau_asc_2019_mobile_dev |
TAU Urban Acoustic Scenes 2019 Mobile, Development dataset |
TAU |
2019 |
scenes |
Download
|
|
|
|
scenes/tut_asc_2018_eval |
TUT Urban Acoustic Scenes 2018, Evaluation dataset |
TUT |
2018 |
scenes |
Download
|
|
|
|
scenes/extrasensory |
ExtraSensory Dataset |
UCSD |
2017 |
scenes |
Download
|
|
|
|
scenes/tut_asc_2017_dev |
TUT Acoustic Scenes 2017, Development dataset |
TUT |
2017 |
scenes |
Download
|
|
|
|
scenes/tut_asc_2016_eval |
TUT Acoustic Scenes 2016, Evaluation dataset |
TUT |
2016 |
scenes |
Download
|
|
|
|
scenes/tau_asc_2019_lb |
TAU Urban Acoustic Scenes 2019, Leaderboard dataset |
TAU |
2019 |
scenes |
Download
|
|
|
|
scenes/demand |
Diverse Environments Multichannel Acoustic Noise Database |
Inria |
2013 |
scenes |
Download
|
|
|
|
scenes/tut_asc_2016_dev |
TUT Acoustic Scenes 2016, Development dataset |
TUT |
2016 |
scenes |
Download
|
|
|
|
scenes/uea_noise_db_s1 |
Noise DB / Series 1 |
UEA |
2006 |
scenes |
|
|
|
|
scenes/tut_asc_2017_eval |
TUT Acoustic Scenes 2017, Evaluation dataset |
TUT |
2017 |
scenes |
Download
|
|
|
|
scenes/tut_asc_2018_mobile_eval |
TUT Urban Acoustic Scenes 2018 Mobile, Evaluation dataset |
TUT |
2018 |
scenes |
Download
|
|
|
|
scenes/tut_asc_2018_dev |
TUT Urban Acoustic Scenes 2018, Development dataset |
TUT |
2018 |
scenes |
Download
|
|
|
|
scenes/tau_asc_2020_mobile_eval |
TAU Urban Acoustic Scenes 2020 Mobile, Evaluation dataset |
TAU |
2020 |
scenes |
Download
|
|
|
|
scenes/tau_asc_2021_mobile_eval |
TAU Urban Acoustic Scenes 2021 Mobile, Evaluation dataset |
TAU |
2021 |
scenes |
Download
|
|
|
|
scenes/tau_asc_2019_openset_dev |
TAU Urban Acoustic Scenes 2019 Openset, Development dataset |
TAU |
2019 |
scenes |
Download
|
|
|
|
scenes/dcase2013_public |
IEEE AASP CASA Challenge - Public Dataset for Scene Classification Task |
IEEE AASP Challenge 2013 |
2012 |
scenes |
Download
|
|
|
|
scenes/tut_asc_2018_mobile_dev |
TUT Urban Acoustic Scenes 2018 Mobile, Development dataset |
TUT |
2018 |
scenes |
Download
|
|
|
|
scenes/tut_asc_2018_mobile_lb |
TUT Urban Acoustic Scenes 2018 Mobile, Leaderboard dataset |
TUT |
2018 |
scenes |
Download
|
|
|
|
scenes/tau_asc_2019_mobile_eval |
TAU Urban Acoustic Scenes 2019 Mobile, Evaluation dataset |
TAU |
2019 |
scenes |
Download
|
|
|
|
scenes/whisper |
WSJ0 Hipster Ambient Mixtures noise dataset |
Whisper/MERL |
2019 |
scenes |
Download
|
|
|
|
scenes/cochlscene |
CochlScene: Acquisition of Acoustic Scene Data Using Crowdsourcing |
Cochl |
2022 |
scenes |
Download
|
|
|
|
scenes/dcase2013_private |
IEEE AASP CASA Challenge - Private Dataset for Scene Classification Task |
IEEE AASP Challenge 2013 |
2013 |
scenes |
Download
|
|
|
|
scenes/tau_asc_2019_mobile_lb |
TAU Urban Acoustic Scenes 2019 Mobile, Leaderboard dataset |
TAU |
2019 |
scenes |
Download
|
|
|
|
scenes/tau_avsc_2021_eval |
TAU Urban Audio-Visual Scenes 2021, Evaluation dataset |
TAU |
2021 |
scenes |
Download
|
|
|
|
scenes/tau_asc_2020_3class_eval |
TAU Urban Acoustic Scenes 2020 3Class, Evaluation dataset |
TAU |
2020 |
scenes |
Download
|
|
|
|
scenes/tau_asc_2020_mobile_dev |
TAU Urban Acoustic Scenes 2020 Mobile, Development dataset |
TAU |
2020 |
scenes |
Download
|
|
|
|
scenes/aucodefr2007 |
AucoDefr07 |
AucoDefr07 |
2015 |
scenes |
Download
|
|
|
|
scenes/tau_asc_2020_3class_dev |
TAU Urban Acoustic Scenes 2020 3Class, Development dataset |
TAU |
2020 |
scenes |
Download
|
|
|
|
scenes/tau_avsc_2021_dev |
TAU Urban Audio-Visual Scenes 2021, Development dataset |
TAU |
2021 |
scenes |
Download
|
|
|
|
scenes/tau_asc_2019_openset_lb |
TAU Urban Acoustic Scenes 2019 Openset, Leaderboard dataset |
TAU |
2019 |
scenes |
Download
|
|
|
|
sounds/eth_aed |
ETH Acoustic Event Dataset |
ETH |
2016 |
sounds |
Download
|
|
|
|
sounds/tau_mats |
MATS - Multi-Annotator Tagged Soundscapes |
TAU |
2021 |
sounds |
Download
|
|
|
|
sounds/usm_sed |
USM-SED Dataset - A Dataset for Polyphonic Sound Event Detection in Urban Sound Monitoring Scenarios |
Fraunhofer IDMT |
2021 |
sounds |
Download
|
|
|
|
sounds/urbansas |
Urban Sound & Sight |
NYU |
2022 |
sounds |
Download
|
|
|
|
sounds/audiosetzsl |
AudioSetZSL |
IIT Kanpur |
2020 |
sounds |
Download
|
|
|
|
sounds/desed2019_eval_real |
DESED public evaluation dataset |
None |
2019 |
sounds |
Download
|
|
|
|
sounds/wild_desed |
WildDESED |
Fortemedia |
2024 |
sounds |
Download
|
|
|
|
sounds/mavd |
MAVD-traffic dataset |
MAVD |
2019 |
sounds |
Download
|
|
|
|
sounds/fsl_osr |
An Open-set Recognition and Few-Shot Learning Dataset for Audio Event Classification in Domestic Environments |
Visualfy, Universitat de Valencia |
2021 |
sounds |
Download
|
|
|
|
sounds/wearable_seld_mounting |
Wearable SELD Mounting |
NTT Media Intelligence Laboratories |
2022 |
sounds |
Download
|
|
|
|
sounds/dcase2013_private_live |
IEEE AASP CASA Challenge - Development Dataset for Event Detection Task (subtask OL) |
IEEE AASP Challenge 2013 |
2012 |
sounds |
Download
|
|
|
|
sounds/urbansound |
UrbanSound |
NYU |
2014 |
sounds |
Download
|
|
|
|
sounds/nonspeech7k |
Nonspeech7k |
South China University of Technology |
2023 |
sounds |
Download
|
|
|
|
sounds/tut_rare_sound_events_2017_dev |
TUT Rare sound events 2017, Development dataset |
TUT |
2017 |
sounds |
Download
|
|
|
|
sounds/audioset |
AudioSet |
Google |
2017 |
sounds |
Download
|
|
|
|
sounds/inria_nar |
NAR |
INRIA |
2014 |
sounds |
Download
|
|
|
|
sounds/epic_sounds |
EPIC-SOUNDS |
University of Oxford, University of Bristol |
2023 |
sounds |
Download
|
|
|
|
sounds/spass |
SPASS dataset: A synthetic polyphonic dataset with spatiotemporal labels of sound sources |
SPASS |
2023 |
sounds |
Download
|
|
|
|
sounds/wearable_seld_ear |
Wearable SELD Earphone |
NTT Media Intelligence Laboratories |
2022 |
sounds |
Download
|
|
|
|
sounds/dcase2016_task2_eval |
IEEE DCASE 2016 Challenge - Task 2 - Test Dataset |
IRCCYN |
2016 |
sounds |
Download
|
|
|
|
sounds/idmt_urban_fl |
IDMT-URBAN-FL |
Fraunhofer IDMT |
2021 |
sounds |
Download
|
|
|
|
sounds/tu_dortmund_sed |
Multi-channel acoustic event dataset |
TU Dortmund |
2016 |
sounds |
Download
|
|
|
|
sounds/tau_nigens_spatial_events_2020 |
TAU-NIGENS Spatial Sound Events 2020 |
TAU |
2020 |
sounds |
Download
|
|
|
|
sounds/tut_sound_events_2017_dev |
TUT Sound events 2017, Development dataset |
TUT |
2017 |
sounds |
Download
|
|
|
|
sounds/desed2019_dev_real |
DESED development dataset (real recordings) |
None |
2019 |
sounds |
Download
|
|
|
|
sounds/tut_rare_sound_events_2017_eval |
TUT Rare sound events 2017, Evaluation dataset |
TUT |
2017 |
sounds |
Download
|
|
|
|
sounds/fsdnoisy18k |
FSDnoisy18k |
UPF |
2019 |
sounds |
Download
|
|
|
|
sounds/tau_spatial_events_2019_dev |
TAU Spatial Sound Events 2019 - Ambisonic and Microphone Array, Development Datasets |
TAU |
2019 |
sounds |
Download
|
|
|
|
sounds/tut_synthetic_2016 |
TUT-SED Synthetic 2016 |
TUT |
2016 |
sounds |
|
|
|
|
sounds/idmt_desed_fl |
IDMT-DESED-FL |
Fraunhofer IDMT |
2021 |
sounds |
Download
|
|
|
|
sounds/fsdkaggle2019 |
FSDKaggle2019 |
UPF |
2019 |
sounds |
Download
|
|
|
|
sounds/rwsp |
RWCP Sound Scene Database |
NII-SRC |
2000 |
sounds |
Download
|
|
|
|
sounds/esc_50 |
Dataset for Environmental Sound Classification, 50 classes |
ESC |
2015 |
sounds |
Download
|
|
|
|
sounds/dcase2013_public_synthetic |
IEEE AASP CASA Challenge - Development Dataset for Event Detection Task (subtask OS) |
IEEE AASP Challenge 2013 |
2015 |
sounds |
Download
|
|
|
|
sounds/ave |
AVE: The Audio-Visual Event Dataset |
UR |
2018 |
sounds |
Download
|
|
|
|
sounds/upc_talp |
UPC-TALP database of isolated meeting-room acoustic events |
CHIL |
2008 |
sounds |
Download
|
|
|
|
sounds/sins |
SINS database |
KU Leuven |
2017 |
sounds |
Download
|
|
|
|
sounds/audioset_temporal |
AudioSet with Temporally-Strong Labels |
Google |
2021 |
sounds |
Download
|
|
|
|
sounds/tau_spatial_events_2019_eval |
TAU Spatial Sound Events 2019 - Ambisonic and Microphone Array, Evaluation Datasets |
TAU |
2019 |
sounds |
Download
|
|
|
|
sounds/msos |
Making Sense of Sounds |
CVSSP |
2018 |
sounds |
Download
|
|
|
|
sounds/asped |
Audio Sensing for Pedestrian Detection Dataset |
None |
2024 |
sounds |
Download
|
|
|
|
sounds/iusd |
Isolated urban sound database |
None |
2018 |
sounds |
Download
|
|
|
|
sounds/tut_sound_events_2016_dev |
TUT Sound events 2016, Development dataset |
TUT |
2016 |
sounds |
Download
|
|
|
|
sounds/urbansound8k |
UrbanSound8K |
NYU |
2014 |
sounds |
Download
|
|
|
|
sounds/bsd10k |
BSD10k |
UPF |
2024 |
sounds |
Download
|
|
|
|
sounds/dcase2013_private_synthetic |
IEEE AASP CASA Challenge - Testing Dataset for Event Detection Task (subtask OS) |
IEEE AASP Challenge 2013 |
2015 |
sounds |
Download
|
|
|
|
sounds/ybss200 |
YBSS-200 |
YBSS |
2019 |
sounds |
Download
|
|
|
|
sounds/esc_us |
Dataset for Environmental Sound Classification, unlabeled dataset |
ESC |
2015 |
sounds |
Download
|
|
|
|
sounds/arca23k |
ARCA23K |
CVSSP |
2021 |
sounds |
Download
|
|
|
|
sounds/mivia_aed |
Audio Events Data Set for Surveillance Applications |
MIVIA |
2015 |
sounds |
Download
|
|
|
|
sounds/urbansed |
URBAN-SED |
NYU |
2017 |
sounds |
Download
|
|
|
|
sounds/fbk_irst |
FBK-Irst database of isolated meeting-room acoustic events |
CHIL |
2009 |
sounds |
Download
|
|
|
|
sounds/sonyc |
SONYC Urban Sound Tagging |
NYU |
2020 |
sounds |
Download
|
|
|
|
sounds/afpild |
Acoustic footstep dataset collected using one microphone array and LiDAR sensor for person identification and localization |
None |
2023 |
sounds |
Download
|
|
|
|
sounds/dcase2018_task5_dev |
DCASE 2018, Task 5: Monitoring of domestic activities based on multi-channel acoustics - Development dataset |
KU Leuven |
2018 |
sounds |
Download
|
|
|
|
sounds/dcase2013_public_live |
IEEE AASP CASA Challenge - Development Dataset for Event Detection Task (subtask OL) |
IEEE AASP Challenge 2013 |
2012 |
sounds |
Download
|
|
|
|
sounds/tut_sound_events_2017_eval |
TUT Sound events 2017, Evaluation dataset |
TUT |
2017 |
sounds |
Download
|
|
|
|
sounds/chime_home |
CHiME-Home, development & evaluation dataset |
QMUL |
2015 |
sounds |
Download
|
|
|
|
sounds/desed2019_eval_synthetic |
DESED public evaluation dataset |
None |
2019 |
sounds |
Download
|
|
|
|
sounds/nigens_sound_events |
NIGENS general sound events database |
NIGENS |
2019 |
sounds |
Download
|
|
|
|
sounds/dcase2017_task4_dev |
DCASE2017 task 4 development dataset |
DCASE |
2017 |
sounds |
Download
|
|
|
|
sounds/dcase2016_task2_dev |
IEEE DCASE 2016 Challenge - Task 2 - Train/Development Datasets |
IRCCYN |
2016 |
sounds |
Download
|
|
|
|
sounds/desed2019_dev_synthetic |
DESED development dataset (synthetic clips) |
None |
2019 |
sounds |
Download
|
|
|
|
sounds/tau_nigens_spatial_events_2021 |
TAU-NIGENS Spatial Sound Events 2021 |
TAU |
2021 |
sounds |
Download
|
|
|
|
sounds/idmt_traffic |
IDMT-Traffic: An Open Benchmark Dataset for Acoustic Traffic Monitoring Research |
Fraunhofer IDMT |
2021 |
sounds |
Download
|
|
|
|
sounds/dcase2017_task4_eval |
DCASE2017 task 4 evaluation dataset |
DCASE |
2017 |
sounds |
Download
|
|
|
|
sounds/esc_10 |
Dataset for Environmental Sound Classification, 10 classes |
ESC |
2015 |
sounds |
Download
|
|
|
|
sounds/voice |
VOICe Dataset |
TAU |
2020 |
sounds |
Download
|
|
|
|
sounds/tut_sound_events_2016_eval |
TUT Sound events 2016, Evaluation dataset |
TUT |
2016 |
sounds |
Download
|
|
|
|
sounds/arca23k_fsd |
ARCA23K-FSD |
CVSSP |
2021 |
sounds |
Download
|
|
|
|
sounds/dcase2018_task5_eval |
DCASE 2018, Task 5: Monitoring of domestic activities based on multi-channel acoustics - Evaluation dataset |
KU Leuven |
2018 |
sounds |
Download
|
|
|
|
sounds/desed2020_eval_sed |
Evaluation set DCASE 2020 task 4 |
None |
2020 |
sounds |
Download
|
|
|
|
sounds/mivia_aed_road |
Audio Events Data Set for Road Surveillance Applications |
MIVIA |
2014 |
sounds |
Download
|
|
|
|
sounds/qmul_freefield1010 |
Freefield1010 |
QMUL |
2013 |
sounds |
Download
|
|
|
|
sounds/fsd50k |
FSD50K |
UPF |
2020 |
sounds |
Download
|
|
|
|
sounds/dcase2013_public_isolated |
IEEE AASP CASA Challenge - Training Dataset for Event Detection Task (subtasks OL, OS) |
IEEE AASP Challenge 2013 |
2012 |
sounds |
Download
|
|
|
|
sounds/freiburg106 |
Freiburg-106, Audio Data Set for Human Activity Recognition |
Freiburg |
2012 |
sounds |
Download
|
|
|
|
sounds/vgg_sound |
VGGSound |
VGG |
2020 |
sounds |
Download
|
|
|
|
sounds/wearable_seld_foa |
Wearable SELD FOA |
NTT Media Intelligence Laboratories |
2022 |
sounds |
Download
|
|
|
|
sounds/dcase2018_task4 |
DCASE2018 Task 4 dataset |
None |
2018 |
sounds |
Download
|
|
|
|
anomalous/toyadmos |
ToyADMOS |
NTT |
2019 |
anomalous |
Download
|
|
|
|
anomalous/imad-ds |
IMAD-DS |
STMicroelectronics |
2024 |
anomalous |
Download
|
|
|
|