| captions/audiocaps | AudioCaps: Generating Captions for Audios in The Wild | SNU | 2019 | captions | Download |  |  |  | 
                            
                        
                            
                            
                                | captions/clotho_v2 | Clotho dataset (v2) | TAU | 2019 | captions | Download |  |  |  | 
                            
                        
                            
                            
                                | captions/macs | MACS: Multi-Annotator Captioned Soundscapes | TAU | 2021 | captions | Download |  |  |  | 
                            
                        
                            
                            
                                | captions/audiocaption | AudioCaption: Listen and Tell | SJTU | 2019 | captions | Download |  |  |  | 
                            
                        
                    
                        
                            
                            
                                | scenes/uea_noise_db_s2 | Noise DB / Series 2 | UEA | 2006 | scenes |  |  |  |  | 
                            
                        
                            
                            
                                | scenes/tut_asc_2018_lb | TUT Urban Acoustic Scenes 2018, Leaderboard dataset | TUT | 2018 | scenes | Download |  |  |  | 
                            
                        
                            
                            
                                | scenes/tau_asc_2019_eval | TAU Urban Acoustic Scenes 2019, Evaluation dataset | TAU | 2019 | scenes | Download |  |  |  | 
                            
                        
                            
                            
                                | scenes/tau_asc_2019_openset_eval | TAU Urban Acoustic Scenes 2019 Openset, Evaluation dataset | TAU | 2019 | scenes | Download |  |  |  | 
                            
                        
                            
                            
                                | scenes/tau_asc_2019_dev | TAU Urban Acoustic Scenes 2019, Development dataset | TAU | 2019 | scenes | Download |  |  |  | 
                            
                        
                            
                            
                                | scenes/tau_asc_2019_mobile_dev | TAU Urban Acoustic Scenes 2019 Mobile, Development dataset | TAU | 2019 | scenes | Download |  |  |  | 
                            
                        
                            
                            
                                | scenes/tut_asc_2018_eval | TUT Urban Acoustic Scenes 2018, Evaluation dataset | TUT | 2018 | scenes | Download |  |  |  | 
                            
                        
                            
                            
                                | scenes/extrasensory | ExtraSensory Dataset | UCSD | 2017 | scenes | Download |  |  |  | 
                            
                        
                            
                            
                                | scenes/tut_asc_2017_dev | TUT Acoustic Scenes 2017, Development dataset | TUT | 2017 | scenes | Download |  |  |  | 
                            
                        
                            
                            
                                | scenes/tut_asc_2016_eval | TUT Acoustic Scenes 2016, Evaluation dataset | TUT | 2016 | scenes | Download |  |  |  | 
                            
                        
                            
                            
                                | scenes/tau_asc_2019_lb | TAU Urban Acoustic Scenes 2019, Leaderboard dataset | TAU | 2019 | scenes | Download |  |  |  | 
                            
                        
                            
                            
                                | scenes/demand | Diverse Environments Multichannel Acoustic Noise Database | Inria | 2013 | scenes | Download |  |  |  | 
                            
                        
                            
                            
                                | scenes/tut_asc_2016_dev | TUT Acoustic Scenes 2016, Development dataset | TUT | 2016 | scenes | Download |  |  |  | 
                            
                        
                            
                            
                                | scenes/uea_noise_db_s1 | Noise DB / Series 1 | UEA | 2006 | scenes |  |  |  |  | 
                            
                        
                            
                            
                                | scenes/tut_asc_2017_eval | TUT Acoustic Scenes 2017, Evaluation dataset | TUT | 2017 | scenes | Download |  |  |  | 
                            
                        
                            
                            
                                | scenes/tut_asc_2018_mobile_eval | TUT Urban Acoustic Scenes 2018 Mobile, Evaluation dataset | TUT | 2018 | scenes | Download |  |  |  | 
                            
                        
                            
                            
                                | scenes/tut_asc_2018_dev | TUT Urban Acoustic Scenes 2018, Development dataset | TUT | 2018 | scenes | Download |  |  |  | 
                            
                        
                            
                            
                                | scenes/tau_asc_2020_mobile_eval | TAU Urban Acoustic Scenes 2020 Mobile, Evaluation dataset | TAU | 2020 | scenes | Download |  |  |  | 
                            
                        
                            
                            
                                | scenes/tau_asc_2021_mobile_eval | TAU Urban Acoustic Scenes 2021 Mobile, Evaluation dataset | TAU | 2021 | scenes | Download |  |  |  | 
                            
                        
                            
                            
                                | scenes/tau_asc_2019_openset_dev | TAU Urban Acoustic Scenes 2019 Openset, Development dataset | TAU | 2019 | scenes | Download |  |  |  | 
                            
                        
                            
                            
                                | scenes/dcase2013_public | IEEE AASP CASA Challenge - Public Dataset for Scene Classification Task | IEEE AASP Challenge 2013 | 2012 | scenes | Download |  |  |  | 
                            
                        
                            
                            
                                | scenes/tut_asc_2018_mobile_dev | TUT Urban Acoustic Scenes 2018 Mobile, Development dataset | TUT | 2018 | scenes | Download |  |  |  | 
                            
                        
                            
                            
                                | scenes/tut_asc_2018_mobile_lb | TUT Urban Acoustic Scenes 2018 Mobile, Leaderboard dataset | TUT | 2018 | scenes | Download |  |  |  | 
                            
                        
                            
                            
                                | scenes/tau_asc_2019_mobile_eval | TAU Urban Acoustic Scenes 2019 Mobile, Evaluation dataset | TAU | 2019 | scenes | Download |  |  |  | 
                            
                        
                            
                            
                                | scenes/whisper | WSJ0 Hipster Ambient Mixtures noise dataset | Whisper/MERL | 2019 | scenes | Download |  |  |  | 
                            
                        
                            
                            
                                | scenes/cochlscene | CochlScene: Acquisition of Acoustic Scene Data Using Crowdsourcing | Cochl | 2022 | scenes | Download |  |  |  | 
                            
                        
                            
                            
                                | scenes/dcase2013_private | IEEE AASP CASA Challenge - Private Dataset for Scene Classification Task | IEEE AASP Challenge 2013 | 2013 | scenes | Download |  |  |  | 
                            
                        
                            
                        
                            
                            
                                | scenes/tau_asc_2019_mobile_lb | TAU Urban Acoustic Scenes 2019 Mobile, Leaderboard dataset | TAU | 2019 | scenes | Download |  |  |  | 
                            
                        
                            
                            
                                | scenes/tau_avsc_2021_eval | TAU Urban Audio-Visual Scenes 2021, Evaluation dataset | TAU | 2021 | scenes | Download |  |  |  | 
                            
                        
                            
                            
                                | scenes/tau_asc_2020_3class_eval | TAU Urban Acoustic Scenes 2020 3Class, Evaluation dataset | TAU | 2020 | scenes | Download |  |  |  | 
                            
                        
                            
                            
                                | scenes/tau_asc_2020_mobile_dev | TAU Urban Acoustic Scenes 2020 Mobile, Development dataset | TAU | 2020 | scenes | Download |  |  |  | 
                            
                        
                            
                            
                                | scenes/aucodefr2007 | AucoDefr07 | AucoDefr07 | 2015 | scenes | Download |  |  |  | 
                            
                        
                            
                            
                                | scenes/tau_asc_2020_3class_dev | TAU Urban Acoustic Scenes 2020 3Class, Development dataset | TAU | 2020 | scenes | Download |  |  |  | 
                            
                        
                            
                            
                                | scenes/tau_avsc_2021_dev | TAU Urban Audio-Visual Scenes 2021, Development dataset | TAU | 2021 | scenes | Download |  |  |  | 
                            
                        
                            
                            
                                | scenes/tau_asc_2019_openset_lb | TAU Urban Acoustic Scenes 2019 Openset, Leaderboard dataset | TAU | 2019 | scenes | Download |  |  |  | 
                            
                        
                    
                        
                            
                            
                                | sounds/eth_aed | ETH Acoustic Event Dataset | ETH | 2016 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/tau_mats | MATS - Multi-Annotator Tagged Soundscapes | TAU | 2021 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/usm_sed | USM-SED Dataset - A Dataset for Polyphonic Sound Event Detection in Urban Sound Monitoring Scenarios | Fraunhofer IDMT | 2021 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/urbansas | Urban Sound & Sight | NYU | 2022 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/audiosetzsl | AudioSetZSL | IIT Kanpur | 2020 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/desed2019_eval_real | DESED public evaluation dataset | None | 2019 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/wild_desed | WildDESED | Fortemedia | 2024 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/mavd | MAVD-traffic dataset | MAVD | 2019 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/fsl_osr | An Open-set Recognition and Few-Shot Learning Dataset for Audio Event Classification in Domestic Environments | Visualfy, Universitat de Valencia | 2021 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/wearable_seld_mounting | Wearable SELD Mounting | NTT Media Intelligence Laboratories | 2022 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/dcase2013_private_live | IEEE AASP CASA Challenge - Development Dataset for Event Detection Task (subtask OL) | IEEE AASP Challenge 2013 | 2012 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/urbansound | UrbanSound | NYU | 2014 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/nonspeech7k | Nonspeech7k | South China University of Technology | 2023 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/tut_rare_sound_events_2017_dev | TUT Rare sound events 2017, Development dataset | TUT | 2017 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/audioset | AudioSet | Google | 2017 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/inria_nar | NAR | INRIA | 2014 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/epic_sounds | EPIC-SOUNDS | University of Oxford, University of Bristol | 2023 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/spass | SPASS dataset: A synthetic polyphonic dataset with spatiotemporal labels of sound sources | SPASS | 2023 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/wearable_seld_ear | Wearable SELD Earphone | NTT Media Intelligence Laboratories | 2022 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/dcase2016_task2_eval | IEEE DCASE 2016 Challenge - Task 2 - Test Dataset | IRCCYN | 2016 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/idmt_urban_fl | IDMT-URBAN-FL | Fraunhofer IDMT | 2021 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/tu_dortmund_sed | Multi-channel acoustic event dataset | TU Dortmund | 2016 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/tau_nigens_spatial_events_2020 | TAU-NIGENS Spatial Sound Events 2020 | TAU | 2020 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/tut_sound_events_2017_dev | TUT Sound events 2017, Development dataset | TUT | 2017 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/desed2019_dev_real | DESED development dataset (real recordings) | None | 2019 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/tut_rare_sound_events_2017_eval | TUT Rare sound events 2017, Evaluation dataset | TUT | 2017 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/fsdnoisy18k | FSDnoisy18k | UPF | 2019 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/tau_spatial_events_2019_dev | TAU Spatial Sound Events 2019 - Ambisonic and Microphone Array, Development Datasets | TAU | 2019 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/tut_synthetic_2016 | TUT-SED Synthetic 2016 | TUT | 2016 | sounds |  |  |  |  | 
                            
                        
                            
                            
                                | sounds/idmt_desed_fl | IDMT-DESED-FL | Fraunhofer IDMT | 2021 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/fsdkaggle2019 | FSDKaggle2019 | UPF | 2019 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/rwsp | RWCP Sound Scene Database | NII-SRC | 2000 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/esc_50 | Dataset for Environmental Sound Classification, 50 classes | ESC | 2015 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/dcase2013_public_synthetic | IEEE AASP CASA Challenge - Development Dataset for Event Detection Task (subtask OS) | IEEE AASP Challenge 2013 | 2015 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/ave | AVE: The Audio-Visual Event Dataset | UR | 2018 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/upc_talp | UPC-TALP database of isolated meeting-room acoustic events | CHIL | 2008 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/sins | SINS database | KU Leuven | 2017 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/audioset_temporal | AudioSet with Temporally-Strong Labels | Google | 2021 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/tau_spatial_events_2019_eval | TAU Spatial Sound Events 2019 - Ambisonic and Microphone Array, Evaluation Datasets | TAU | 2019 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/msos | Making Sense of Sounds | CVSSP | 2018 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/asped | Audio Sensing for Pedestrian Detection Dataset | None | 2024 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/iusd | Isolated urban sound database | None | 2018 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/tut_sound_events_2016_dev | TUT Sound events 2016, Development dataset | TUT | 2016 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/urbansound8k | UrbanSound8K | NYU | 2014 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/bsd10k | BSD10k | UPF | 2024 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/dcase2013_private_synthetic | IEEE AASP CASA Challenge - Testing Dataset for Event Detection Task (subtask OS) | IEEE AASP Challenge 2013 | 2015 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/ybss200 | YBSS-200 | YBSS | 2019 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/esc_us | Dataset for Environmental Sound Classification, unlabeled dataset | ESC | 2015 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/arca23k | ARCA23K | CVSSP | 2021 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/mivia_aed | Audio Events Data Set for Surveillance Applications | MIVIA | 2015 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/urbansed | URBAN-SED | NYU | 2017 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/fbk_irst | FBK-Irst database of isolated meeting-room acoustic events | CHIL | 2009 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/sonyc | SONYC Urban Sound Tagging | NYU | 2020 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/afpild | Acoustic footstep dataset collected using one microphone array and LiDAR sensor for person identification and localization | None | 2023 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/dcase2018_task5_dev | DCASE 2018, Task 5: Monitoring of domestic activities based on multi-channel acoustics - Development dataset | KU Leuven | 2018 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/dcase2013_public_live | IEEE AASP CASA Challenge - Development Dataset for Event Detection Task (subtask OL) | IEEE AASP Challenge 2013 | 2012 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/tut_sound_events_2017_eval | TUT Sound events 2017, Evaluation dataset | TUT | 2017 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/chime_home | CHiME-Home, development & evaluation dataset | QMUL | 2015 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/desed2019_eval_synthetic | DESED public evaluation dataset | None | 2019 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/nigens_sound_events | NIGENS general sound events database | NIGENS | 2019 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/dcase2017_task4_dev | DCASE2017 task 4 development dataset | DCASE | 2017 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/dcase2016_task2_dev | IEEE DCASE 2016 Challenge - Task 2 - Train/Development Datasets | IRCCYN | 2016 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/desed2019_dev_synthetic | DESED development dataset (synthetic clips) | None | 2019 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/tau_nigens_spatial_events_2021 | TAU-NIGENS Spatial Sound Events 2021 | TAU | 2021 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/idmt_traffic | IDMT-Traffic: An Open Benchmark Dataset for Acoustic Traffic Monitoring Research | Fraunhofer IDMT | 2021 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/dcase2017_task4_eval | DCASE2017 task 4 evaluation dataset | DCASE | 2017 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/esc_10 | Dataset for Environmental Sound Classification, 10 classes | ESC | 2015 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/voice | VOICe Dataset | TAU | 2020 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/tut_sound_events_2016_eval | TUT Sound events 2016, Evaluation dataset | TUT | 2016 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/arca23k_fsd | ARCA23K-FSD | CVSSP | 2021 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/dcase2018_task5_eval | DCASE 2018, Task 5: Monitoring of domestic activities based on multi-channel acoustics - Evaluation dataset | KU Leuven | 2018 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/desed2020_eval_sed | Evaluation set DCASE 2020 task 4 | None | 2020 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/mivia_aed_road | Audio Events Data Set for Road Surveillance Applications | MIVIA | 2014 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/qmul_freefield1010 | Freefield1010 | QMUL | 2013 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/fsd50k | FSD50K | UPF | 2020 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/dcase2013_public_isolated | IEEE AASP CASA Challenge - Training Dataset for Event Detection Task (subtasks OL, OS) | IEEE AASP Challenge 2013 | 2012 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/freiburg106 | Freiburg-106, Audio Data Set for Human Activity Recognition | Freiburg | 2012 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/vgg_sound | VGGSound | VGG | 2020 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/wearable_seld_foa | Wearable SELD FOA | NTT Media Intelligence Laboratories | 2022 | sounds | Download |  |  |  | 
                            
                        
                            
                            
                                | sounds/dcase2018_task4 | DCASE2018 Task 4 dataset | None | 2018 | sounds | Download |  |  |  | 
                            
                        
                    
                        
                            
                            
                                | anomalous/toyadmos | ToyADMOS | NTT | 2019 | anomalous | Download |  |  |  | 
                            
                        
                            
                            
                                | anomalous/imad-ds | IMAD-DS | STMicroelectronics | 2024 | anomalous | Download |  |  |  |