Publications

| ID | Key | Title | Authors | Year | Datasets | Tasks |
| --- | --- | --- | --- | --- | --- | --- |
| 04cbf40d0a62e43d311012defe8df452 | 2021Naranjo-Alcazar | Squeeze-Excitation Convolutional Recurrent Neural Networks for Audio-Visual Scene Classification | Javier Naranjo-Alcazar, Sergi Perez-Castanos, Maximo Cobos, Francesc J. Ferri, Pedro Zuccarello | 2021 | tau_avsc_2021_dev | AVSC |
| ead9228dbf2bda2768904fe92f8021f6 | 2021Okazaki | A Multi-Modal Fusion Approach for Audio-Visual Scene Classification Enhanced by CLIP Variants | Soichiro Okazaki, Quan Kong, Tomoaki Yoshinaga | 2021 | tau_avsc_2021_dev | AVSC |
| 8080709e6bd7c5e3e46a449b3b5a2fcb | 2021Khaled | Receptive Field Regularization Techniques for Audio Classification and Tagging with Deep Convolutional Neural Networks | Khaled Koutini, Hamid Eghbal-zadeh, Gerhard Widmer | 2021 | openmic_2018 | InsRec |