Fonseca, Eduardo; Favory, Xavier; Pons, Jordi; Font, Frederic; Serra, Xavier
(Institute of Electrical and Electronics Engineers (IEEE), 2022)
Most existing datasets for sound event recognition (SER) are relatively small and/or domain-specific, with the exception of AudioSet, based on over 2M tracks from YouTube videos and encompassing over 500 sound classes. ...