Prediction of pleasantness and eventfulness perceptual sound qualities in urban soundscapes
Mostra el registre complet Registre parcial de l'ítem
- dc.contributor.author Sagasti, Amaia
- dc.contributor.author Rocamora, Martín
- dc.contributor.author Font, Frederic
- dc.date.accessioned 2024-10-29T07:34:46Z
- dc.date.available 2024-10-29T07:34:46Z
- dc.date.issued 2024
- dc.description Comunicació presentada a 9th Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE 2024), celebrada a Tòquio del 23 al 25 d'octubre de 2024.
- dc.description.abstract The acoustic environment induces emotions in human listeners. To describe such emotions, ISO-12913 defines pleasantness and eventfulness as orthogonal properties that characterise urban soundscapes. In this paper, we study different approaches for automatically estimating these two perceptual sound qualities. We emphasize the comparison of three sets of audio features: a first set from the acoustic and psychoacoustic domain, suggested in ISO-12913; a second set of features from the machine listening domain based on traditional signal processing algorithms; and a third set consisting of audio embeddings generated with a pre-trained audio-language deep-learning model. Each feature set is tested on its own and in combination with ground-truth labels about the sound sources present in the recordings to determine if this additional information improves the prediction accuracy. Our findings indicate that the deep-learning representation yields slightly better performance than the other feature sets when predicting pleasantness, but all of them yield similar performance when predicting eventfulness. Nevertheless, deep-learning embeddings present other advantages, such as faster calculation times and greater robustness against changes in sensor calibration, making them more effective for real-time acoustic monitoring. Furthermore, we observe a clear correlation between the sound sources that are present in the urban soundscape and its induced emotions, specially regarding the sensation of pleasantness. Models like the ones proposed in this paper allow for an assessment of the acoustic environment that goes beyond a characterisation solely based on sound pressure level measurements and could be integrated into current acoustic monitoring solutions to enhance the understanding from the perspective of the induced emotions.
- dc.format.mimetype application/pdf
- dc.identifier.citation Sagasti A, Rocamora M, Font F. Prediction of pleasantness and eventfulness perceptual sound qualities in urban soundscapes. In: Ono N, Harada N, Kawaguchi Y, Gan WS, Imoto K, Komatsu T, Kong Q , Martin Morato I, editors. Proceedings of the 9th Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE 2024); 2024 Oct 23-25; Tokyo, Japan. [Tokyo]: DCASE; 2024. p. 131-5.
- dc.identifier.isbn 9784600015183
- dc.identifier.uri http://hdl.handle.net/10230/68386
- dc.language.iso eng
- dc.publisher DCASE
- dc.relation.ispartof Ono N, Harada N, Kawaguchi Y, Gan WS, Imoto K, Komatsu T, KongQ , Martin Morato I, editors. Proceedings of the 9th Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE 2024); 2024 Oct 23-25; Tokyo, Japan. [Tokyo]: DCASE; 2024. p. 131-5.
- dc.rights This work is licensed under a Creative Commons Attribution 4.0 International License. To view a copy of this license, visit: http://creativecommons.org/licenses/by/4.0/
- dc.rights.accessRights info:eu-repo/semantics/openAccess
- dc.rights.uri http://creativecommons.org/licenses/by/4.0/
- dc.subject.keyword Urban soundscapes
- dc.subject.keyword Acoustic monitoring
- dc.subject.keyword Emotions
- dc.subject.keyword Machine-learning
- dc.subject.keyword Perception
- dc.title Prediction of pleasantness and eventfulness perceptual sound qualities in urban soundscapes
- dc.type info:eu-repo/semantics/conferenceObject
- dc.type.version info:eu-repo/semantics/publishedVersion