Training neural audio classifiers with few data

Pons Puig, Jordi; Serrà Julià, Joan; Serra, Xavier

Training neural audio classifiers with few data

Mostra el registre complet Registre parcial de l'ítem

dc.contributor.author Pons Puig, Jordi
dc.contributor.author Serrà Julià, Joan
dc.contributor.author Serra, Xavier
dc.date.accessioned 2019-05-22T08:05:05Z
dc.date.issued 2019
dc.description Comunicació presentada a: ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing celebrat de 12 al 17 de maig de 2019 a Brighton, Regne Unit.
dc.description.abstract We investigate supervised learning strategies that improve the training of neural network audio classifiers on small annotated collections. In particular, we study whether (i) a naive regularization of the solution space, (ii) prototypical networks, (iii) transfer learning, or (iv) their combination, can foster deep learning models to better leverage a small amount of training examples. To this end, we evaluate (i–iv) for the tasks of acoustic event recognition and acoustic scene classification, considering from 1 to 100 labeled examples per class. Results indicate that transfer learning is a powerful strategy in such scenarios, but prototypical networks show promising results when one does not count with external or validation data.
dc.format.mimetype application/pdf
dc.identifier.citation Pons J, Serrà J, Serra X. Training neural audio classifiers with few data. In: 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP); 2019 May 12-17; Brighton, United Kingdom. New Jersey: Institute of Electrical and Electronics Engineers; 2019. p. 16-20. DOI: 10.1109/ICASSP.2019.8682591
dc.identifier.doi http://dx.doi.org/10.1109/ICASSP.2019.8682591
dc.identifier.issn 2379-190X
dc.identifier.uri http://hdl.handle.net/10230/37259
dc.language.iso eng
dc.publisher Institute of Electrical and Electronics Engineers (IEEE)
dc.relation.ispartof 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP); 2019 May 12-17; Brighton, United Kingdom. New Jersey: Institute of Electrical and Electronics Engineers; 2019.
dc.relation.isreferencedby https://github.com/jordipons/neural-classifiers-with-few-audio/
dc.rights © 2019 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. http://dx.doi.org/10.1109/ICASSP.2019.8682591
dc.rights.accessRights info:eu-repo/semantics/openAccess
dc.subject.keyword Prototypical networks
dc.subject.keyword Transfer learning
dc.subject.keyword Audio classification
dc.subject.keyword Small data
dc.title Training neural audio classifiers with few data
dc.type info:eu-repo/semantics/conferenceObject
dc.type.version info:eu-repo/semantics/acceptedVersion

Col·leccions

Congressos (Departament de Tecnologies de la Informació i les Comunicacions)