General-purpose tagging of Freesound audio with audioSet labels: task description, dataset, and baseline

Mostra el registre complet Registre parcial de l'ítem

  • dc.contributor.author Fonseca, Eduardo
  • dc.contributor.author Plakal, Manoj
  • dc.contributor.author Font Corbera, Frederic
  • dc.contributor.author Ellis, Daniel P. W.
  • dc.contributor.author Favory, Xavier
  • dc.contributor.author Pons Puig, Jordi
  • dc.contributor.author Serra, Xavier
  • dc.date.accessioned 2019-03-18T08:58:42Z
  • dc.date.available 2019-03-18T08:58:42Z
  • dc.date.issued 2018
  • dc.description Comunicació presentada a: Detection and Classification of Acoustic Scenes and Events 2018, celebrat a Surrey, Regne Unit, del 19 al 20 de novembre de 2018.
  • dc.description.abstract This paper describes Task 2 of the DCASE 2018 Challenge, titled “General-purpose audio tagging of Freesound content with AudioSet labels”. This task was hosted on the Kaggle platform as “Freesound General-Purpose Audio Tagging Challenge”. The goal of the task is to build an audio tagging system that can recognize the category of an audio clip from a subset of 41 diverse categories drawn from the AudioSet Ontology. We present the task, the dataset prepared for the competition, and a baseline system.
  • dc.description.sponsorship This work is partially supported by the European Union’s Horizon 2020 research and innovation programme under grant agreement No 688382 AudioCommons and a Google Faculty Research Award 2017.
  • dc.format.mimetype application/pdf
  • dc.identifier.citation Fonseca E, Plakal M, Font F, Ellis DPW, Favory X, Pons J, Serra X. General-purpose tagging of Freesound audio with audioSet labels: task description, dataset, and baseline. In: Plumbley MD, Kroos C, Bello JP, Richard G, Ellis, DPW, Mesaros A, editors. Proceedings of the Detection and Classification of Acoustic Scenes and Events 2018 Workshop (DCASE2018); 2018 Nov 19-20; Surrey, UK. Tampere: Tampere University of Technology; 2018. p. 69-73.
  • dc.identifier.isbn 978-952-15-4262-6
  • dc.identifier.uri http://hdl.handle.net/10230/36853
  • dc.language.iso eng
  • dc.publisher Tampere University of Technology
  • dc.relation.ispartof Plumbley MD, Kroos C, Bello JP, Richard G, Ellis, DPW, Mesaros A, editors. Proceedings of the Detection and Classification of Acoustic Scenes and Events 2018 Workshop (DCASE2018); 2018 Nov 19-20; Surrey, UK. Tampere: Tampere University of Technology; 2018. p. 69-73.
  • dc.relation.projectID info:eu-repo/grantAgreement/EC/H2020/688382
  • dc.rights © Tampere University of Technology
  • dc.rights.accessRights info:eu-repo/semantics/openAccess
  • dc.rights.uri http://creativecommons.org/licenses/by/4.0/
  • dc.subject.keyword Audio tagging
  • dc.subject.keyword Audio dataset
  • dc.subject.keyword Data collection
  • dc.title General-purpose tagging of Freesound audio with audioSet labels: task description, dataset, and baseline
  • dc.type info:eu-repo/semantics/conferenceObject
  • dc.type.version info:eu-repo/semantics/publishedVersion