BAF: an audio fingerprinting dataset for broadcast monitoring

  • dc.contributor.author Cortès, Guillem
  • dc.contributor.author Ciurana, Alex
  • dc.contributor.author Molina, Emilio
  • dc.contributor.author Miron, Marius
  • dc.contributor.author Meyers, Owen
  • dc.contributor.author Six, Joren
  • dc.contributor.author Serra, Xavier
  • dc.date.accessioned 2023-04-11T06:43:07Z
  • dc.date.available 2023-04-11T06:43:07Z
  • dc.date.issued 2022
  • dc.description.abstract Audio Fingerprinting (AFP) is a well-studied problem in music information retrieval with various use cases, e.g. content-based copy detection, DJ-set monitoring, and music excerpt identification. However, AFP for continuous broadcast monitoring (e.g. for TV & Radio), where music is often in the background, has not received much attention despite its importance to the music industry. In this paper, (1) we present BAF, the first public dataset for music monitoring in broadcast. It contains 74 hours of production music from Epidemic Sound and 57 hours of TV audio recordings. Furthermore, BAF provides cross-annotations with exact matching timestamps between Epidemic tracks and TV recordings. Approximately 80% of the total annotated time is background music. (2) We benchmark BAF with public state-of-the-art AFP systems, together with our proposed baseline PeakFP: a simple, non-scalable AFP algorithm based on spectral peak matching. In this benchmark, none of the algorithms obtains an F1-score above 47%, indicating that further research is needed to reach the AFP performance levels achieved in other studied use cases. The dataset, baseline, and benchmark framework are open and available for research.
  • dc.description.sponsorship This research is part of NextCore - New generation of music monitoring technology (RTC2019-007248-7), funded by the Spanish Ministerio de Ciencia e Innovación and the Agencia Estatal de Investigación. It has also received support from the Industrial Doctorates Plan of the Secretaria d’Universitats i Recerca, Departament d’Empresa i Coneixement de la Generalitat de Catalunya, grant agreement No. DI46-2020.
  • dc.format.mimetype application/pdf
  • dc.identifier.citation Cortès G, Ciurana A, Molina E, Miron M, Meyers O, Six J, Serra X. BAF: an audio fingerprinting dataset for broadcast monitoring. In: Rao P, Murthy H, Srinivasamurthy A, Bittner R, Caro Repetto R, Goto M, Serra X, Miron M, editors. Proceedings of the 23rd International Society for Music Information Retrieval Conference (ISMIR 2022); 2022 Dec 4-8; Bengaluru, India. [Canada]: International Society for Music Information Retrieval; 2022. p. 908-16. DOI: 10.5281/zenodo.7372162
  • dc.identifier.doi http://dx.doi.org/10.5281/zenodo.7372162
  • dc.identifier.isbn 978-1-7327299-2-6
  • dc.identifier.uri http://hdl.handle.net/10230/56445
  • dc.language.iso eng
  • dc.publisher International Society for Music Information Retrieval (ISMIR)
  • dc.relation.ispartof Rao P, Murthy H, Srinivasamurthy A, Bittner R, Caro Repetto R, Goto M, Serra X, Miron M, editors. Proceedings of the 23rd International Society for Music Information Retrieval Conference (ISMIR 2022); 2022 Dec 4-8; Bengaluru, India. [Canada]: International Society for Music Information Retrieval; 2022. p. 908-16.
  • dc.relation.isreferencedby https://github.com/guillemcortes/baf-dataset
  • dc.relation.isreferencedby https://doi.org/10.5281/zenodo.6868083
  • dc.relation.isreferencedby https://github.com/guillemcortes/neural-audio-fp
  • dc.relation.projectID info:eu-repo/grantAgreement/ES/2PE/RTC2019-007248-7
  • dc.rights © G. Cortès, A. Ciurana, E. Molina, M. Miron, O. Meyers, J. Six and X. Serra. Licensed under a Creative Commons Attribution 4.0 International License (CC BY 4.0).
  • dc.rights.accessRights info:eu-repo/semantics/openAccess
  • dc.rights.uri http://creativecommons.org/licenses/by/4.0/
  • dc.subject.other Music
  • dc.subject.other Information retrieval
  • dc.title BAF: an audio fingerprinting dataset for broadcast monitoring
  • dc.type info:eu-repo/semantics/conferenceObject
  • dc.type.version info:eu-repo/semantics/publishedVersion