BAF: an audio fingerprinting dataset for broadcast monitoring
| dc.contributor.author | Cortès, Guillem | |
| dc.contributor.author | Ciurana, Alex | |
| dc.contributor.author | Molina, Emilio | |
| dc.contributor.author | Miron, Marius | |
| dc.contributor.author | Meyers, Owen | |
| dc.contributor.author | Six, Joren | |
| dc.contributor.author | Serra, Xavier | |
| dc.date.accessioned | 2023-04-11T06:43:07Z | |
| dc.date.available | 2023-04-11T06:43:07Z | |
| dc.date.issued | 2022 | |
| dc.description.abstract | Audio Fingerprinting (AFP) is a well-studied problem in music information retrieval for various use-cases e.g. content-based copy detection, DJ-set monitoring, and music excerpt identification. However, AFP for continuous broadcast monitoring (e.g. for TV & Radio), where music is often in the background, has not received much attention despite its importance to the music industry. In this paper (1) we present BAF, the first public dataset for music monitoring in broadcast. It contains 74 hours of production music from Epidemic Sound and 57 hours of TV audio recordings. Furthermore, BAF provides cross-annotations with exact matching timestamps between Epidemic tracks and TV recordings. Approximately, 80% of the total annotated time is background music. (2) We benchmark BAF with public state-of-the-art AFP systems, together with our proposed baseline PeakFP: a simple, non-scalable AFP algorithm based on spectral peak matching. In this benchmark, none of the algorithms obtain a F1-score above 47%, pointing out that further research is needed to reach the AFP performance levels in other studied use cases. The dataset, baseline, and benchmark framework are open and available for research. | |
| dc.description.sponsorship | This research is part of NextCore - New generation of music monitoring technology (RTC2019-007248-7), funded by the Spanish Ministerio de Ciencia e Innovación and the Agencia Estatal de Investigación. Also, has received support from Industrial Doctorates plan of the Secretaria d’universitats i Recerca, Departament d’Empresa i Coneixement de la Generalitat de Catalunya, grant agreement No. DI46-2020. | |
| dc.format.mimetype | application/pdf | |
| dc.identifier.citation | Cortès G, Ciurana A, Molina E, Miron M, Meyers O, Six J, Serra X. BAF: an audio fingerprinting dataset for broadcast monitoring. In: Rao P, Murthy H, Srinivasamurthy A, Bittner R, Caro Repetto R, Goto M, Serra X, Miron M, editors. Proceedings of the 23nd International Society for Music Information Retrieval Conference (ISMIR 2022); 2022 Dec 4-8; Bengaluru, India. [Canada]: International Society for Music Information Retrieval; 2022. p. 908-16. DOI: 10.5281/zenodo.7372162 | |
| dc.identifier.doi | http://dx.doi.org/10.5281/zenodo.7372162 | |
| dc.identifier.isbn | 978-1-7327299-2-6 | |
| dc.identifier.uri | http://hdl.handle.net/10230/56445 | |
| dc.language.iso | eng | |
| dc.publisher | International Society for Music Information Retrieval (ISMIR) | |
| dc.relation.ispartof | Rao P, Murthy H, Srinivasamurthy A, Bittner R, Caro Repetto R, Goto M, Serra X, Miron M, editors. Proceedings of the 23nd International Society for Music Information Retrieval Conference (ISMIR 2022); 2022 Dec 4-8; Bengaluru, India. [Canada]: International Society for Music Information Retrieval; 2022. p. 908-16. | |
| dc.relation.isreferencedby | https://github.com/guillemcortes/baf-dataset | |
| dc.relation.isreferencedby | https://doi.org/10.5281/zenodo.6868083 | |
| dc.relation.isreferencedby | https://github.com/guillemcortes/neural-audio-fp | |
| dc.relation.projectID | info:eu-repo/grantAgreement/ES/2PE/RTC2019-007248-7 | |
| dc.rights | © G. Cortès, A. Ciurana, E. Molina, M. Miron, O. Meyers, J. Six and X. Serra. Licensed under a Creative Commons Attribution 4.0 International License (CC BY 4.0). | |
| dc.rights.accessRights | info:eu-repo/semantics/openAccess | |
| dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | |
| dc.subject.other | Música | |
| dc.subject.other | Recuperació de la informació | |
| dc.title | BAF: an audio fingerprinting dataset for broadcast monitoring | |
| dc.type | info:eu-repo/semantics/conferenceObject | |
| dc.type.version | info:eu-repo/semantics/publishedVersion |
Files
Original bundle
1 - 1 of 1

