Serra J, Gómez E, Herrera P, Serra X. Chroma binary similarity and local alignment applied to cover song identification. IEEE Transactions on Audio, Speech and Language Processing. 2008; 16(6): 1138-1151. DOI 10.1109/TASL.2008.924595
http://hdl.handle.net/10230/16277
|
Title:
|
Chroma binary similarity and local alignment applied to cover song identification |
|
Author:
|
Serrà Julià, Joan; Gómez Gutiérrez, Emilia; Herrera Boyer, Perfecto; Serra, Xavier
|
|
Abstract:
|
We present a new technique for audio signal comparison based on tonal subsequence alignment and its application to detect cover versions (i.e., different performances of the same underlying musical piece). Cover song identification is a task whose popularity has increased in the Music Information Retrieval (MIR) community along in the past, as it provides a direct and objective way to evaluate music similarity algorithms.
This article first presents a series of experiments carried out
with two state-of-the-art methods for cover song identification.
We have studied several components of these (such as chroma resolution and similarity, transposition, beat tracking or Dynamic Time Warping constraints), in order to discover which characteristics would be desirable for a competitive cover song identifier. After analyzing many cross-validated results, the importance of these characteristics is discussed, and the best-performing ones are finally applied to the newly proposed method. Multiple
evaluations of this one confirm a large increase in identification
accuracy when comparing it with alternative state-of-the-art
approaches.
|
|
Document type:
|
Article
|
|
Document version:
|
Accepted version
|
|
Date:
|
2008 |
|
Rights:
|
© 2008 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
The final published article can be found at (http://dx.doi.org/10.1109/TASL.2008.924595) |
Show full document record