What is the effect of audio quality on the robustness of MFCCs and chroma features?

Citació

  • Urbano J, Bogdanov D, Herrera P, Gómez E, Serra X. What is the effect of audio quality on the robustness of MFCCs and chroma features? In: Wang HM, Yang YH, Lee JH, editors. Proceedings of the 15th Conference of the International Society for Music Information Retrieval (ISMIR 2014); 2014 Oct 27-31; Taipei, Taiwan. [Place unknown]: International Society for Music Information Retrieval; 2014. p. 573-78.

Enllaç permanent

Descripció

  • Resum

    Music Information Retrieval is largely based on descriptors computed from audio signals, and in many practical applications they are to be computed on music corpora containing audio files encoded in a variety of lossy formats. Such encodings distort the original signal and therefore may affect the computation of descriptors. This raises the question of the robustness of these descriptors across various audio encodings. We examine this assumption for the case of MFCCs and chroma features. In particular, we analyze their robustness to sampling rate, codec, bitrate, frame size and music genre. Using two different audio analysis tools over a diverse collection of music tracks, we compute several statistics to quantify the robustness of the resulting descriptors, and then estimate the practical effects for a sample task like genre classification.
  • Mostra el registre complet