Towards assessing data replication in music generation with music similarity metrics on raw audio

Citació

  • Batlle-Roca R, Liao WH, Serra X, Mitsufuji Y, Gómez E. Towards assessing data replication in music generation with music similarity metrics on raw audio. Paper presented at: 25th International Society for Music Information Retrieval Conference (ISMIR2024); 2024 November 10-14; San Francisco, USA.

Enllaç permanent

Descripció

  • Resum

    Recent advancements in music generation are raising multiple concerns about the implications of AI in creative music processes, current business models and impacts related to intellectual property management. A relevant discussion and related technical challenge is the potential replication and plagiarism of the training set in AI-generated music, which could lead to misuse of data and intellectual property rights violations. To tackle this issue, we present the Music Replication Assessment (MiRA) tool: a modelindependent open evaluation method based on diverse audio music similarity metrics to assess data replication. We evaluate the ability of five metrics to identify exact replication by conducting a controlled replication experiment in different music genres using synthetic samples. Our results show that the proposed methodology can estimate exact data replication with a proportion higher than 10%. By introducing the MiRA tool, we intend to encourage the open evaluation of music-generative models by researchers, developers, and users concerning data replication, highlighting the importance of the ethical, social, legal, and economic consequences. Code and examples are available for reproducibility purposes.
  • Descripció

    This work has been accepted at 25th International Society for Music Information Retrieval Conference (ISMIR2024), in San Francisco, USA. November 10-14, 2024
  • Mostra el registre complet