Data efficient voice cloning for neural singing synthesis

dc.contributor.authorBlaauw, Merlijn
dc.contributor.authorBonada, Jordi, 1973-
dc.contributor.authorDaido, Ryunosuke
dc.date.accessioned2021-02-26T07:16:55Z
dc.date.issued2019
dc.descriptionComunicació presentada al IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), celebrat els dies 12 al 17 de 2019 a Brighton, Anglaterra.
dc.description.abstractThere are many use cases in singing synthesis where creating voices from small amounts of data is desirable. In text-to-speech there have been several promising results that apply voice cloning techniques to modern deep learning based models. In this work, we adapt one such technique to the case of singing synthesis. By leveraging data from many speakers to first create a multispeaker model, small amounts of target data can then efficiently adapt the model to new unseen voices. We evaluate the system using listening tests across a number of different use cases, languages and kinds of data.en
dc.format.mimetypeapplication/pdf
dc.identifier.citationBlaauw M, Bonada J, Daido R. Data efficient voice cloning for neural singing synthesis. In: 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP); 2019 May 12-17; Brighton, United Kingdom. New Jersey: Institute of Electrical and Electronics Engineers; 2019. p. 6840-4. DOI: 10.1109/ICASSP.2019.8682656
dc.identifier.doihttp://dx.doi.org/10.1109/ICASSP.2019.8682656
dc.identifier.issn2379-190X
dc.identifier.urihttp://hdl.handle.net/10230/46596
dc.language.isoeng
dc.publisherInstitute of Electrical and Electronics Engineers (IEEE)
dc.relation.ispartof2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP); 2019 May 12-17; Brighton, United Kingdom. New Jersey: Institute of Electrical and Electronics Engineers; 2019. p. 6840-4
dc.rights© 2019 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. http://dx.doi.org/10.1109/ICASSP.2019.8682656
dc.rights.accessRightsinfo:eu-repo/semantics/openAccess
dc.subject.keywordSinging synthesisen
dc.subject.keywordVoice cloningen
dc.subject.keywordSpeaker embeddingen
dc.subject.keywordSpeaker adaptationen
dc.subject.keywordMultispeaker modelen
dc.titleData efficient voice cloning for neural singing synthesisen
dc.typeinfo:eu-repo/semantics/conferenceObject
dc.type.versioninfo:eu-repo/semantics/acceptedVersion

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
blaauw_icassp19_data.pdf
Size:
114.66 KB
Format:
Adobe Portable Document Format