CATOTRON – A Neural text-to-speech system in catalan
Mostra el registre complet Registre parcial de l'ítem
- dc.contributor.author Külebi, Baybars
- dc.contributor.author Öktem, Alp
- dc.contributor.author Peiró Lilja, Àlex
- dc.contributor.author Pascual, Santiago
- dc.contributor.author Farrús, Mireia
- dc.date.accessioned 2020-11-11T07:30:29Z
- dc.date.available 2020-11-11T07:30:29Z
- dc.date.issued 2020
- dc.description Comunicació presentada a Interspeech 2020 celebrat del 25 al 29 d'octubre de 2020 a Shanghai, Xina.
- dc.description.abstract We present Catotron, a neural network-based open-source speech synthesis system in Catalan. Catotron consists of a sequence-to-sequence model trained with two small opensource datasets based on semi-spontaneous and read speech. We demonstrate how a neural TTS can be built for languages with limited resources using found-data optimization and crosslingual transfer learning. We make the datasets, initial models and source code publicly available for both commercial and research purposes.
- dc.description.sponsorship This work was subsidised by the Catalan Department of Culture. A part of the funding comes from the financing administered by the inheritance board of the Generalitat de Catalunya. The last author has been funded by the Agencia Estatal de Investigaci ´on (AEI), Ministerio de Ciencia, Innovaci´on y Universidades and the Fondo Social Europeo (FSE) under grant RYC- 2015-17239 (AEI/FSE, UE). Part of this work has been carried out using an NVIDIA GPU Titan Xp generously provided by NVIDIA Company. This research was also partially supported by the project TEC2015-69266-P (MINECO/FEDER, UE).
- dc.format.mimetype application/pdf
- dc.identifier.citation Külebi B, Öktem A, Peiró-Lilja A, Pascual S, Farrús M. CATOTRON – A Neural text-to-speech system in catalan. In: Proceedings of Interspeech 2020; 2020 Oct 25-29; Shanghai, China. [Baixas]: ISCA; 2020. p. 490-1.
- dc.identifier.issn 1990-9772
- dc.identifier.uri http://hdl.handle.net/10230/45715
- dc.language.iso eng
- dc.publisher International Speech Communication Association (ISCA)
- dc.relation.ispartof Proceedings of Interspeech 2020; 2020 Oct 25-29; Shanghai, China. [Baixas]: ISCA; 2020.
- dc.relation.isreferencedby http://github.com/CollectivaT-dev/catotron
- dc.relation.isreferencedby http://github.com/CollectivaT-dev/catotron-cpu
- dc.relation.isreferencedby http://catotron.collectivat.cat/
- dc.relation.projectID info:eu-repo/grantAgreement/ES/1PE/TEC2015-69266-P
- dc.rights © 2020 ISCA
- dc.rights.accessRights info:eu-repo/semantics/openAccess
- dc.subject.keyword Text-to-speech
- dc.subject.keyword Sequence-to-sequence
- dc.subject.keyword Catalan
- dc.title CATOTRON – A Neural text-to-speech system in catalan
- dc.type info:eu-repo/semantics/conferenceObject
- dc.type.version info:eu-repo/semantics/publishedVersion