CATOTRON – A Neural text-to-speech system in catalan

Citació

Külebi B, Öktem A, Peiró-Lilja A, Pascual S, Farrús M. CATOTRON – A Neural text-to-speech system in catalan. In: Proceedings of Interspeech 2020; 2020 Oct 25-29; Shanghai, China. [Baixas]: ISCA; 2020. p. 490-1.

Enllaç permanent

Descripció

Dades relacionades
http://github.com/CollectivaT-dev/catotron
http://github.com/CollectivaT-dev/catotron-cpu
http://catotron.collectivat.cat/
Resum
We present Catotron, a neural network-based open-source speech synthesis system in Catalan. Catotron consists of a sequence-to-sequence model trained with two small opensource datasets based on semi-spontaneous and read speech. We demonstrate how a neural TTS can be built for languages with limited resources using found-data optimization and crosslingual transfer learning. We make the datasets, initial models and source code publicly available for both commercial and research purposes.
Descripció
Comunicació presentada a Interspeech 2020 celebrat del 25 al 29 d'octubre de 2020 a Shanghai, Xina.
Col·leccions
Congressos (Departament de Tecnologies de la Informació i les Comunicacions)

Fitxers