Arias Badia, BlancaBel Rafecas, NúriaFomicheva, MarinaLarrea Mendizabal, ImanolLorente, MercèMarimon, MontserratMilà-Garcia, AlbaVivaldi, J. (Jorge), 1952-Padró, Muntsa2021-01-212021-01-212014Arias B, Bel N, Lorente M, Marimón M, Milà A, Vivaldi J, Padró M, Fomicheva M, Larrea I. Boosting the creation of a treebank. In: Calzolari N, Choukri K, Declerck T, Loftsson H, Maegaard B, Mariani J, Moreno A, Odijk J, Piperidis S, editors. Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC'14); 2014 May 26-31; Reykjavik, Iceland. Paris: European Language Resources Association (ELRA); 2014. p. 775-81.http://hdl.handle.net/10230/46232Comunicació presentada al 9th International Conference on Language Resources and Evaluation (LREC'14), celebrat del 26 al 31 de maig de 2014 a Reykjavík, Islàndia.We present the results of the experiment of bootstrapping a Treebank for Catalan by using a Dependency Parser trained with Spanish sentences. In order to save time and cost, our approach was to profit from the typological similarities between Catalan and Spanish to create a first Catalan data set quickly by (i) automatically annotating with a delexicalized Spanish parser, (ii) manually correcting the parses, and (iii) using the Catalan corrected sentences to train a Catalan parser. The results showed that the number of parsed sentences required to train a Catalan parser is about 1000, which were achieved in 4 months with 2 annotators.application/pdfengLicensed under the Creative Commons Attribution-NonCommercial-ShareAlike 3.0 International License (https://creativecommons.org/licenses/by-nc-sa/3.0/)Boosting the creation of a treebankinfo:eu-repo/semantics/conferenceObjectDependency treebankTreebank bootstrappingLess resourced languagesinfo:eu-repo/semantics/openAccess