The IULA Spanish LSP treebank: building and browsing

  • dc.contributor.author Arias Badia, Blanca
  • dc.contributor.author Bel Rafecas, Núria
  • dc.contributor.author Fisas Elizalde, Beatriz
  • dc.contributor.author Lorente, Mercè
  • dc.contributor.author Marimon, Montserrat
  • dc.contributor.author Morell, Carlos
  • dc.contributor.author Vázquez, Silvia
  • dc.contributor.author Vivaldi, J. (Jorge), 1952-
  • dc.date.accessioned 2021-01-21T09:10:17Z
  • dc.date.available 2021-01-21T09:10:17Z
  • dc.date.issued 2014
  • dc.description Paper presented at the 9th International Conference on Language Resources and Evaluation (LREC'14), held 26-31 May 2014 in Reykjavík, Iceland.
  • dc.description.abstract This paper presents the IULA Spanish LSP Treebank, a dependency treebank of over 41,000 sentences of different domains (Law, Economy, Computing Science, Environment, and Medicine), developed in the framework of the European project METANET4U. Dependency annotations in the treebank were automatically derived from manually selected parses produced by an HPSG-grammar by a deterministic conversion algorithm that used the identifiers of grammar rules to identify the heads, the dependents, and some dependency types that were directly transferred onto the dependency structure (e.g., subject, specifier, and modifier), and the identifiers of the lexical entries to identify the argument-related dependency functions (e.g., direct object, indirect object, and oblique complement). The treebank is accessible with a browser that provides concordance-based search functions and delivers the results in two formats: (i) a column-based format, in the style of CoNLL-2006 shared task, and (ii) a dependency graph, where dependency relations are noted by an oriented arrow which goes from the dependent node to the head node. The IULA Spanish LSP Treebank is the first technical corpus of Spanish annotated at surface syntactic level following the dependency grammar theory. The treebank has been made publicly and freely available from the META-SHARE platform with a Creative Commons CC-by licence.
  • dc.description.sponsorship This work was co-funded by the Ramón y Cajal program of the Spanish Ministerio de Ciencia e Innovación, the EU UNER - Competitiveness and Innovation Framework Program, METANET (CIP-PSP-270893), and the UPF-IULA PhD grant program.
  • dc.format.mimetype application/pdf
  • dc.identifier.citation Marimon M, Bel N, Fisas B, Arias B, Vázquez S, Vivaldi J, Morell C, Lorente M. The IULA Spanish LSP treebank: building and browsing. In: Calzolari N, Choukri K, Declerck T, Loftsson H, Maegaard B, Mariani J, Moreno A, Odijk J, Piperidis S, editors. Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC'14); 2014 May 26-31; Reykjavik, Iceland. Paris: European Language Resources Association (ELRA); 2014. p. 782-8.
  • dc.identifier.uri http://hdl.handle.net/10230/46233
  • dc.language.iso eng
  • dc.publisher ELRA (European Language Resources Association)
  • dc.relation.ispartof Calzolari N, Choukri K, Declerck T, Loftsson H, Maegaard B, Mariani J, Moreno A, Odijk J, Piperidis S, editors. Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC'14); 2014 May 26-31; Reykjavik, Iceland. Paris: European Language Resources Association (ELRA); 2014. p. 782-8
  • dc.rights Licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 3.0 International License (https://creativecommons.org/licenses/by-nc-sa/3.0/)
  • dc.rights.accessRights info:eu-repo/semantics/openAccess
  • dc.rights.uri https://creativecommons.org/licenses/by-nc-sa/3.0/
  • dc.subject.keyword Spanish
  • dc.subject.keyword Treebank
  • dc.subject.keyword Dependency
  • dc.title The IULA Spanish LSP treebank: building and browsing
  • dc.type info:eu-repo/semantics/conferenceObject
  • dc.type.version info:eu-repo/semantics/publishedVersion
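
The abstract mentions results delivered in a column-based format in the style of the CoNLL-2006 shared task, and a graph view in which arrows run from the dependent to the head. As a rough illustration only, the sketch below reads a tab-separated block laid out in the ten CoNLL-X columns (ID, FORM, LEMMA, CPOSTAG, POSTAG, FEATS, HEAD, DEPREL, PHEAD, PDEPREL) and lists each relation as a (dependent, label, head) triple. The sample sentence, the tags, and the labels SPEC, SUBJ, DO, and ROOT are hypothetical and are not taken from the treebank; the exact columns exported by the treebank browser are likewise an assumption.

```python
from typing import List, NamedTuple, Tuple


class Token(NamedTuple):
    """The subset of CoNLL-X columns used in this sketch."""
    id: int       # column 1: token index, starting at 1
    form: str     # column 2: word form
    head: int     # column 7: index of the head token, 0 for the root
    deprel: str   # column 8: dependency label


# Hypothetical sentence and labels, for illustration only.
ROWS = [
    ("1", "El", "el", "d", "_", "_", "2", "SPEC", "_", "_"),
    ("2", "tribunal", "tribunal", "n", "_", "_", "3", "SUBJ", "_", "_"),
    ("3", "dictó", "dictar", "v", "_", "_", "0", "ROOT", "_", "_"),
    ("4", "sentencia", "sentencia", "n", "_", "_", "3", "DO", "_", "_"),
]
SAMPLE = "\n".join("\t".join(row) for row in ROWS)


def parse_conll(block: str) -> List[Token]:
    """Parse one sentence given as tab-separated, ten-column lines."""
    tokens = []
    for line in block.strip().splitlines():
        cols = line.split("\t")
        if len(cols) != 10:
            raise ValueError(f"expected 10 columns, got {len(cols)}: {line!r}")
        tokens.append(Token(int(cols[0]), cols[1], int(cols[6]), cols[7]))
    return tokens


def arcs(tokens: List[Token]) -> List[Tuple[str, str, str]]:
    """Return (dependent, label, head) triples, pointing from dependent to head."""
    forms = {t.id: t.form for t in tokens}
    forms[0] = "<root>"  # artificial root node for tokens whose HEAD is 0
    return [(t.form, t.deprel, forms[t.head]) for t in tokens]


if __name__ == "__main__":
    for dependent, label, head in arcs(parse_conll(SAMPLE)):
        print(f"{dependent} --{label}--> {head}")
```

Each printed line corresponds to one arrow of the graph view, drawn from the dependent to its head.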