A multilingual annotated corpus for the study of information structure

Mostra el registre complet Registre parcial de l'ítem

  • dc.contributor.author Brunetti, Lisa
  • dc.contributor.author Bott, Stefan Markus
  • dc.contributor.author Costa, Joan
  • dc.contributor.author Vallduví, Enric
  • dc.date.accessioned 2025-01-28T14:47:50Z
  • dc.date.available 2025-01-28T14:47:50Z
  • dc.date.issued 2009
  • dc.description.abstract This paper presents a corpus of spoken narrative texts in Catalan, Italian, Spanish, English, and German. The aim of this corpus compilation is to create an empirical resource for a comparative study of Information Structure. A total of 68 speakers were asked to tell a story in an acoustically isolated room by looking at the pictures of three textless books. A total of 222 narrations resulted in about 16 hours of speech. The recordings have been transcribed and an original annotation of non-canonical constructions for the Romance subgroup has been proposed, namely of morphosyntactically/prosodically marked constructions that relate informational categories such as topic, focus, and contrast. Transcriptions and annotations of some selected high quality recordings have been aligned to the acoustic signal stream. The corpus is available in audio and text format.
  • dc.description.sponsorship This research has been partially funded by the Spanish Ministry of Education and Science project OpenMT (TIN2006 15307-C03-02). The NOCANDO project was funded by the Spanish Secretaria de Estado de Universidades e Investigación of the Ministerio de Educación y Ciencia (n. I+D HUM2004-04463).
  • dc.format.mimetype application/pdf
  • dc.identifier.citation Brunetti L, Bott S, Costa J, Vallduví E. A multilingual annotated corpus for the study of information structure. In: Konopka M et al.. Grammatik und Korpora 2009 dritte internationale Konferenz Mannheim, 22.-24.09.2009 = Grammar & Corpora 2009 third international conference. 1 ed. Tubinga: Narr; 2011. p. 305-27
  • dc.identifier.isbn 9783823366485
  • dc.identifier.uri http://hdl.handle.net/10230/69342
  • dc.language.iso eng
  • dc.publisher Narr Francke Attempto Verlag
  • dc.rights © 2011 Narr Francke Attempto Verlag GmbH + Co
  • dc.rights.accessRights info:eu-repo/semantics/openAccess
  • dc.subject.keyword Information structure
  • dc.subject.keyword Multilingual annotated corpus
  • dc.title A multilingual annotated corpus for the study of information structure
  • dc.type info:eu-repo/semantics/conferenceObject
  • dc.type.version info:eu-repo/semantics/acceptedVersion