Welcome to the UPF Digital Repository

LaSTUS/TALN at Complex Word Identification (CWI) 2018 Shared Task

Show simple item record

dc.contributor.author AbuRa'ed, Ahmed Ghassan Tawfiq
dc.contributor.author Saggion, Horacio
dc.date.accessioned 2018-07-24T07:54:23Z
dc.date.available 2018-07-24T07:54:23Z
dc.date.issued 2018
dc.identifier.citation AbuRa'ed A, Saggion H. LaSTUS/TALN at Complex Word Identification (CWI) 2018 Shared Task. In: Proceedings of the Thirteenth Workshop on Innovative Use of NLP for Building Educational Applications; 2018 Jun 5; New Orleans, LA. Stroudsburg (PA): ACL; 2018. p. 159–65.
dc.identifier.uri http://hdl.handle.net/10230/35241
dc.description Comunicació presentada al 13th Workshop on Innovative Use of NLP for Building Educational Applications, celebrat el dia 5 de juny de 2018 a Nova Orleans, EUA.
dc.description.abstract This paper presents the participation of the LaSTUS/TALN team in the Complex Word Identification (CWI) Shared Task 2018 in the English monolingual track . The purpose of the task was to determine if a word in a given sentence can be judged as complex or not by a certain target audience. For the English track, task organizers provided a training and a development datasets of 27,299 and 3,328 words respectively together with the sentence in which each word occurs. The words were judged as complex or not by 20 human evaluators; ten of whom are natives. We submitted two systems: one system modeled each word to evaluate as a numeric vector populated with a set of lexical, semantic and contextual features while the other system relies on a word embedding representation and a distance metric. We trained two separate classifiers to automatically decide if each word is complex or not. We submitted six runs, two for each of the three subsets of the English monolingual CWI track.
dc.description.sponsorship This work is (partly) supported by the Spanish Ministry of Economy and Competitiveness under the Maria de Maeztu Units of Excellence Programme (MDM-2015-0502) and by the TUNER project (TIN2015-65308-C5-5-R, MINECO/FEDER, UE).
dc.format.mimetype application/pdf
dc.language.iso eng
dc.publisher ACL (Association for Computational Linguistics)
dc.relation.ispartof Proceedings of the Thirteenth Workshop on Innovative Use of NLP for Building Educational Applications; 2018 Jun 5; New Orleans, LA. Stroudsburg (PA): ACL; 2018. p. 159–65.
dc.rights © ACL, Creative Commons Attribution 4.0 License
dc.rights.uri http://creativecommons.org/licenses/by/4.0/
dc.subject.other Tractament del llenguatge natural (Informàtica)
dc.title LaSTUS/TALN at Complex Word Identification (CWI) 2018 Shared Task
dc.type info:eu-repo/semantics/conferenceObject
dc.relation.projectID info:eu-repo/grantAgreement/ES/1PE/TIN2015-65308-C5-5-R
dc.rights.accessRights info:eu-repo/semantics/openAccess
dc.type.version info:eu-repo/semantics/publishedVersion

Thumbnail

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Advanced Search

Browse

My Account

Statistics

In collaboration with Compliant to Partaking