TALN at SemEval-2016 Task 14: semantic taxonomy enrichment via sense-based embeddings


  • dc.contributor.author Espinosa-Anke, Luis
  • dc.contributor.author Ronzano, Francesco
  • dc.contributor.author Saggion, Horacio
  • dc.date.accessioned 2023-12-18T06:53:38Z
  • dc.date.available 2023-12-18T06:53:38Z
  • dc.date.issued 2016
  • dc.description.abstract This paper describes the participation of the TALN team in SemEval-2016 Task 14: Semantic Taxonomy Enrichment. The purpose of the task is to find the best point of attachment in WordNet for a set of Out of Vocabulary (OOV) terms. These may come, to name a few, from domain-specific glossaries, slang or typical jargon from Internet forums and chatrooms. Our contribution takes as input an OOV term, its part of speech and its associated definition, and generates a set of WordNet synset candidates derived from modelling the term’s definition as a sense embedding representation. We leverage a BabelNet-based vector space representation, which allows us to map the algorithm’s prediction to WordNet. Our approach is designed to be generic and fitting to any domain, without exploiting, for instance, HTML markup in source web pages. Our system performs above the median of all submitted systems, and rivals in performance a powerful baseline based on extracting the first word of the definition with the same part-of-speech as the OOV term.
  • dc.description.sponsorship This work was partially funded by Dr. Inventor (FP7-ICT-2013.8.1, grant agreement 611383), and by the Spanish Ministry of Economy and Competitiveness under the Maria de Maeztu Units of Excellence Programme (MDM-2015-0502).
  • dc.format.mimetype application/pdf
  • dc.identifier.citation Espinosa-Anke L, Ronzano F, Saggion H. TALN at SemEval-2016 Task 14: semantic taxonomy enrichment via sense-based embeddings. In: Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval-2016); 2016 Jun 16-17; San Diego, California. [San Diego]: Association for Computational Linguistics; 2016. p. 1332-36. DOI: 10.18653/v1/s16-1208
  • dc.identifier.doi http://dx.doi.org/10.18653/v1/s16-1208
  • dc.identifier.issn 0736-587X
  • dc.identifier.uri http://hdl.handle.net/10230/58549
  • dc.language.iso eng
  • dc.publisher ACL (Association for Computational Linguistics)
  • dc.relation.ispartof Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval-2016); 2016 Jun 16-17; San Diego, California. [San Diego]: Association for Computational Linguistics; 2016. p. 1332-36
  • dc.relation.projectID info:eu-repo/grantAgreement/EC/FP7/611383
  • dc.relation.projectID info:eu-repo/grantAgreement/ES/1PE/MDM-2015-0502
  • dc.rights ACL materials are Copyright © 1963–2023 ACL; other materials are copyrighted by their respective copyright holders. Materials prior to 2016 here are licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 3.0 International License. Permission is granted to make copies for the purposes of teaching and research. Materials published in or after 2016 are licensed on a Creative Commons Attribution 4.0 International License.
  • dc.rights.accessRights info:eu-repo/semantics/openAccess
  • dc.rights.uri http://creativecommons.org/licenses/by-nc-sa/3.0/
  • dc.subject.other Subject headings
  • dc.subject.other Vocabulary
  • dc.subject.other Internet forums
  • dc.title TALN at SemEval-2016 Task 14: semantic taxonomy enrichment via sense-based embeddings
  • dc.type info:eu-repo/semantics/conferenceObject
  • dc.type.version info:eu-repo/semantics/publishedVersion
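
The abstract mentions two ideas that a small example may help make concrete: the baseline that picks the first word of the definition sharing the OOV term's part of speech, and the ranking of candidate senses by similarity to an embedding of the definition. The following is a minimal sketch only, not the authors' system. The toy part-of-speech lexicon, the two-dimensional vectors, the sense labels, the averaging of word vectors and the use of cosine similarity are all assumptions made for illustration; the record does not specify how the definition is modelled or how candidates are scored.

```python
import math

# Hypothetical toy data: a real system would use a POS tagger and
# pretrained sense embeddings rather than these hand-written values.
TOY_POS = {"a": "det", "sweet": "adj", "baked": "adj", "dessert": "noun"}
TOY_WORD_VECTORS = {
    "sweet": [1.0, 0.0],
    "baked": [0.8, 0.2],
    "dessert": [0.9, 0.1],
}
TOY_SENSE_VECTORS = {
    "sense_pastry": [1.0, 0.1],   # illustrative labels, not real synset IDs
    "sense_vehicle": [0.0, 1.0],
}


def tokenize(definition):
    """Lowercase, split on whitespace and strip basic punctuation."""
    return [t.strip(".,;:()") for t in definition.lower().split()]


def first_word_baseline(definition, pos, pos_lexicon):
    """Return the first definition token whose POS matches the OOV term's POS."""
    for token in tokenize(definition):
        if pos_lexicon.get(token) == pos:
            return token
    return None


def centroid(tokens, vectors):
    """Average the vectors of the tokens that have one (assumed composition)."""
    known = [vectors[t] for t in tokens if t in vectors]
    if not known:
        return None
    dim = len(known[0])
    return [sum(v[i] for v in known) / len(known) for i in range(dim)]


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def rank_candidates(definition, word_vectors, sense_vectors):
    """Rank candidate senses by similarity to the definition centroid."""
    center = centroid(tokenize(definition), word_vectors)
    if center is None:
        return []
    scored = [(cosine(center, vec), sense) for sense, vec in sense_vectors.items()]
    return [sense for _, sense in sorted(scored, reverse=True)]


definition = "a sweet baked dessert"
baseline_guess = first_word_baseline(definition, "noun", TOY_POS)
ranked = rank_candidates(definition, TOY_WORD_VECTORS, TOY_SENSE_VECTORS)
```

With the toy data above, `baseline_guess` is the first noun in the definition and `ranked` lists the hypothetical sense labels from most to least similar. In a realistic setting the top-ranked senses would still need to be mapped to WordNet synsets, which the abstract says is done through the BabelNet-based representation.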