Savana: a global information extraction and terminology expansion framework in the medical domain
Mostra el registre complet Registre parcial de l'ítem
- dc.contributor.author Espinosa-Anke, Luisca
- dc.contributor.author Tello, Jorgeca
- dc.contributor.author Pardo, Albertoca
- dc.contributor.author Medrano, Ignacioca
- dc.contributor.author Ureña, Albertoca
- dc.contributor.author Salcedo, Ignacioca
- dc.contributor.author Saggion, Horacioca
- dc.date.accessioned 2017-12-19T10:20:56Z
- dc.date.available 2017-12-19T10:20:56Z
- dc.date.issued 2016
- dc.description.abstract Terminological databases constitute a fundamental source of information in the medical domain. They are used daily both by practitioners in the area, as well as in academia. Several resources of this kind are available, e.g. CIE, SnomedCT or UMLS (Unified Medical Language System). These terminological databases are of high quality due to them being the result of collaborative expert knowledge. However, they may show certain drawbacks in terms of faithfully representing the ever-changing medical domain. Therefore, systems aimed at capturing novel terminological knowledge in heterogeneous text sources, and able to include them in standard terminologies have the potential to add great value to such repositories. This paper presents, first, Savana, a Biomedical Information Extraction system which, combined with a validation phase carried out by medical practitioners, is used to populate the Spanish branch of SnomedCT with novel knowledge. Second, we describe and evaluate a system which, given a novel medical term, finds its most likely hypernym, thus becoming an enabler in the task of terminological database enrichment and expansion.en
- dc.description.sponsorship This work is partially funded by the Spanish Ministry of Economy and Competitiveness under the following sponsorships: Maria de Maeztu Units of Excellence Programme (MDM-2015-0502), and TUNER project (TIN2015-65308-C5-5-R, MINECO/FEDER, UE).
- dc.format.mimetype application/pdfca
- dc.identifier.citation Espinosa L, Tello J, Pardo A, Medrano I, Ureña A, Salcedo I, Saggion H. Savana: a global information extraction and terminology expansion framework in the medical domain. Procesamiento del Lenguaje Natural. 2016;57:23-30
- dc.identifier.issn 1135-5948
- dc.identifier.uri http://hdl.handle.net/10230/33531
- dc.language.iso eng
- dc.publisher Sociedad Española para el Procesamiento del Lenguaje Natural (SEPLN)ca
- dc.relation.ispartof Procesamiento del Lenguaje Natural. 2016;57:23-30
- dc.relation.projectID info:eu-repo/grantAgreement/ES/1PE/TIN2015-65308-C5-1-R
- dc.rights © Sociedad Española para el Procesamiento de Lenguaje Natural
- dc.rights.accessRights info:eu-repo/semantics/openAccess
- dc.subject.keyword Medical terminologiesen
- dc.subject.keyword Knowledge basesen
- dc.subject.keyword Snomeden
- dc.subject.keyword Word2vecen
- dc.subject.keyword Semanticsen
- dc.subject.keyword Savanaen
- dc.subject.keyword Terminologías médicases
- dc.subject.keyword Bases de conocimientoes
- dc.subject.keyword Snomedes
- dc.subject.keyword Word2veces
- dc.subject.keyword Semánticaes
- dc.subject.keyword Savanaes
- dc.title Savana: a global information extraction and terminology expansion framework in the medical domainca
- dc.type info:eu-repo/semantics/article
- dc.type.version info:eu-repo/semantics/publishedVersion