DEFEXT: a semi supervised definition extraction tool

dc.contributor.author: Espinosa-Anke, Luis [ca]
dc.contributor.author: Carlini, Roberto [ca]
dc.contributor.author: Saggion, Horacio [ca]
dc.contributor.author: Ronzano, Francesco [ca]
dc.date.accessioned: 2016-12-22T17:38:22Z
dc.date.available: 2016-12-22T17:38:22Z
dc.date.issued: 2016 [ca]
dc.description: Paper presented at Globalex: Lexicographic Resources for Human Language Technology, a full-day workshop at LREC 2016, Portorož, Slovenia, May 24, 2016. [en]
dc.description.abstract: We present DEFEXT, an easy-to-use semi-supervised Definition Extraction Tool. DEFEXT is designed to extract from a target corpus those textual fragments where a term is explicitly mentioned together with its core features, i.e. its definition. It builds on a Conditional Random Fields-based sequential labeling algorithm and a bootstrapping approach. Bootstrapping enables the model to gradually become more aware of the idiosyncrasies of the target corpus. In this paper we describe the main components of the toolkit as well as experimental results stemming from both automatic and manual evaluation. We release DEFEXT as open source along with the necessary files to run it on any Unix machine. We also provide access to training and test data for immediate use. [en]
dc.format.mimetype: application/pdf [ca]
dc.identifier.citation: Espinosa-Anke L, Saggion H, Ronzano F. DEFEXT: a semi supervised definition extraction tool. In: Kernerman I, Kosem I, Krek S, Trap-Jensen L, editors. Globalex: Lexicographic Resources for Human Language Technology; 2016 May 24; Portoroz, Slovenia. [Place unknown]: European Language Resources Association, 2016. p. 24-8. [ca]
dc.identifier.uri: http://hdl.handle.net/10230/27836
dc.language.iso: eng [ca]
dc.publisher: European Language Resources Association [ca]
dc.relation.ispartof: Kernerman I, Kosem I, Krek S, Trap-Jensen L, editors. Globalex: Lexicographic Resources for Human Language Technology; 2016 May 24; Portoroz, Slovenia. [Place unknown]: European Language Resources Association, 2016. p. 24-8.
dc.rights: © European Language Resources Association. The LREC 2016 Proceedings are licensed under a Creative Commons Attribution-NonCommercial 4.0 International License. [ca]
dc.rights.accessRights: info:eu-repo/semantics/openAccess [ca]
dc.rights.uri: http://creativecommons.org/licenses/by-nc/4.0/
dc.subject.keyword: Lexicography [en]
dc.subject.keyword: Definition extraction [en]
dc.subject.keyword: Bootstrapping [en]
dc.title: DEFEXT: a semi supervised definition extraction tool [ca]
dc.type: info:eu-repo/semantics/conferenceObject [ca]
dc.type.version: info:eu-repo/semantics/publishedVersion [ca]

Files

Original bundle

Name: espinosa_defext.pdf
Size: 401.75 KB
Format: Adobe Portable Document Format