ALEXSIS: a dataset for lexical simplification in spanish
Full item page Simple item page
- dc.contributor.author Ferrés, Daniel
- dc.contributor.author Saggion, Horacio
- dc.date.accessioned 2022-10-17T06:20:50Z
- dc.date.available 2022-10-17T06:20:50Z
- dc.date.issued 2022
- dc.description Comunicació presentada a: LREC 2022, 13th International Conference on Language Resources and Evaluation, celebrat del 20 al 25 de juny de 2022 a Marsella, França
- dc.description.abstract Lexical Simplification is the process of reducing the lexical complexity of a text by replacing difficult words with easier to read (or understand) expressions while preserving the original information and meaning. In this paper we introduce ALEXSIS, a new dataset for this task, and we use ALEXSIS to benchmark Lexical Simplification systems in Spanish. The paper describes the evaluation of three kind of approaches to Lexical Simplification: a thesaurus-based approach, a single transformers-based approach, and a combination of transformers. We also report state of the art results on a previous Lexical Simplification dataset for Spanish
- dc.description.sponsorship We acknowledge support from the project Context-aware Multilingual Text Simplification (ConMuTeS) PID2019- 109066GB-I00/AEI/10.13039/501100011033 awarded by Ministerio de Ciencia, Innovacion y Universidades (MCIU) ´ and by Agencia Estatal de Investigacion (AEI) of Spain.
- dc.format.mimetype application/pdf
- dc.identifier.citation Ferres D, Saggion H. ALEXSIS: a dataset for lexical simplification in spanish. In: Calzolari N, Béchet F, Blache P, Choukri K, Cieri C, Declerck T, Goggi S, Isahara H, Maegaard B, Mariani J, Mazo H, Odijk J, Piperidis S, editors. LREC 2022, 13th International Conference on Language Resources and Evaluation; 2022 June 20-25; Marseille, France. Paris: European Language Resources; 2022. p. 3582-94.
- dc.identifier.uri http://hdl.handle.net/10230/54417
- dc.language.iso eng
- dc.publisher European Language Resources
- dc.relation.ispartof Calzolari N, Béchet F, Blache P, Choukri K, Cieri C, Declerck T, Goggi S, Isahara H, Maegaard B, Mariani J, Mazo H, Odijk J, Piperidis S, editors. LREC 2022, 13th International Conference on Language Resources and Evaluation; 2022 June 20-25; Marseille, France. Paris: European Language Resources; 2022. p. 3582-94.
- dc.relation.projectID info:eu-repo/grantAgreement/ES/2PE/PID2019-109066GB-I00
- dc.rights © European Language Resources Association (ELRA) These LREC2022 proceedings are licensed under a Creative Commons Attribution-NonCommercial 4.0 International License
- dc.rights.accessRights info:eu-repo/semantics/openAccess
- dc.rights.uri https://creativecommons.org/licenses/by-nc/4.0/
- dc.subject.keyword Lexical Simplification
- dc.subject.keyword Text Simplification
- dc.subject.keyword Evaluation Dataset
- dc.title ALEXSIS: a dataset for lexical simplification in spanish
- dc.type info:eu-repo/semantics/conferenceObject
- dc.type.version info:eu-repo/semantics/publishedVersion