Multilingual complex word identification: convolutional neural networks with morphological and linguistic features

Citació

  • Sheang KC. Multilingual complex word identification: convolutional neural networks with morphological and linguistic features. In: Proceedings of the Student Research Workshop (RANLPStud 2019); 2019 Sep 2-4; Varna, Bulgaria. [Varna]: ACL;2019. p. 83-9. DOI: 10.26615/issn.2603-2821.2019_013

Enllaç permanent

Descripció

  • Resum

    Complex Word Identification (CWI) is an essential task in helping Lexical Simplification (LS) identify the difficult words that should be simplified. In this paper, we present an approach to CWI based on Convolutional Neural Networks (CNN) trained on pre-trained word embeddings with morphological and linguistic features. Generally, the majority of works on CWI are either feature-engineered or neural network with word embeddings. Both approaches have advantages and limitations, so here we combine both approaches in order to achieve higher performance and still support multilingualism. Our evaluation has shown that our system achieves quite similar performance as the state-ofthe-art system for English, and it outperforms the state-of-the-art systems for both Spanish and German.
  • Descripció

    Comunicació presentada a: Student Research Workshop (RANLPStud 2019, celebrat del 2 al 4 de setembre de 2019 a Varna, Bulgaria
  • Mostra el registre complet