Multilingual extraction and categorization of lexical collocations with graph-aware transformers
Mostra el registre complet Registre parcial de l'ítem
- dc.contributor.author Espinosa-Anke, Luis
- dc.contributor.author Shvets, Alexander
- dc.contributor.author Mohammadshahi, Alireza
- dc.contributor.author Henderson, James
- dc.contributor.author Wanner, Leo
- dc.date.accessioned 2023-03-01T13:49:03Z
- dc.date.available 2023-03-01T13:49:03Z
- dc.date.issued 2022
- dc.description Comunicació presentada a 11th Joint Conference on Lexical and Computational Semantics (SEM 2022), celebrat del 14 al 15 de juliol de 2022 a Seattle, Estats Units.
- dc.description.abstract Recognizing and categorizing lexical collocations in context is useful for language learning, dictionary compilation and downstream NLP. However, it is a challenging task due to the varying degrees of frozenness lexical collocations exhibit. In this paper, we put forward a sequence tagging BERT-based model enhanced with a graph-aware transformer architecture, which we evaluate on the task of collocation recognition in context. Our results suggest that explicitly encoding syntactic dependencies in the model architecture is helpful, and provide insights on differences in collocation typification in English, Spanish and French.
- dc.description.sponsorship The work by Alexander Shvets and Leo Wanner has been supported by the European Commission in the context of the Horizon 2020 Research Program under the grant numbers 825079 and 870930. Alireza Mohammadshahi is supported by the Swiss National Science Foundation (grant number CRSII5- 180320).
- dc.format.mimetype application/pdf
- dc.identifier.citation Espinosa Anke L, Shvets A, Mohammadshahi A, Henderson J, Wanner L. Multilingual extraction and categorization of lexical collocations with graph-aware transformers. In: Nastase V, Pavlick E, Taher Pilehvar M, Camacho-Collados J, Raganato A, editors. The 11th Joint Conference on Lexical and Computational Semantics (SEM 2022): proceedings of the Conference; 2022 Jul 14-15; Seattle, United States. Stroudsburg: ACL; 2022. p. 89-100. DOI: 10.18653/v1/2022.starsem-1.8
- dc.identifier.doi http://dx.doi.org/10.18653/v1/2022.starsem-1.8
- dc.identifier.uri http://hdl.handle.net/10230/55992
- dc.language.iso eng
- dc.publisher ACL (Association for Computational Linguistics)
- dc.relation.ispartof Nastase V, Pavlick E, Taher Pilehvar M, Camacho-Collados J, Raganato A, editors. The 11th Joint Conference on Lexical and Computational Semantics (SEM 2022): proceedings of the Conference; 2022 Jul 14-15; Seattle, United States. Stroudsburg: ACL; 2022. p. 89-100.
- dc.relation.isreferencedby https://github.com/TalnUPF/graph-aware-collocation-recognition
- dc.relation.projectID info:eu-repo/grantAgreement/EC/H2020/825079
- dc.relation.projectID info:eu-repo/grantAgreement/EC/H2020/870930
- dc.rights © ACL, Creative Commons Attribution 4.0 License
- dc.rights.accessRights info:eu-repo/semantics/openAccess
- dc.rights.uri http://creativecommons.org/licenses/by/4.0/
- dc.subject.other Lexicologia--Informàtica
- dc.subject.other Categorització (Lingüística)
- dc.title Multilingual extraction and categorization of lexical collocations with graph-aware transformers
- dc.type info:eu-repo/semantics/conferenceObject
- dc.type.version info:eu-repo/semantics/publishedVersion