Extraction and categorization of Japanese lexical collocations with graph-aware transformers

Mostra el registre complet Registre parcial de l'ítem

  • dc.contributor.author Nisho, Kosuke James
  • dc.date.accessioned 2023-01-31T18:45:59Z
  • dc.date.available 2023-01-31T18:45:59Z
  • dc.date.issued 2022
  • dc.description Treball fi de màster de: Master in Intelligent Interactive Systemsca
  • dc.description Tutors: Leo Wanner, Alexander Shvets
  • dc.description.abstract Lexical collocations may be identified and categorized in context, which is helpful for language acquisition, dictionary creation, and many other downstream NLP tasks. However, the automatic collocation extraction and categorization using modern machine learning techniques has not been tried in Japanese. In this paper, a previous work in context-sensitive collocation identification using a sequence tagging BERT-based model improved with a graph-aware transformer architecture is used to investigate its feasibility to Japanese Language. The findings provide the initial insights into the automatic collocation typification in a non Indo-European language using deep learning models, and suggests that low resource languages can benefit from this approach.ca
  • dc.format.mimetype application/pdf*
  • dc.identifier.uri http://hdl.handle.net/10230/55504
  • dc.language.iso engca
  • dc.rights This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International Licenseca
  • dc.rights.accessRights info:eu-repo/semantics/openAccessca
  • dc.rights.uri http://creativecommons.org/licenses/by-nc-nd/4.0ca
  • dc.subject.keyword NLP
  • dc.subject.keyword Collocation
  • dc.subject.keyword Japanese
  • dc.subject.keyword Transformer
  • dc.title Extraction and categorization of Japanese lexical collocations with graph-aware transformersca
  • dc.type info:eu-repo/semantics/masterThesisca