Nisho, Kosuke James2023-01-312023-01-312022http://hdl.handle.net/10230/55504Treball fi de màster de: Master in Intelligent Interactive SystemsTutors: Leo Wanner, Alexander ShvetsLexical collocations may be identified and categorized in context, which is helpful for language acquisition, dictionary creation, and many other downstream NLP tasks. However, the automatic collocation extraction and categorization using modern machine learning techniques has not been tried in Japanese. In this paper, a previous work in context-sensitive collocation identification using a sequence tagging BERT-based model improved with a graph-aware transformer architecture is used to investigate its feasibility to Japanese Language. The findings provide the initial insights into the automatic collocation typification in a non Indo-European language using deep learning models, and suggests that low resource languages can benefit from this approach.application/pdfengThis work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International LicenseExtraction and categorization of Japanese lexical collocations with graph-aware transformersinfo:eu-repo/semantics/masterThesisNLPCollocationJapaneseTransformerinfo:eu-repo/semantics/openAccess