Extraction and categorization of Japanese lexical collocations with graph-aware transformers

Enllaç permanent

Descripció

Resum
Lexical collocations may be identified and categorized in context, which is helpful for language acquisition, dictionary creation, and many other downstream NLP tasks. However, the automatic collocation extraction and categorization using modern machine learning techniques has not been tried in Japanese. In this paper, a previous work in context-sensitive collocation identification using a sequence tagging BERT-based model improved with a graph-aware transformer architecture is used to investigate its feasibility to Japanese Language. The findings provide the initial insights into the automatic collocation typification in a non Indo-European language using deep learning models, and suggests that low resource languages can benefit from this approach.
Descripció
Treball fi de màster de: Master in Intelligent Interactive Systems
Tutors: Leo Wanner, Alexander Shvets
Col·leccions
Master in Intelligent Interactive Systems. Master Thesis projects

Fitxers