Enhanced word embeddings for anorexia nervosa detection on social media
Citation
- Ramírez-Cifuentes D, Largeron C, Tissier J, Freire A, Baeza-Yates R. Enhanced word embeddings for anorexia nervosa detection on social media. In: Berthold M, Feelders A, Krempl G, editors. Advances in Intelligent Data Analysis XVIII. 18th International Symposium on Intelligent Data Analysis, IDA 2020 Proceedings; 2020 Apr 27-29; Konstanz, Germany. Cham: Springer; 2020. p. 404-17. (LNCS; no. 12080). DOI: 10.1007/978-3-030-44584-3_32
Description
Abstract
Anorexia Nervosa (AN) is a serious mental disorder that has been shown to be traceable on social media through the analysis of users’ written posts. Here we present an approach to generate word embeddings enhanced for a classification task dedicated to the detection of Reddit users with AN. Our method extends Word2vec’s objective function in order to bring domain-specific and semantically related words closer together. The approach is evaluated through the calculation of an average similarity measure, and by using the generated embeddings as features for the AN screening task. The results show that our method outperforms fine-tuned pre-learned word embeddings, related methods dedicated to generating domain-adapted embeddings, and representations learned on the training set using Word2vec. This method can potentially be applied and evaluated on similar tasks that can be formalized as document categorization problems. Regarding our use case, we believe that this approach can contribute to the development of proper automated detection tools to alert and assist clinicians.
Description
Paper presented at: The 18th International Symposium on Intelligent Data Analysis, IDA 2020, held April 27-29, 2020 in Konstanz, Germany.