TITLE:
A Stacking-Based Ensemble Approach with Embeddings from Language Models for Depression Detection from Social Media Text
AUTHORS:
Akwa Gaius, Ronald Waweru Mwangi, Antony Ngunyi
KEYWORDS:
Machine Learning, Natural Language Processing, Depression
JOURNAL NAME:
Journal of Data Analysis and Information Processing,
Vol.11 No.4,
November
10,
2023
ABSTRACT: Depression is a major public health problem around the world and
contributes significantly to poor health and poverty. The rate of the number of
people being affected is very high compared to the rate of medical treatment of
the disease. Thus, the disease often remains untreated and suffering continues.
Machine learning has been widely used in many studies in detecting depressive
individuals from their contents on online social networks. From the related
reviews, it is apparent that the application of stacking for diagnosing
depression has been minimal. The study implements stacking based on Extra Tree,
Extreme Gradient Boosting, Light Gradient Boosting and Multi-layer perceptron
and compares its performance to state of the art bagging and boosting ensemble
learners. To better evaluate the effectiveness of the proposed stacking
approach, three pretrain word embeddings techniques including: Word2vec, Global
Vectors and Embeddings from language models were employed with two datasets.
Also, a corrected resampled paired t-test was applied to test the significance
of the stacked accuracy against the baseline accuracy. The experimental results
shows that the stacking approach yields favourable results with a best accuracy
of 99.54%.