Predicting Bipolar Disorder Treatment Outcomes with Machine Learning: A Comprehensive Evaluation of Random Forest, Gradient Boosting, and Ensemble Approaches

Abstract

Accurate prediction of treatment response in bipolar disorder patients with comorbid obsessive-compulsive disorder (OCD) is essential to improving clinical outcomes and minimizing ineffective interventions. The complex interplay between bipolar disorder and OCD often complicates pharmacological treatment, leading to inconsistent results. This study aims to leverage machine learning (ML) techniques to develop predictive models that enhance the precision of quetiapine monotherapy outcomes. The primary objective of this research is to evaluate the performance of Random Forest (RF) and Gradient Boosting (GB) classifiers in forecasting treatment response, thereby addressing the unpredictability in clinical decision-making. In addition, an ensemble model combining the strengths of RF and GB is developed to optimize predictive accuracy. By incorporating demographic, psychometric, and pharmacological data, the models are trained and validated on a dataset of 300 patients diagnosed with bipolar disorder and comorbid OCD. Comprehensive model evaluations are conducted through visual analyses, including confusion matrices, feature importance plots, precision-recall curves, calibration plots, learning curves, and ROC curves. These visualizations not only reveal the models’ predictive performance but also highlight the key features influencing treatment response. Results indicate that the ensemble model consistently outperforms individual classifiers, achieving higher AUC scores and lower false negative rates, which are critical for minimizing missed treatment opportunities. The findings of this study provide a robust framework for integrating ML into psychiatric care, supporting more personalized and accurate treatment plans. This research underscores the transformative potential of predictive analytics in enhancing therapeutic strategies for complex psychiatric conditions.

Share and Cite:

de Filippis, R. and Al Foysal, A. (2025) Predicting Bipolar Disorder Treatment Outcomes with Machine Learning: A Comprehensive Evaluation of Random Forest, Gradient Boosting, and Ensemble Approaches. Open Access Library Journal, 12, 1-1. doi: 10.4236/oalib.1112897.

1. Introduction

Bipolar disorder (BD) is a severe psychiatric condition characterized by alternating episodes of mania, hypomania, and depression [1]-[3]. When comorbid with obsessive-compulsive disorder (OCD), the complexity of diagnosis and treatment escalates significantly, leading to prolonged suffering and reduced quality of life for patients [4]-[7]. The interplay between mood instability and intrusive thoughts or compulsive behaviors complicates pharmacological strategies, making treatment adherence and symptom management challenging [8]-[10]. Traditional pharmacotherapy, including mood stabilizers and antipsychotics, remains the cornerstone of bipolar disorder management [11]-[13]. However, patients with comorbid OCD often exhibit variable responses to standard treatment regimens, resulting in the need for multiple medication adjustments [14] [15]. This trial-and-error approach can exacerbate symptoms, delay recovery, and increase the risk of side effects. As such, there is a pressing need to develop predictive tools that can accurately forecast treatment responses and guide clinicians in tailoring interventions to individual patient profiles. Machine learning (ML) has emerged as a transformative force in medical research, providing the ability to analyze large datasets, identify complex patterns, and generate predictive models that surpass traditional statistical methods [16]-[18]. Among the most effective ML algorithms are Random Forest (RF) and Gradient Boosting (GB), both of which excel in classification tasks and handling non-linear relationships in data [19] [20]. By leveraging these algorithms, clinicians can harness data-driven insights to improve treatment precision, reduce relapse rates, and optimize patient outcomes. This study investigates the performance of RF and GB classifiers in predicting the response to quetiapine monotherapy in bipolar disorder patients with comorbid OCD. Additionally, we explore the integration of these models into an ensemble approach to enhance predictive accuracy and stability. Through comprehensive model evaluation and visualization, this research aims to establish a robust predictive framework that can facilitate more personalized, effective psychiatric care.

2. Methods

2.1. Data Collection and Preprocessing

To develop predictive models for bipolar disorder treatment outcomes, a synthetic dataset of 300 patients diagnosed with bipolar disorder and comorbid OCD was generated. The dataset was designed to reflect real-world clinical distributions, ensuring that the generated data closely aligns with established psychiatric norms and patterns. Key features included demographic variables such as age and gender, psychometric scores from the Hamilton Depression Rating Scale (HDRS), Young Mania Rating Scale (YMRS), and Yale-Brown Obsessive Compulsive Scale (YBOCS), as well as pharmacological variables including quetiapine dosage and treatment duration. The psychometric scales provided crucial insights into symptom severity and progression, while pharmacological features tracked the relationship between dosage and patient response. To maintain the integrity of the predictive modelling process, the dataset was partitioned into training and testing subsets using a 75/25 split [21] [22]. Stratified sampling was employed to preserve class balance and ensure an even distribution of responders and non-responders across both subsets [23]. This approach mitigated the risk of overfitting and ensured that the model’s performance could be generalized effectively to new data.

2.2. Model Development

Three machine learning models were developed to predict treatment response. The first model utilized Random Forest (RF), an ensemble-based classifier known for its robustness and ability to handle complex datasets [24] [25]. The RF model consisted of 150 decision trees with a maximum depth of 10, leveraging bootstrap sampling to enhance performance and reduce variance. The second model employed Gradient Boosting (GB), a sequential ensemble technique that builds decision trees iteratively, focusing on correcting errors made by previous iterations [26] [27]. This model was configured with 150 estimators, a learning rate of 0.1, and a maximum tree depth of 6, allowing it to effectively capture non-linear patterns in the data. An ensemble model combining RF and GB was developed to further enhance predictive performance. A soft-voting classifier was implemented to aggregate predictions from both models, leveraging the strengths of each algorithm. This ensemble approach improved overall stability and accuracy, reducing model variance and compensating for the individual weaknesses of RF and GB.

2.3. Evaluation Metrics

Model performance was rigorously evaluated using multiple metrics to provide a comprehensive assessment. Accuracy, precision, recall, and F1-score were calculated to measure classification performance across various thresholds. The area under the receiver operating characteristic curve (AUC) was used to assess the model’s ability to distinguish between responders and non-responders. Class imbalance, a common issue in medical datasets, was addressed using the Synthetic Minority Over-sampling Technique (SMOTE) [28]-[30]. This technique generated synthetic samples from the minority class, ensuring the models were not biased toward the majority class. Visual tools such as confusion matrices, feature importance plots, precision-recall curves, calibration plots, learning curves, and ROC curves were employed to interpret the results and provide insights into model behavior. These visualizations not only highlighted the predictive strengths and limitations of each model but also guided the interpretation of key features influencing treatment outcomes.

3. Results

Before delving into the individual figures, it is important to highlight the overarching goal of this analysis—to provide a thorough comparison of machine learning models in predicting treatment response for bipolar disorder patients with comorbid OCD [31] [32]. The visualizations serve as critical tools for interpreting model performance, offering insights into classification accuracy, key feature contributions, and calibration reliability. By integrating these figures, the results not only demonstrate the predictive power of Random Forest and Gradient Boosting but also emphasize the advantages of the ensemble approach in enhancing stability and reducing errors. This comprehensive visual assessment bridges the gap between raw performance metrics and practical clinical application, allowing for a nuanced understanding of model strengths and limitations.

Figure 1 shows the confusion matrices for Random Forest (left) and Gradient Boosting (right). The RF model correctly classified 39 positive cases, with 13 false negatives, while the GB model had 18 false negatives. This highlights RF’s superior ability to minimize false negatives, which is critical in ensuring effective treatment.

Figure 1. Confusion matrices of random forest and gradient boosting models.

As shown in Figure 2, quetiapine dose, YBOCS, and HDRS were the most significant predictors of treatment response. RF assigned greater importance to YMRS, reflecting its role in mood stabilization, while GB emphasized quetiapine dose and age.

Figure 2. Feature importance comparison.

Precision-recall curves in Figure 3 illustrate that RF consistently achieves higher precision across varying recall levels compared to GB. This is crucial for clinical applications, where minimizing false positives while maintaining high recall ensures patients receive appropriate treatment.

Figure 3. Precision-recall curve.

Figure 4. Calibration plot.

Figure 4 presents calibration plots indicating RF’s closer alignment with the ideal diagonal at moderate probabilities, suggesting better probabilistic predictions. GB exhibited overconfidence at lower probabilities, emphasizing the need for post-calibration adjustments.

Figure 5. Learning curve for random forest.

Figure 5 demonstrates the learning curve for RF, showing stable performance as the dataset size increases. RF achieves near-perfect training accuracy, while cross-validation scores steadily improve, indicating robust generalization.

Figure 6 compares the ROC curves of RF, GB, and the ensemble model. The ensemble model achieved an AUC of 0.76, outperforming GB (0.73) and matching RF. This result highlights the ensemble’s ability to integrate the strengths of both models, improving overall performance and reducing variance.

Figure 6. ROC curve comparison.

4. Discussion

The results of this study underscore the transformative potential of machine learning (ML) in revolutionizing the prediction of treatment response in bipolar disorder patients with comorbid obsessive-compulsive disorder (OCD). By leveraging advanced models such as Random Forest (RF) and Gradient Boosting (GB) alongside an ensemble approach that integrates their strengths, this research highlights the critical role of predictive analytics in optimizing psychiatric care [33] [34]. These findings represent a significant step forward in addressing the complexity of psychiatric disorders, particularly in the context of comorbid conditions, where treatment responses can be highly variable and challenging to predict.

4.1. Performance Comparison and Clinical Relevance

The confusion matrices generated during this study provided a detailed performance comparison between RF and GB models. While both algorithms achieved respectable predictive capabilities, RF consistently outperformed GB in minimizing false negatives. This distinction is of paramount importance in clinical settings, as false negatives—where responders are misclassified as non-responders—can lead to delayed treatment interventions and prolonged patient distress [35] [36]. Such delays not only exacerbate the severity of psychiatric symptoms but may also contribute to the development of treatment resistance, a phenomenon that is particularly problematic in mood disorders. By integrating RF and GB through an ensemble approach, the study further demonstrated a reduction in classification errors. The ensemble model combined the individual strengths of RF and GB, achieving a higher degree of predictive accuracy and stability [37]-[39]. This hybrid approach not only bolstered performance metrics but also highlighted the potential of ensemble learning to handle the diverse and nonlinear nature of psychiatric data. Notably, the ensemble model exhibited a lower false negative rate than either RF or GB alone, underscoring its potential to prioritize patient safety by ensuring fewer responders are misclassified. This improvement is crucial in psychiatric care, where precise identification of treatment responders can guide clinicians toward effective interventions, reducing the trial-and-error often associated with pharmacological adjustments.

4.2. Insights from Feature Importance Analysis

The feature-important plots generated in this study provide valuable insights into the clinical variables driving model predictions. Key predictors such as Quetiapine dosage, Yale-Brown Obsessive-Compulsive Scale (YBOCS), and Hamilton Depression Rating Scale (HDRS) consistently emerged as the most influential features. These findings reinforce the significance of psychometric scores in guiding pharmacological decisions for bipolar disorder patients with comorbid OCD. The prominence of Quetiapine dosage underscores its central role in treatment plans, while YBOCS and HDRS highlight the utility of standardized clinical measures in capturing the nuanced dimensions of symptom severity and treatment response. Interestingly, RF assigned greater importance to the Young Mania Rating Scale (YMRS), suggesting that mood instability plays a critical role in predicting treatment outcomes. This divergence between RF and GB highlights the complementary nature of the two models, which the ensemble approach effectively harnesses to enhance overall performance. By integrating the unique strengths of each algorithm, the ensemble model offers a more comprehensive and nuanced perspective on treatment response prediction.

4.3. Precision-Recall and Calibration Insights

Precision-recall curves further illustrate the practical advantages of RF, which consistently maintained higher precision across varying recall levels. This capability is particularly aligned with the needs of psychiatric care, where minimizing false positives is crucial to avoid unnecessary medication adjustments and their associated risks. The ensemble model closely mirrored RF’s performance in precision-recall analyses, reaffirming its robustness as a predictive tool. This reliability ensures that clinicians can make data-driven decisions with confidence, minimizing the risk of over-treatment or under-treatment. The calibration plots provided additional insights into the probabilistic predictions of the models. GB demonstrated a tendency to overestimate probabilities at lower thresholds, while RF exhibited superior calibration at moderate-to-high probabilities. The ensemble model balanced these tendencies, producing more reliable probability estimates that can guide clinicians in decision-making with greater confidence [40] [41]. Reliable probability estimates are essential in clinical practice, where treatment decisions often hinge on nuanced interpretations of risk and likelihood.

4.4. Scalability and Generalizability

Another critical finding of this study was the learning curve analysis, which highlighted RF’s ability to generalize across dataset sizes without overfitting. This scalability is a crucial factor for the practical deployment of predictive models in clinical settings, as it ensures that model performance can be maintained or even improved as more patient data becomes available. The ensemble model exhibited similar learning behavior, reinforcing its suitability for deployment in diverse clinical environments, including those with limited initial datasets. As the volume of electronic health record (EHR) data continues to grow, models with strong generalization capabilities, such as the ensemble approach, are well-positioned to leverage these data sources for enhanced predictive accuracy. By integrating structured data (e.g., psychometric scores and medication dosages) with unstructured data (e.g., clinician notes and patient narratives), future iterations of these models could further improve their clinical utility.

4.5. ROC Curve Analysis and Clinical Implications

The receiver operating characteristic (ROC) curve comparison ultimately validated the superior predictive performance of the ensemble model, achieving an area under the curve (AUC) of 0.76. While the improvement over individual models was marginal, this increment could translate to significant clinical benefits. A higher AUC indicates that the ensemble model is better equipped to balance sensitivity and specificity, which is vital in optimizing treatment plans for complex psychiatric conditions. This finding reinforces the value of ensemble learning in psychiatric applications, particularly in settings where precision and reliability are paramount.

4.6. Future Directions and Research Opportunities

The results of this study provide a strong foundation for future research aimed at further enhancing the predictive capabilities of machine learning models in psychiatry. Key areas for future exploration include:

1) Expanding Datasets with Real-World Clinical Data: Incorporating real-world data from diverse patient populations will improve the robustness and generalizability of predictive models. Longitudinal data could help models predict not only immediate treatment responses but also long-term outcomes such as relapse rates and functional recovery [42]-[44].

2) Exploring Deep Learning Techniques: While RF and GB demonstrated strong performance, the integration of deep learning techniques—such as recurrent neural networks (RNNs) and transformers—could capture temporal patterns and interactions in longitudinal datasets, potentially elevating model performance. Deep learning approaches may also facilitate the integration of multimodal data sources, further enhancing the granularity and accuracy of predictions.

3) Ethical Oversight and Interpretability: Developing explainable AI (XAI) techniques will be essential for ensuring that clinicians can understand and trust the predictions generated by these models. Ethical oversight frameworks must also be established to address potential biases and ensure equitable application of predictive analytics in psychiatric care.

4) Strengthening the Collaborative Role of Clinicians: Predictive models should complement, not replace, clinician expertise. Future research should explore strategies for integrating model outputs into clinical workflows, ensuring that data-driven insights enhance, rather than undermine, the art of psychiatric care.

5. Conclusion

This study highlights the transformative potential of machine learning (ML) in predicting treatment response for bipolar disorder (BD) patients with comorbid obsessive-compulsive disorder (OCD). By evaluating the performance of Random Forest (RF) and Gradient Boosting (GB) models and further enhancing predictive accuracy through an ensemble approach, the research successfully addresses the inherent variability in psychiatric treatment outcomes [45]-[47]. The combination of demographic, psychometric, and pharmacological data allowed for the development of robust predictive models that outperform traditional approaches, providing a foundation for more personalized and effective patient care. The results clearly demonstrate that RF consistently achieves high classification accuracy, with superior recall and precision compared to GB. The ensemble model integrates the strengths of both classifiers, reducing false negatives and ensuring stable performance across diverse datasets. This capacity to minimize classification errors is critical in clinical environments where misclassification can lead to ineffective treatment and prolonged patient distress. Additionally, the feature importance analysis identified quetiapine dosage, YBOCS, and HDRS as primary drivers of treatment response, validating the model’s ability to focus on clinically relevant variables. Future work will involve expanding the dataset with real-world clinical data from diverse populations to improve generalizability and account for broader patient variability. Further research will also explore deep learning architectures to capture more complex non-linear relationships within psychiatric datasets. Refining model hyperparameters and applying advanced techniques such as neural networks or transformer models could further enhance predictive accuracy and stability [48]-[51]. In conclusion, this study successfully achieves its goal of leveraging ML to optimize treatment strategies for BD patients with comorbid OCD. The predictive framework developed here lays the groundwork for data-driven psychiatric care, reducing trial-and-error approaches and facilitating more targeted interventions that align with individual patient profiles.

Conflicts of Interest

The authors declare no conflicts of interest.

Conflicts of Interest

The authors declare no conflicts of interest.

References

[1] Fagiolini, A., Forgione, R., Maccari, M., Cuomo, A., Morana, B., Dell’Osso, M.C., et al. (2013) Prevalence, Chronicity, Burden and Borders of Bipolar Disorder. Journal of Affective Disorders, 148, 161-169.
https://doi.org/10.1016/j.jad.2013.02.001
[2] Palmisano, A., Pandit, S., Smeralda, C.L., Demchenko, I., Rossi, S., Battelli, L., et al. (2024) The Pathophysiological Underpinnings of Gamma-Band Alterations in Psychiatric Disorders. Life, 14, Article 578.
https://doi.org/10.3390/life14050578
[3] Naharcı, M.İ., Çelik, C., Helvalı, E.E., Bodur, Ş., Akçaer, E.T., Can, S.S., et al. (2018) Case Reports Presentations: 10th International Congress on Psychopharmacology & 6th International Symposium on Child and Adolescent Psychopharmacology. Psychiatry and Clinical Psychopharmacology, 28, 114-270.
https://doi.org/10.1080/24750573.2018.1467600
[4] Pedley, R., Bee, P., Wearden, A. and Berry, K. (2019) Illness Perceptions in People with Obsessive-Compulsive Disorder; a Qualitative Study. PLOS ONE, 14, e0213495.
https://doi.org/10.1371/journal.pone.0213495
[5] Fineberg, N.A., Dell’Osso, B., Albert, U., Maina, G., Geller, D., Carmi, L., et al. (2019) Early Intervention for Obsessive Compulsive Disorder: An Expert Consensus Statement. European Neuropsychopharmacology, 29, 549-565.
https://doi.org/10.1016/j.euroneuro.2019.02.002
[6] Markarian, Y., Larson, M.J., Aldea, M.A., Baldwin, S.A., Good, D., Berkeljon, A., et al. (2010) Multiple Pathways to Functional Impairment in Obsessive-Compulsive Disorder. Clinical Psychology Review, 30, 78-88.
https://doi.org/10.1016/j.cpr.2009.09.005
[7] Katz, T.C., Bui, T.H., Worhach, J., Bogut, G. and Tomczak, K.K. (2022) Tourettic OCD: Current Understanding and Treatment Challenges of a Unique Endophenotype. Frontiers in Psychiatry, 13, Article 929526.
https://doi.org/10.3389/fpsyt.2022.929526
[8] Pompili, M., Serafini, G., Del Casale, A., Rigucci, S., Innamorati, M., Girardi, P., et al. (2009) Improving Adherence in Mood Disorders: The Struggle against Relapse, Recurrence and Suicide Risk. Expert Review of Neurotherapeutics, 9, 985-1004.
https://doi.org/10.1586/ern.09.62
[9] Roy-Byrne, P. (2015) Treatment-Refractory Anxiety; Definition, Risk Factors, and Treatment Challenges. Dialogues in Clinical Neuroscience, 17, 191-206.
https://doi.org/10.31887/dcns.2015.17.2/proybyrne
[10] Thase, M.E. (1996) The Role of Axis II Comorbidity in the Management of Patients with Treatment-Resistant Depression. Psychiatric Clinics of North America, 19, 287-309.
https://doi.org/10.1016/s0193-953x(05)70289-6
[11] Jann, M.W. (2014) Diagnosis and Treatment of Bipolar Disorders in Adults: A Review of the Evidence on Pharmacologic Treatments. American Health & Drug Benefits, 7, 489-499.
[12] Baldessarini, R.J., Tondo, L. and Vázquez, G.H. (2018) Pharmacological Treatment of Adult Bipolar Disorder. Molecular Psychiatry, 24, 198-217.
https://doi.org/10.1038/s41380-018-0044-2
[13] Altamura, A.C., Lietti, L., Dobrea, C., Benatti, B., Arici, C. and Dell’Osso, B. (2011) Mood Stabilizers for Patients with Bipolar Disorder: The State of the Art. Expert Review of Neurotherapeutics, 11, 85-99.
https://doi.org/10.1586/ern.10.181
[14] Keeley, M.L., Storch, E.A., Merlo, L.J. and Geffken, G.R. (2008) Clinical Predictors of Response to Cognitive-Behavioral Therapy for Obsessive–Compulsive Disorder. Clinical Psychology Review, 28, 118-130.
https://doi.org/10.1016/j.cpr.2007.04.003
[15] Subramaniam, M., Soh, P., Vaingankar, J.A., Picco, L. and Chong, S.A. (2013) Quality of Life in Obsessive-Compulsive Disorder: Impact of the Disorder and of Treatment. CNS Drugs, 27, 367-383.
https://doi.org/10.1007/s40263-013-0056-z
[16] Rasool, S., Husnain, A., Saeed, A., Gill, A.Y. and Hussain, H.K. (2023) Harnessing Predictive Power: Exploring the Crucial Role of Machine Learning in Early Disease Detection. Jurihum: Jurnal Inovasi dan Humaniora, 1, 302-315.
[17] Filippis, R.D. and Foysal, A.A. (2024) Integrating Explainable Artificial Intelligence (XAI) in Forensic Psychiatry: Opportunities and Challenges. Open Access Library, 11, 1-20.
https://doi.org/10.4236/oalib.1112518
[18] de Filippis, R. and Al Foysal, A. (2024) The Fusion of Minds: Navigating the Confluence of AI, ML, and Psychology in the Digital Era. Journal of Mathematical Techniques and Computational Mathematics, 3, 1-9.
[19] Kalusivalingam, A.K., Sharma, A., Patel, N. and Singh, V. (2022) Leveraging Random Forests and Gradient Boosting for Enhanced Predictive Analytics in Operational Efficiency. International Journal of AI and ML, 3.
[20] Bakro, M., Kumar, R.R., Husain, M., Ashraf, Z., Ali, A., Yaqoob, S.I., et al. (2024) Building a Cloud-IDS by Hybrid Bio-Inspired Feature Selection Algorithms along with Random Forest Model. IEEE Access, 12, 8846-8874.
https://doi.org/10.1109/access.2024.3353055
[21] Farooq, F., Ahmed, W., Akbar, A., Aslam, F. and Alyousef, R. (2021) Predictive Modeling for Sustainable High-Performance Concrete from Industrial Wastes: A Comparison and Optimization of Models Using Ensemble Learners. Journal of Cleaner Production, 292, Article ID: 126032.
https://doi.org/10.1016/j.jclepro.2021.126032
[22] Filippis, R.D. and Foysal, A.A. (2024) Comparative Analysis of Gabaergics vs. Opioids in Chronic Pain Management. Open Access Library, 11, 1-25.
https://doi.org/10.4236/oalib.1112388
[23] Rodrigues, F. and Oliveira, T. (2021) A Data Mining Framework for Response Modelling in Direct Marketing. In: Abraham, A., Piuri, V., Gandhi, N., Siarry, P., Kaklauskas, A. and Madureira, A., Eds., Intelligent Systems Design and Applications, Springer, 357-366.
https://doi.org/10.1007/978-3-030-71187-0_33
[24] Zhang, L. and Suganthan, P.N. (2014) Random Forests with Ensemble of Feature Spaces. Pattern Recognition, 47, 3429-3437.
https://doi.org/10.1016/j.patcog.2014.04.001
[25] Nhu, V., Shirzadi, A., Shahabi, H., Chen, W., Clague, J.J., Geertsema, M., et al. (2020) Shallow Landslide Susceptibility Mapping by Random Forest Base Classifier and Its Ensembles in a Semi-Arid Region of Iran. Forests, 11, Article 421.
https://doi.org/10.3390/f11040421
[26] González, S., García, S., Del Ser, J., Rokach, L. and Herrera, F. (2020) A Practical Tutorial on Bagging and Boosting Based Ensembles for Machine Learning: Algorithms, Software Tools, Performance Study, Practical Perspectives and Opportunities. Information Fusion, 64, 205-237.
https://doi.org/10.1016/j.inffus.2020.07.007
[27] Bentéjac, C., Csörgő, A. and Martínez-Muñoz, G. (2020) A Comparative Analysis of Gradient Boosting Algorithms. Artificial Intelligence Review, 54, 1937-1967.
https://doi.org/10.1007/s10462-020-09896-5
[28] Naseriparsa, M., Al-Shammari, A., Sheng, M., Zhang, Y. and Zhou, R. (2020) RSMOTE: Improving Classification Performance over Imbalanced Medical Datasets. Health Information Science and Systems, 8, Article No. 22.
https://doi.org/10.1007/s13755-020-00112-w
[29] Rahman, M.M. and Davis, D.N. (2013) Addressing the Class Imbalance Problem in Medical Datasets. International Journal of Machine Learning and Computing, 3, 224-228.
https://doi.org/10.7763/ijmlc.2013.v3.307
[30] Soltanzadeh, P. and Hashemzadeh, M. (2021) RCSMOTE: Range-Controlled Synthetic Minority Over-Sampling Technique for Handling the Class Imbalance Problem. Information Sciences, 542, 92-111.
https://doi.org/10.1016/j.ins.2020.07.014
[31] Chekroud, A.M., Bondar, J., Delgadillo, J., Doherty, G., Wasil, A., Fokkema, M., et al. (2021) The Promise of Machine Learning in Predicting Treatment Outcomes in Psychiatry. World Psychiatry, 20, 154-170.
https://doi.org/10.1002/wps.20882
[32] Wu, M., Mwangi, B., Bauer, I.E., Passos, I.C., Sanches, M., Zunta-Soares, G.B., et al. (2017) Identification and Individualized Prediction of Clinical Phenotypes in Bipolar Disorders Using Neurocognitive Data, Neuroimaging Scans and Machine Learning. NeuroImage, 145, 254-264.
https://doi.org/10.1016/j.neuroimage.2016.02.016
[33] Saputra, D.C.E., Ma’arif, A. and Sunat, K. (2024) Optimizing Predictive Performance: Hyperparameter Tuning in Stacked Multi-Kernel Support Vector Machine Random Forest Models for Diabetes Identification. Journal of Robotics and Control (JRC), 4, 896-904.
https://doi.org/10.18196/jrc.v4i6.20898
[34] Grattan, N., Alencar da Costa, D. and Stanger, N. (2024) The Need for More Informative Defect Prediction: A Systematic Literature Review. Information and Software Technology, 171, Article ID: 107456.
https://doi.org/10.1016/j.infsof.2024.107456
[35] Antonelli, M., Conti, G., Esquinas, A., Montini, L., Maggiore, S.M., Bello, G., et al. (2007) A Multiple-Center Survey on the Use in Clinical Practice of Noninvasive Ventilation as a First-Line Intervention for Acute Respiratory Distress Syndrome. Critical Care Medicine, 35, 18-25.
https://doi.org/10.1097/01.ccm.0000251821.44259.f3
[36] Deitch, E.A. (1992) Multiple Organ Failure Pathophysiology and Potential Future Therapy. Annals of Surgery, 216, 117-134.
https://doi.org/10.1097/00000658-199208000-00002
[37] Zhang, Y., Liu, J. and Shen, W. (2022) A Review of Ensemble Learning Algorithms Used in Remote Sensing Applications. Applied Sciences, 12, Article 8654.
https://doi.org/10.3390/app12178654
[38] Ganaie, M.A., Hu, M., Malik, A.K., Tanveer, M. and Suganthan, P.N. (2022) Ensemble Deep Learning: A Review. Engineering Applications of Artificial Intelligence, 115, Article ID: 105151.
https://doi.org/10.1016/j.engappai.2022.105151
[39] Ribeiro, M.H.D.M. and dos Santos Coelho, L. (2020) Ensemble Approach Based on Bagging, Boosting and Stacking for Short-Term Prediction in Agribusiness Time Series. Applied Soft Computing, 86, Article ID: 105837.
https://doi.org/10.1016/j.asoc.2019.105837
[40] Rane, N., Choudhary, S.P. and Rane, J. (2024) Ensemble Deep Learning and Machine Learning: Applications, Opportunities, Challenges, and Future Directions. Studies in Medical and Health Sciences, 1, 18-41.
https://doi.org/10.48185/smhs.v1i2.1225
[41] Jayatilake, S.M.D.A.C. and Ganegoda, G.U. (2021) Involvement of Machine Learning Tools in Healthcare Decision Making. Journal of Healthcare Engineering, 2021, Article ID: 6679512.
https://doi.org/10.1155/2021/6679512
[42] Marques, L., Costa, B., Pereira, M., Silva, A., Santos, J., Saldanha, L., et al. (2024) Advancing Precision Medicine: A Review of Innovative in Silico Approaches for Drug Development, Clinical Pharmacology and Personalized Healthcare. Pharmaceutics, 16, Article 332.
https://doi.org/10.3390/pharmaceutics16030332
[43] Tantray, J., Patel, A., Wani, S.N., Kosey, S. and Prajapati, B.G. (2024) Prescription Precision: A Comprehensive Review of Intelligent Prescription Systems. Current Pharmaceutical Design, 30, 2671-2684.
https://doi.org/10.2174/0113816128321623240719104337
[44] Vallée, A. (2024) Envisioning the Future of Personalized Medicine: Role and Realities of Digital Twins. Journal of Medical Internet Research, 26, e50204.
https://doi.org/10.2196/50204
[45] Kalusivalingam, A.K., Sharma, A., Patel, N. and Singh, V. (2012) Enhancing Hospital Readmission Rate Predictions Using Random Forest and Gradient Boosting Algorithms. International Journal of AI and ML, 1.
[46] Chauhan, N.K. and Singh, K. (2018) A Review on Conventional Machine Learning vs Deep Learning. 2018 International Conference on Computing, Power and Communication Technologies (GUCON), Greater Noida, 28-29 September 2018, 347-352.
https://doi.org/10.1109/gucon.2018.8675097
[47] Okpete, U.E. and Byeon, H. (2024) Challenges and Prospects in Bridging Precision Medicine and Artificial Intelligence in Genomic Psychiatric Treatment. World Journal of Psychiatry, 14, 1148-1164.
https://doi.org/10.5498/wjp.v14.i8.1148
[48] Emad-Eldeen, A., Azim, M.A., Abdelsattar, M. and AbdelMoety, A. (2024) Utilizing Machine Learning and Deep Learning for Enhanced Supercapacitor Performance Prediction. Journal of Energy Storage, 100, Article ID: 113556.
https://doi.org/10.1016/j.est.2024.113556
[49] Cao, K., Zhang, T. and Huang, J. (2024) Advanced Hybrid LSTM-Transformer Architecture for Real-Time Multi-Task Prediction in Engineering Systems. Scientific Reports, 14, Article No. 4890.
https://doi.org/10.1038/s41598-024-55483-x
[50] Ahmed, S.F., Alam, M.S.B., Hassan, M., Rozbu, M.R., Ishtiak, T., Rafa, N., et al. (2023) Deep Learning Modelling Techniques: Current Progress, Applications, Advantages, and Challenges. Artificial Intelligence Review, 56, 13521-13617.
https://doi.org/10.1007/s10462-023-10466-8
[51] Abd Elaziz, M., Dahou, A., Abualigah, L., Yu, L., Alshinwan, M., Khasawneh, A.M., et al. (2021) Advanced Metaheuristic Optimization Techniques in Applications of Deep Neural Networks: A Review. Neural Computing and Applications, 33, 14079-14099.
https://doi.org/10.1007/s00521-021-05960-5

Copyright © 2025 by authors and Scientific Research Publishing Inc.

Creative Commons License

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.