Cureus. 2025 Aug 12;17(8):e89873. doi: 10.7759/cureus.89873. eCollection 2025 Aug.
ABSTRACT
The incidence of suicide, self-harm, and mental crises among teenagers is rising, presenting significant global public health issues. Conventional clinical risk evaluations have inadequate predictive accuracy, often overlooking high-risk adolescents. Machine learning models employing electronic health records provide an innovative method for predicting mental health crises through the integration of intricate clinical data patterns. Hence, this review aims to comprehensively examine and synthesize information about machine learning models created using electronic health record data for predicting suicide attempts, self-harm, or mental hospitalization in teenagers. Adhering to Preferred Reporting Items for Systematic reviews and Meta-Analyses (PRISMA) principles, we executed an exhaustive search across six databases (2000-2025) for peer-reviewed research utilizing machine learning algorithms on electronic health record data to predict adolescent mental health crises. The inclusion criteria emphasized structured or unstructured electronic health record inputs, teenage cohorts (ages 10-20), and performance indicators like area under the curve (AUC), sensitivity, or specificity. The Prediction Model Risk of Bias Assessment Tool (PROBAST) was utilized to evaluate the risk of bias. Our search yielded five studies (2019-2024) that satisfied the inclusion criteria. All studies were retrospective cohorts conducted in high-income nations. Structured electronic health record data (e.g., diagnoses, prescriptions) were frequently utilized; two studies included natural language processing. Machine learning models demonstrated moderate to high discrimination (AUC 0.68-0.88), exhibiting optimal performance in short-term suicide prediction with hybrid data inputs. All investigations, however, exhibited a significant risk of bias in the analysis domain owing to insufficient external validation and absent calibration data. The positive predictive values consistently remained modest across all models. Overall, machine learning models demonstrate potential for enhancing adolescent suicide risk classification utilizing electronic health record data, surpassing numerous traditional instruments. Nonetheless, issues of generalizability, ethical constraints, and implementation obstacles remain. Thorough validation, calibration, and equity assessments are necessary prior to incorporation into clinical practice.
PMID:40951013 | PMC:PMC12426581 | DOI:10.7759/cureus.89873