Design and Validation of an AI-Assisted Sequential Screening Framework for Psychological Distress in Glaucoma
=============================================================================================================

* Natalie A. Chou
* Youngsoo Baek
* Feilin Feng
* Kaiwen Lu
* Eun Young Choi
* Hannah M. Fisher
* Davina Malek
* Alessandro Jammal
* Tamara J. Somers
* Kelly W. Muir
* Felipe Medeiros
* Samuel I. Berchuck

## ABSTRACT

**Purpose** Psychological distress is highly prevalent in glaucoma and is associated with worse adherence, reduced quality of life, and faster disease progression. However, distress is rarely assessed in ophthalmology settings due to time, workflow, and staffing constraints. We evaluated two artificial intelligence (AI)-based screening strategies, designed to efficiently identify distressed primary open angle glaucoma (POAG) patients during routine care, aiming to achieve effective, resource conscious, low burden clinical screening.

**Design** Hybrid retrospective cohort and prospective cross-sectional study.

**Participants** The retrospective cohort included >3,000 POAG patients from the Duke Ophthalmic Registry. Prospective validation was conducted in a separate 300 POAG patient cohort who completed patient-reported distress screening.

**Methods** Using retrospective data, a neural network model was trained to predict an electronic health record (EHR)-derived computable phenotype of distress (“silver standard”). Prospective validation used the 8-item Patient Health Questionnaire (PHQ-8) as the “gold standard.” Three screening strategies were compared against PHQ-8: (1) universal PHQ-2 screening (two-item screener administered to all patients), (2) AI-only screening (fully automated EHR-based screener), and (3) sequential screening, (only patients flagged as high risk by AI screener completed the PHQ-2). Performance metrics included sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), accuracy, and screening burden.

**Main Outcome Measures** Sensitivity; specificity; PPV; NPV; accuracy; proportion of patients requiring secondary screening (screening burden).

**Results** Distress prevalence was 17% (PHQ-8 > 6). Universal PHQ-2 screening (> 0) achieved high sensitivity (0.96) but lower specificity (0.73) and PPV (0.41), while requiring screening of all patients. The AI-assisted sequential approach substantially reduced screening burden while maintaining strong diagnostic performance. By administering PHQ-2 to ∼25% of patients, sequential screening achieved sensitivity 0.64, specificity 0.93, PPV 0.64, NPV 0.93, and accuracy 0.88, representing a ∼50% increase in PPV compared to PHQ-2 alone. AI-only screening reduced burden further but did not achieve comparable sensitivity or predictive performance.

**Conclusions** AI-assisted sequential screening enables scalable, resource efficient identification of psychological distress in glaucoma care, substantially reducing screening burden while preserving clinically meaningful performance. This framework offers a practical pathway for integrating distress screening into routine ophthalmology workflows and improving the identification and referral of at-risk patients.

Key Words
*   Primary open-angle glaucoma
*   Psychological distress screening
*   Machine learning
*   Electronic health records
*   Clinical decision support

Primary open angle glaucoma (POAG) is a chronic, progressive optic neuropathy that affects over 65 million people aged 40-80 worldwide,1 and remains the leading cause of irreversible vision loss.2 As such, the uncertainty about future vision loss and the disease’s gradual progression contribute to considerable psychological burden in patients.3 While management primarily targets intraocular pressure (IOP) reduction to slow functional decline, factors such as emotional wellbeing, social support, and mental health, are increasingly recognized as critical in disease evaluation and care.4,5

The chronic course of POAG often leads to psychological distress including anxiety, depression, fear of blindness, and social isolation, all of which substantially reduce quality of life.6–8 Psychological distress prevalence in POAG patients is notably high, with up to 25% experiencing anxiety and 30% experiencing depression, rates more than 10% higher than the general population.6 Disease severity influences these outcomes, as worse functional disease severity8 and faster rates of progression9 correlate with greater depression. Importantly, psychological distress impacts disease management and prognosis. Patients reporting distress frequently show poor glaucoma comprehension,10 missed follow up visits,11 and suboptimal medication adherence,12 contributing to long term visual field loss. Psychological distress may additionally exert a direct physiologic effect, as stress can elevate IOP, the only modifiable risk factor in glaucoma.4 Experimental studies in primates13 and humans14 confirm that acute stress can increase IOP. Addressing psychological distress early in the treatment pathway could improve adherence, outcomes, and quality of life, but routine screening in glaucoma care is not common.

Early identification of psychological distress enables timely intervention and reduces adverse outcomes.15 However, conventional screening approaches are often limited by substantial time and cost demands.16 To address these challenges, several medical specialties have adopted more efficient strategies, including brief self-report questionnaires and two stage screening frameworks. In oncology and cardiology, for example, initial large scale pre-screening with short instruments, such as the National Comprehensive Cancer Network (NCCN) Distress Thermometer17 or the two item Patient Health Questionnaire18, is followed by more comprehensive assessment only in patients with positive results.19 This approach reduces resource burden while targeting high risk patients for further evaluation.20 In glaucoma care, however, routine psychological distress screening remains uncommon despite its potential to identify vulnerable patients and enable early intervention, representing a critical gap in supportive disease management. The most direct solution would be universal administration of validated patient reported outcome (PRO) measures. Our group has taken steps toward this goal, including recent work validating the Distress Thermometer in patients with POAG, supporting the feasibility and clinical relevance of routine distress screening in this population.21 Nevertheless, universal screening has not been widely implemented in ophthalmology practice due to persistent workflow, time, and resource constraints. As a result, there is a need for scalable approaches that approximate the benefits of universal screening while remaining feasible in routine care. In this context, we propose an AI-assisted screening framework designed to reduce screening burden while preserving performance comparable to universal screening.

Electronic health record (EHR) data have been widely used for predictive modeling across clinical domains,22–25 including work demonstrating accurate prediction of incident suicide risk.26 These advances highlight the potential of artificial intelligence (AI) to enable scalable, population level identification of patients at risk for adverse outcomes. In prior work, we developed machine learning models using EHR data from a large retrospective glaucoma cohort and demonstrated strong discrimination for a computable distress phenotype (area under the receiver operating characteristic [ROC] curve > 0.90).27 However, because patient reported distress measures were not available in that setting, outcomes were defined using an EHR-based proxy (hereafter referred to as a “silver standard”). While such approaches are useful for large scale model development, understanding clinical impact requires evaluation against validated PROs that directly measure psychological distress.

To address this gap, we conducted a prospective study with PRO measures of distress, enabling evaluation against a gold standard in a clinical setting. We evaluated whether AI-based screening strategies can approximate the performance of universal screening while reducing implementation burden. Specifically, we compared three approaches for identifying psychological distress: (1) universal screening using the two item Patient Health Questionnaire (PHQ-2), (2) fully automated AI-only screening based on EHR data, and (3) an AI-assisted sequential screening strategy in which the algorithm is used to pre-screen patients for targeted questionnaire administration. This sequential approach leverages AI to reduce screening burden by limiting manual questionnaire screening to a subgroup of high-risk patients, rather than relying on algorithmic predictions alone. This design enables direct evaluation of the tradeoff between screening burden and diagnostic performance across approaches. Together, these analyses provide a practical framework for implementing a scalable, clinically actionable distress screening in glaucoma care.

## METHODS

### Study Design

This study used data from both a large retrospective EHR cohort and a prospective cohort with PROs. The retrospective cohort was used to develop machine learning models using an EHR-derived phenotype of distress (silver standard), while the prospective cohort was used to evaluate model performance against PRO measures of distress (gold standard). Retrospective data were obtained from the Duke Ophthalmic Registry, which includes patients evaluated at the Duke Eye Center and affiliated satellite clinics between 2012 and 2021. Eligible patients were adults aged ≥18 years with POAG. Prospective data were obtained from a study examining patient attitudes toward psychological distress screening in glaucoma clinics.21,28 This study was approved by the Duke University Institutional Review Board (Pro00108155). The retrospective component was conducted under a waiver of informed consent due to the use of existing clinical data, and all participants in the prospective study provided informed consent. All procedures adhered to the tenets of the Declaration of Helsinki and complied with the Health Insurance Portability and Accountability Act.

**Figure 1** provides an overview of the study design, including development of the AI algorithm in the retrospective cohort and external validation in the prospective cohort. **Figure 2** summarizes the three screening strategies evaluated: (1) universal screening, (2) AI-only screening, and (3) AI-assisted sequential screening.

![Figure 1.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2026/05/22/2026.05.20.26353679/F1.medium.gif)

[Figure 1.](http://medrxiv.org/content/early/2026/05/22/2026.05.20.26353679/F1)

Figure 1. 
Schematic of Data Collection and Machine Learning Workflow. In the first phase, an AI model was trained on electronic health record (EHR) data from a retrospective cohort (Duke Ophthalmic Registry) to predict a silver standard outcome derived from EHR data. Model tuning and internal validation were performed within the retrospective dataset. In the second phase, the trained model was externally validated in a prospective cohort, based on its predictions for both silver standard outcomes and gold standard patient-reported outcomes (PROs).

![Figure 2.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2026/05/22/2026.05.20.26353679/F2.medium.gif)

[Figure 2.](http://medrxiv.org/content/early/2026/05/22/2026.05.20.26353679/F2)

Figure 2. 
Comparison of Screening Strategies. Three approaches to identifying psychological distress are shown: (1) *Universal* screening, in which a patient-reported outcome (PRO) is administered to all patients; (2) *AI-only* screening, which relies exclusively on the trained AI model (Figure 1) to classify patients; and (3) *Sequential* screening, in which the AI model first identifies a subset of high-risk patients who then undergo targeted screening with a brief PRO. In Stage 1 (algorithm training), yellow and purple figures represent patients with positive and negative silver standard distress phenotypes. In Stage 2 (manual screening), green and red figures represent patients with and without distress based on the gold standard. Blue shading covers patients who receive PROs under each strategy. Green dashed line covers patients ultimately classified as having distress for referral.

### Retrospective Cohort and Model Development

The retrospective cohort was derived from the Duke Ophthalmic Registry and included adults (aged ≥18 years) with POAG who were evaluated at the Duke Eye Center and affiliated satellite clinics between 2012 and 2021. Patients were required to have at least two encounters and a minimum of one year of follow up. To align with the prospective validation cohort, analyses were restricted to encounters in the glaucoma clinic. POAG was defined using International Classification of Diseases diagnostic codes (H40.1), and the unit of analysis was the clinical encounter.

Psychological distress was defined using an encounter-level EHR-derived computable phenotype based on a validated Phenotype KnowledgeBase (PheKB) depression algorithm, as described in prior work.27 Because this definition relies on proxy clinical data rather than direct patient report, it is referred to as a silver standard measure of distress. Predictors were constructed from structured EHR data available prior to each encounter and included demographics, healthcare utilization (diagnoses, procedures, medications, and encounters), and problem-list information, resulting in a high-dimensional feature set for model development (1,844 variables). Socioeconomic variables, including income and education, were derived from U.S. Census data and linked at the Zip Code level. Income level was measured by per capita income in the past 12 months and was race specific. Education was measured by the percentage of residents who achieved a high school education and was sex specific. Insurance status (commercial, Medicaid, Medicare, uninsured, other) was also included.

Two predictive models were trained to estimate encounter-level probabilities of psychological distress: an elastic net classifier and a feedforward neural network. The elastic net model was fit using patient-level 10-fold nested cross-validation to select optimal regularization parameters and prevent overfitting. The neural network consisted of a four-layer architecture with ReLU activation functions and was trained using the Adam optimizer,29 with early stopping to improve generalization.30 To enhance stability, predictions were averaged across multiple random initializations. Model performance was evaluated using the area under the receiver operating characteristic (ROC) curve and precision-recall (PR) curve in held-out data constructed via patient-level splitting. These models generated the predicted probabilities used for AI-only screening and as the first stage of the sequential screening strategy.

### Prospective Cohort and Outcome Assessment

Patients with POAG were recruited from the Duke Eye Center glaucoma clinics between September 2022 and August 2023. Study procedures have been described previously and are briefly summarized here.21,28 Eligibility required age ≥18 years, a confirmed POAG diagnosis, at least one prior glaucoma clinic visit, and a minimum of one year of follow up. Exclusion criteria included recent glaucoma or cataract surgery within 6 months, reported or suspected serious mental illness (e.g., psychosis), or legal blindness, as determined by provider or chart review. Eligible patients were contacted by email prior to scheduled clinic visits, and consenting participants completed a set of questionnaires.

Demographic and clinical characteristics were collected to describe the cohort. Patient reported measures included age, sex, race, ethnicity, marital status, education, income, and employment status. Clinical data were obtained from the Duke Ophthalmic Registry and supplemented by manual chart review. Eye specific measures included intraocular pressure (IOP) and standard automated perimetry mean deviation (SAP MD), reported for the better eye.

Psychological distress was assessed using the Patient Health Questionnaire-8 (PHQ-8), which served as the gold standard measure of distress.18 The PHQ-8 includes all items from the PHQ-9 except for the question assessing suicidal or self-injurious thoughts; this item was excluded to avoid triggering risk assessments that could not be reliably and sufficiently managed in a remote research setting. Prior validation studies have demonstrated that the removal of this item has minimal impact on total scores and does not alter established cutoffs for depression severity.31,32 The PHQ-8 evaluates the frequency of depressive symptoms over the prior two weeks, with total scores ranging from 0 to 24. Consistent with prior work, clinically relevant psychological distress was defined as a PHQ-8 score > 6.33,34 The PHQ-8 was used for evaluation purposes and was not part of the screening strategies assessed in this study.

### Screening Strategies

We evaluated three approaches for identifying psychological distress: (1) universal questionnaire-based screening, (2) AI-only screening, and (3) an AI-assisted sequential screening strategy. Universal screening was conducted using the two item Patient Health Questionnaire (PHQ-2), a brief validated screener for depressive symptoms. In accordance with American Heart Association (AHA) recommendations for depression screening,18 any positive response (score > 0) was considered indicative of a positive screen. In the AI-only approach, our EHR-based algorithm was used to assign each patient a predicted probability of psychological distress based on routinely collected clinical data. Patients with predicted probabilities above a specified threshold were classified as distressed (more details in the following Statistical Evaluation section). This approach is fully automated and does not require administration of patient questionnaires, relying solely on EHR-derived predictions.

Finally, the AI-assisted sequential strategy combines algorithmic risk stratification with targeted questionnaire-based assessment. In this approach, the AI algorithm is first used to identify patients at elevated risk for psychological distress. Only these high-risk patients then undergo questionnaire-based screening using the PHQ-2, and patients with a positive PHQ-2 result are classified as distressed. This design follows established two stage screening frameworks used in other clinical domains, such as oncology and cardiology,35,36 while reducing the burden of universal screening by limiting questionnaire administration to a subset of high-risk patients.

### Statistical Evaluation

Summary statistics for cohort characteristics are presented using mean and standard deviation (SD) for continuous variables and counts and percentages for categorical variables. Hypothesis tests are presented, with categorical variables tested using a Chi-square test, and continuous variables tested using a Wilcoxon rank sum test. Model discrimination was evaluated using ROC and precision-recall (PR) curves in both the training (retrospective) and test (prospective) datasets. In the training data, performance was assessed against the EHR-derived silver standard. In the test data, performance was evaluated against both the silver standard and the patient reported gold standard. Areas under the curve were computed for both ROC and PR curves.

Screening strategies were evaluated using the prospective cohort with PROs as the gold standard (PHQ-8 > 6). Diagnostic performance was assessed using sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), and overall accuracy. We considered two complementary evaluation approaches reflecting distinct clinical objectives. First, to assess whether AI-only screening could serve as a substitute for universal questionnaire-based screening, thresholds for the AI model were selected to match the specificity of PHQ-2 screening (score > 0). This enabled a direct comparison of sensitivity, predictive values, and accuracy under comparable operating conditions. Results are presented for both elastic net and neural network models. Second, for the AI-assisted sequential strategy, we aimed not to replicate the operating characteristics of universal screening, but to evaluate the tradeoff between screening burden and diagnostic performance. In this setting, thresholds for the AI model were selected using Youden’s index37 to balance sensitivity and specificity. The PHQ-2 threshold indicating distress was fixed at any score > 0. Screening burden was defined as the proportion of patients requiring questionnaire-based screening, which varies with the chosen AI threshold; operating characteristics were evaluated as a function of this proportion to characterize the tradeoff between burden and performance. Operating characteristics for the sequential strategy were further examined across demographic subgroups, including age, sex, and race.

Variable importance was computed for both models. For the elastic net model, the largest 25 coefficients in absolute value were presented along with their odds ratios (OR). OR p-values were not presented, since they cannot be computed reliably for the elastic net model.38 The non-zero demographic predictors were also presented. For a measure of variable importance in neural network, Shapley values39 were computed on a random subset of 1,000 encounters from the EHR data. Variables with the top 25 absolute Shapley values for positive prediction were presented, along with a list of demographic variables with the top 10 largest values.

All analyses were conducted using R (Version 4.0.5, R Core Team, Vienna, Austria) and Python (Version 3.13, Python Software Foundation) within the Protected Analytics Computing Environment (PACE) at Duke University. The glmnet and PyTorch packages were used for model development.40–42

## RESULTS

The retrospective cohort consisted of 31,043 encounters from 3,149 patients with a mean (SD) of 9.9 (7.3) encounters per patient over 2.6 (2.2) years of follow up. The mean age of the patients at first encounter was 68.6 (13.2) years with a breakdown of 1,790 (57%) females, 1,651 (52%) White, 1,337 (42%) Black/African American, and the rest Asian, multiracial, and other races. There were 1,240 (39%) patients with at least one encounter with a positive silver standard distress phenotype. The prospective cohort consisted of 300 patients with an average age of 68.4 (12.5) years, with a breakdown of 155 (52%) females, 242 (81%) White, and 46 (15%) Black/African American. Full summary details for the retrospective and prospective cohorts can be found in **Table 1** and **Table 2**, respectively.

View this table:
[Table 1.](http://medrxiv.org/content/early/2026/05/22/2026.05.20.26353679/T1)

Table 1. 
Patient-level summary statistics for the training set data derived from a retrospective cohort. Summaries are presented across the silver standard distress status (at least one encounter with a positive EHR-derived distress phenotype). P-values represent hypothesis tests across distress. Continuous variables were tested using Wilcoxon rank sum test and categorical variables with Chi-square exact test.

View this table:
[Table 2.](http://medrxiv.org/content/early/2026/05/22/2026.05.20.26353679/T2)

Table 2. 
Summary statistics for the test set data derived from a prospective cohort. Summaries are presented across gold standard distress status (PHQ-8 > 6). P-values represent hypothesis tests across distress. Continuous variables were tested using Wilcoxon rank sum test and categorical variables with Chi-square test.

**Figure 3** presents ROC and PR curves for both machine learning models evaluated in the training (retrospective) and test (prospective) data, against both silver and gold standard outcomes. Both models demonstrated strong discrimination when predicting the silver standard, with ROC AUCs of 0.93 (neural network) and 0.86 (elastic net) in the training data, and 0.83 and 0.86, respectively, in the test data. PR AUCs were 0.89 and 0.79 in training, and 0.79 for both models in the test data. When evaluated against the gold standard in the test data, discrimination was lower, reflecting the increased difficulty of predicting PROs. However, the neural network retained meaningful predictive signal, with ROC AUC of 0.76 and PR AUC of 0.39, and outperformed the elastic net (ROC AUC 0.64; PR AUC 0.30). For comparison, universal PHQ-2 screening achieved higher discrimination (ROC AUC 0.94; PR AUC 0.80). Given the superior performance of the neural network for the gold standard outcome, subsequent analyses focus on the neural network model, with elastic net results reported for completeness.

![Figure 3.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2026/05/22/2026.05.20.26353679/F3.medium.gif)

[Figure 3.](http://medrxiv.org/content/early/2026/05/22/2026.05.20.26353679/F3)

Figure 3. 
Receiver operating characteristic (ROC; top row) and precision-recall (PR; bottom row) curves for distress prediction models in training and test data. The left and middle columns show performance for predicting the silver standard distress phenotype in the training and test sets, respectively, comparing neural network (NN) and elastic net (EN) models. The right column shows performance for predicting gold-standard distress (PHQ-8 > 6) in the test set, comparing NN, EN, and PHQ-2 screening. Area under the curve (AUC) values are reported for each model and outcome.

**Table 3** compares the diagnostic performance of the three screening approaches. Universal screening using PHQ-2 achieved high sensitivity (0.96) and NPV (0.99), with specificity of 0.73, PPV of 0.41, and accuracy of 0.77, but required questionnaire administration for all patients. In contrast, AI-only screening eliminated the need for questionnaire-based screening but did not achieve comparable diagnostic performance. The neural network-based screening, with the threshold fixed to achieve the same specificity of 0.73 (anyone with a predicted probability of distress ≥ 21%), yielded sensitivity of 0.68, PPV of 0.33, NPV of 0.92, and accuracy of 0.72. The sequential screening strategy, using the Youden’s index-based threshold (anyone with a predicted probability of distress ≥ 26%) provided a balance between these approaches. When using the neural network, sequential screening achieved the highest specificity (0.93), PPV (0.64), and accuracy (0.88), while maintaining high NPV (0.93), with only a modest reduction in sensitivity (0.64). This improvement was achieved while limiting questionnaire-based screening to 28% of patients, compared to 100% under universal screening. **Table 4** presents the diagnostic characteristics across subgroups of age, sex and race for the sequential screening method relying on the neural network. The same breakdown is given for the elastic net model in **Supplementary Table S1**. These findings highlight a tradeoff between sensitivity and operational feasibility, with sequential screening achieving improved precision while substantially reducing screening burden.

View this table:
[Supplementary Table S1.](http://medrxiv.org/content/early/2026/05/22/2026.05.20.26353679/T5)

Supplementary Table S1. 
Operating characteristics of the elastic net sequential screening strategy by demographic subgroup. Counts, number of patients with distress, and distress prevalence are also reported for each subgroup.

View this table:
[Table 3.](http://medrxiv.org/content/early/2026/05/22/2026.05.20.26353679/T3)

Table 3. 
Operating characteristics of manual, AI-only, and sequential screening strategies for identifying elevated distress (PHQ-8 > 6). Manual screening was defined as PHQ-2 > 0. AI-only strategies (neural network / elastic net) were evaluated under two different thresholding methods, applied to each predicted probability of distress: (1) thresholds selected to match the specificity of manual screening (specificity = 0.73) to enable direct comparison, and (2) thresholds selected using Youden’s index to optimize discrimination. Sequential screening used the AI model (threshold selected by Youden’s index) as a first-stage triage tool, with PHQ-2 administered only to patients identified as high risk. Proportion requiring screening denotes the fraction of participants receiving PHQ-2.

View this table:
[Table 4.](http://medrxiv.org/content/early/2026/05/22/2026.05.20.26353679/T4)

Table 4. 
Operating characteristics of the neural network sequential screening strategy by demographic subgroup. Counts, number of patients with distress, and distress prevalence are also reported for each subgroup.

**Figure 4** illustrates the tradeoff between screening burden and diagnostic performance for the neural network sequential strategy. The vertical dashed line indicates the operating point selected using Youden’s index, which corresponds to the results in **Table 3**. As the operating threshold varies, the proportion of patients requiring questionnaire-based screening changes, with corresponding changes in sensitivity, specificity, and predictive values. These curves demonstrate that the operating point can be adjusted to achieve different balances between screening burden and diagnostic performance. **Supplementary Figure S1** shows the same curves for elastic net.

![Supplementary Figure S1.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2026/05/22/2026.05.20.26353679/F5.medium.gif)

[Supplementary Figure S1.](http://medrxiv.org/content/early/2026/05/22/2026.05.20.26353679/F5)

Supplementary Figure S1. 
Operating characteristics of the elastic net sequential screening strategy as a function of the proportion requiring manual screening. The vertical dashed line indicates the threshold corresponding to Youden’s index, which was used to compute the metrics in **Tables 3** and **S1**.

![Figure 4.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2026/05/22/2026.05.20.26353679/F4.medium.gif)

[Figure 4.](http://medrxiv.org/content/early/2026/05/22/2026.05.20.26353679/F4)

Figure 4. 
Operating characteristics of the neural network sequential screening strategy as a function of the proportion requiring manual screening. The vertical dashed line indicates the threshold corresponding to Youden’s index, which was used to compute the metrics in **Tables 3** and **4**.

Variable importance measures for each model are reported in the **Supplementary Tables S2-S4. Table S2** presents a list of variables with the 25 highest absolute Shapley values for the neural network predicting positive psychological distress; **Supplementary Table S3** contains the top 10 demographic predictors for the same model. Finally, **Supplementary Table S4** shows the top 25 predictors from the elastic net model ordered by the largest coefficients in absolute value.

View this table:
[Supplementary Table S2.](http://medrxiv.org/content/early/2026/05/22/2026.05.20.26353679/T6)

Supplementary Table S2. 
Predictors with the largest absolute Shapley values for positive prediction. All utilization variables presented used a 5-year window prior to the encounter. Also presented are the number of encounters with each predictor present, for both encounters with and without distress, along with corresponding proportions.

View this table:
[Supplementary Table S3.](http://medrxiv.org/content/early/2026/05/22/2026.05.20.26353679/T7)

Supplementary Table S3. 
Demographic predictors with the largest absolute Shapley values for positive prediction.

View this table:
[Supplementary Table S4.](http://medrxiv.org/content/early/2026/05/22/2026.05.20.26353679/T8)

Supplementary Table S4. 
Odds ratios (ORs) for the top 25 predictors of distress for elastic net, ordered by coefficient absolute values. In parenthases is the time window prior the encounter that the variable corresponds to, one year (1y) or five years (5y).

## DISCUSSION

This study developed and validated AI-based screening strategies for identifying psychological distress in glaucoma care, including both a fully automated AI-only approach and an AI-assisted sequential screening framework. Our findings demonstrate that while AI-only screening does not achieve performance comparable to universal PHQ-2 administration, it captures meaningful predictive signal that can be leveraged to improve efficiency when combined with targeted PRO screening. The sequential approach improved specificity, PPV, and overall accuracy, while reducing the need for questionnaire-based screening by approximately fourfold, supporting a practical balance between diagnostic performance and real world feasibility.

A central challenge of implementing psychological distress screening in ophthalmology is balancing its diagnostic performance with the practical constraints of clinical workflows. Universal PHQ-2 screening maximizes sensitivity but requires screening all patients, which may be difficult to sustain in busy clinics with limited staff and competing priorities. Conversely, fully automated approaches reduce burden, but rely solely on EHR-derived information and may not capture patient reported distress with sufficient accuracy. These constraints motivate approaches that can selectively target screening efforts while preserving meaningful clinical performance.

Consistent with this framing, the AI-only approach demonstrated uniformly lower performance than universal PHQ-2 screening across sensitivity, PPV, NPV, and accuracy when evaluated at matched specificity. In contrast, the sequential framework improved PPV and overall accuracy while substantially reducing screening burden. Although universal screening achieved the highest sensitivity, it required manual questionnaire administration for all patients. AI-only screening eliminated this burden, but at the cost of reduced sensitivity and predictive performance. The neural network-based sequential approach, borrowing strengths of both strategies, achieved the highest specificity and accuracy, and markedly increased PPV to 0.64, compared to 0.41 for PHQ-2 alone (approximately a 50% relative increase). Subgroup analyses (**Table 4**) further show that PPV remained generally high across demographic groups, indicating that the improved precision of the sequential approach was consistent across populations.

A key advantage of the sequential screening framework is its higher PPV and specificity, ensuring that patients flagged by the method are more likely to benefit from downstream psychological resources. While both universal and sequential approaches aim at identifying subsets of patients for referral, the sequential framework reduces the number of patients who require initial questionnaire-based screening, focusing clinical attention earlier in the workflow on a smaller, higher-risk group. The sequential approach therefore yields a more targeted cohort for referral (e.g., to a social worker), optimizing the limited availability of mental health service providers and increasing the likelihood of meaningful intervention. Efficient allocation is particularly critical in under resourced settings, as it promotes equity by prioritizing those with greater need, reduces stigma associated with false positives, and strengthens provider confidence in referrals. Moreover, this framework can be readily adapted to other chronic diseases characterized by similar workflow and psychological care challenges43 and aligns with value based care principles, emphasizing efficient, personalized screening that balances diagnostic precision with real world feasibility. Finally, any AI model that outputs predicted probabilities of psychological distress can be seamlessly integrated into this framework, enabling further improvements in performance as models evolve.

Reduced sensitivity in the sequential approach reflects an intentional and pragmatic tradeoff to improve feasibility. As illustrated in **Figure 4**, thresholds that maximize sensitivity come at the cost of substantially lower specificity and PPV, whereas modestly higher thresholds yield meaningful gains in predictive value and screening efficiency. In this context, higher PPV is particularly valuable, as patients identified by the sequential method are more likely to have true psychological distress, increasing clinical confidence and the likelihood of effective follow up. Furthermore, higher specificity resulting from algorithmic preselection implies that most patients not at risk avoid unnecessary and potentially distressing assessment, preserving scarce mental health resources.

These tradeoffs are especially relevant in ophthalmology, where universal PRO-based screening is often impractical due to workflow constraints, limited staff capacity, and the absence of standardized referral pathways.44 In such settings, high sensitivity approaches, while ideal in theory, may become “assessment without action,” where positive screens do not reliably lead to follow up care.45 The sequential framework instead aligns with real world clinical practice by focusing screening and downstream care on a smaller subset of high-risk patients, increasing the likelihood that identified patients receive meaningful intervention. Evidence from glaucoma patients indicates strong interest in both distress screening and referral for psychological support, underscoring the importance of designing workflows that translate screening into actionable care.28

Identifying psychological distress is only the first step; effective implementation requires clear pathways for follow up. In academic and large health systems, referral to embedded social workers or integrated behavioral health providers represents the ideal pathway. However, in many ophthalmology practices, particularly private or resource limited settings, alternative approaches are needed. Physicians may recommend peer support resources or condition specific support groups for vision loss, such as those offered through the Glaucoma Research Foundation or local vision rehabilitation programs. Coordination with the patient’s primary care provider provides another accessible pathway, enabling further evaluation and referral within an established care relationship. Brief educational materials addressing distress, coping strategies, and available community resources can also be provided at the point of care. As noted in prior work, integrating mental health screening and referral into ophthalmology, even in resource limited settings, can meaningfully improve patient wellbeing and engagement in care.46 Future implementation studies should evaluate the effectiveness of these pathways across diverse practice settings.

Several limitations warrant consideration. First, the sequential approach reduces sensitivity, and some patients with psychological distress may be missed under resource constrained screening, highlighting the need for careful calibration of thresholds as clinical workflows evolve. Second, the prospective validation cohort was predominantly White, which may limit generalizability to more diverse populations, so subgroup differences in performance warrant further study in larger and more representative cohorts. Third, although external validation was conducted using prospective data, model development and validation were performed within a single health system, and evaluation in multisite settings is needed to establish broader generalizability. Finally, reliance on an EHR-derived phenotype in the training phase may not fully capture patient reported distress and may introduce misclassification, underscoring the importance of incorporating additional gold standard PRO data in future work. Continued evaluation in prospective implementation studies will be important to assess workflow integration, patient and provider acceptability, and the impact of screening on downstream clinical care.

In conclusion, this study presents a practical, resource aware framework for screening psychological distress in glaucoma care using EHR-derived AI. While AI-only screening does not replace universal PRO-based screening, an AI-assisted sequential screening strategy provides a meaningful balance between diagnostic performance and feasibility. By substantially reducing burden while improving PPV and specificity, this approach offers a realistic and scalable pathway for integrating psychological risk assessment into routine ophthalmology care and may be adaptable to other chronic diseases with similar workflow and behavioral health challenges.

## Data Availability

All consented data produced in the present study are available upon reasonable request to the authors. Unconsented EHR data is cannot be made available, since it is not ours to share.

## Financial Support

Research reported in this publication was supported by the National Eye Institute of the National Institutes of Health (Bethesda, Maryland) under Awards Number R00EY033027 (SIB), R01EY029885 (FAM), and R01EY036593 (FAM). The sponsor or funding organization had no role in the design or conduct of this research. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.

## Conflicts of Interest

NC: None; YB: None; FF: None; KL: None; EYC: None; HMF: None; DM: None; AAJ: None; TS: None; KM: None; FAM: AbbVie (C), Annexon (C); Carl Zeiss Meditec (C), Enavate Sciences (C), Galimedix (C); Google Inc. (F); Heidelberg Engineering (F), nGoggle Inc. (P), Novartis (F); ONL Therapeutics (C), Perfuse Therapeutics (C), Perceive Bio (C), Stealth Biotherapeutics (C); Stuart Therapeutics (C), Thea Pharmaceuticals (C), Reichert (C, F).; SIB: None.

## Funder Information Declared

National Eye Institute, https://ror.org/03wkg3b53, R00EY033027, R01EY029885, R01EY036593

*   Received May 20, 2026.
*   Revision received May 20, 2026.
*   Accepted May 22, 2026.


*   © 2026, Posted by openRxiv

This pre-print is available under a Creative Commons License (Attribution-NonCommercial-NoDerivs 4.0 International), CC BY-NC-ND 4.0, as described at [http://creativecommons.org/licenses/by-nc-nd/4.0/](http://creativecommons.org/licenses/by-nc-nd/4.0/)

## REFERENCES

1.  1.Tham Y-C, Li X, Wong TY, Quigley HA, Aung T, Cheng C-Y. Global Prevalence of Glaucoma and Projections of Glaucoma Burden through 2040. Ophthalmology. 2014;121(11):2081–2090. doi:10.1016/j.ophtha.2014.05.013
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ophtha.2014.05.013&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24974815&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2026%2F05%2F22%2F2026.05.20.26353679.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000344480400011&link_type=ISI) 

2.  2.Weinreb RN, Aung T, Medeiros FA. The pathophysiology and treatment of glaucoma: a review. JAMA. 2014;311(18):1901–1911.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1001/jama.2014.3192&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24825645&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2026%2F05%2F22%2F2026.05.20.26353679.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000335798100022&link_type=ISI) 

3.  3.Nelson P, Aspinall P, Papasouliotis O, Worton B, O’Brien C. Quality of life in glaucoma and its relationship with visual function. Journal of Glaucoma. 2003;12(2):139–150.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1097/00061198-200304000-00009&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=12671469&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2026%2F05%2F22%2F2026.05.20.26353679.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000182165500009&link_type=ISI) 

4.  4.Kaluza G, Maurer H. Stress and intraocular pressure in open angle glaucoma. Psychology & Health. 1997/10// 1997;12(5):667–675. doi:10.1080/08870449708407413
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1080/08870449708407413&link_type=DOI) 

5.  5.Berchuck S, Jammal A, Mukherjee S, Somers T, Medeiros FA. Impact of anxiety and depression on progression to glaucoma among glaucoma suspects. British Journal of Ophthalmology. 2021;105(9):1244–1249. doi:10.1136/bjophthalmol-2020-316617
    
    [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6MTI6ImJqb3BodGhhbG1vbCI7czo1OiJyZXNpZCI7czoxMDoiMTA1LzkvMTI0NCI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDI2LzA1LzIyLzIwMjYuMDUuMjAuMjYzNTM2NzkuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 

6.  6.Zhang X, Olson DJ, Le P, Lin F-C, Fleischman D, Davis RM. The association between glaucoma, anxiety, and depression in a large population. American Journal of Ophthalmology. 2017;183:37–41.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ajo.2017.07.021&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=28760639&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2026%2F05%2F22%2F2026.05.20.26353679.atom) 

7.  7.Abe RY, Diniz-Filho A, Costa VP, Gracitelli CPB, Baig S, Medeiros FA. The Impact of Location of Progressive Visual Field Loss on Longitudinal Changes in Quality of Life of Patients with Glaucoma. Ophthalmology. 2016/3// 2016;123(3):552–557. doi:10.1016/j.ophtha.2015.10.046
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ophtha.2015.10.046&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26704883&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2026%2F05%2F22%2F2026.05.20.26353679.atom) 

8.  8.Skalicky S, Goldberg I. Depression and quality of life in patients with glaucoma: a cross-sectional analysis using the Geriatric Depression Scale-15, assessment of function related to vision, and the Glaucoma Quality of Life-15. Journal of Glaucoma. 2008;17(7):546–551.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1097/IJG.0b013e318163bdd1&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=18854731&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2026%2F05%2F22%2F2026.05.20.26353679.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000260317800007&link_type=ISI) 

9.  9.Diniz-Filho A, Abe RY, Cho HJ, Baig S, Gracitelli CPB, Medeiros FA. Fast visual field progression is associated with depressive symptoms in patients with glaucoma. Ophthalmology. 2016;123(4):754–759.
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26920097&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2026%2F05%2F22%2F2026.05.20.26353679.atom) 

10. 10.Kong XM, Zhu WQ, Hong JX, Sun XH. Is glaucoma comprehension associated with psychological disturbance and vision-related quality of life for patients with glaucoma? A cross-sectional study. BMJ Open. 2014/5// 2014;4(5):e004632-e004632. doi:10.1136/bmjopen-2013-004632
    
    [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NzoiYm1qb3BlbiI7czo1OiJyZXNpZCI7czoxMToiNC81L2UwMDQ2MzIiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyNi8wNS8yMi8yMDI2LjA1LjIwLjI2MzUzNjc5LmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 

11. 11.Lee B, Fudemberg S, Hark L, et al. Factors contributing to nonadherence to follow-up appointments in a resident glaucoma clinic versus primary eye care clinic. Patient Preference and Adherence. 2016/1// 2016:19–19. doi:10.2147/PPA.S89336
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.2147/PPA.S89336&link_type=DOI) 

12. 12.Salman M, Andrews C, Heister M, Darnley-Fisch D, Newman-Casey PA. Psychosocial Predictors of Glaucoma Medication Adherence among the Support, Educate, Empower (SEE) Personalized Glaucoma Coaching Pilot Study Participants. American Journal of Ophthalmology. 2020/2// 2020;doi:10.1016/j.ajo.2020.02.009
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ajo.2020.02.009&link_type=DOI) 

13. 13.Turner DC, Miranda M, Morris JS, Girkin CA, Downs JC. Acute Stress Increases Intraocular Pressure in Nonhuman Primates. Ophthalmology Glaucoma. 2019/7// 2019;2(4):210–214. doi:10.1016/j.ogla.2019.03.010
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ogla.2019.03.010&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=31799505&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2026%2F05%2F22%2F2026.05.20.26353679.atom) 

14. 14.Abe RY, Silva TC, Dantas I, et al. Can Psychologic Stress Elevate Intraocular Pressure in Healthy Individuals? Ophthalmology Glaucoma. 2020/7// 2020;doi:10.1016/j.ogla.2020.06.011
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ogla.2020.06.011&link_type=DOI) 

15. 15.Cunningham SC, Aizvera J, Wakim P, Felber L. Use of a self-reported psychosocial distress screening tool as a predictor of need for psychosocial intervention in a general medical setting. Social Work in Health Care. 2018/5// 2018;57(5):315–331. doi:10.1080/00981389.2018.1437499
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1080/00981389.2018.1437499&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=29461938&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2026%2F05%2F22%2F2026.05.20.26353679.atom) 

16. 16.Mitchell AJ, Kaar S, Coggan C, Herdman J. Acceptability of common screening methods used to detect distress and related mood disorders—preferences of cancer specialists and non-specialists. Psycho-Oncology. 2008/3// 2008;17(3):226–236. doi:10.1002/pon.1228
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/pon.1228&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=17575565&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2026%2F05%2F22%2F2026.05.20.26353679.atom) 

17. 17.Donovan KA, Jacobsen PB. Progress in the Implementation of NCCN Guidelines for Distress Management by Member Institutions. Journal of the National Comprehensive Cancer Network. 2013/2// 2013;11(2):223–226. doi:10.6004/jnccn.2013.0029
    
    [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NToiam5jY24iO3M6NToicmVzaWQiO3M6ODoiMTEvMi8yMjMiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyNi8wNS8yMi8yMDI2LjA1LjIwLjI2MzUzNjc5LmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 

18. 18.Lichtman JH, Bigger JT, Blumenthal JA, et al. Depression and Coronary Heart Disease. Circulation. 2008/10// 2008;118(17):1768-1775. doi:10.1161/CIRCULATIONAHA.108.190769
    
    [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6MTQ6ImNpcmN1bGF0aW9uYWhhIjtzOjU6InJlc2lkIjtzOjExOiIxMTgvMTcvMTc2OCI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDI2LzA1LzIyLzIwMjYuMDUuMjAuMjYzNTM2NzkuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 

19. 19.Cull A, Gould A, House A, et al. Validating automated screening for psychological distress by means of computer touchscreens for use in routine oncology practice. British Journal of Cancer. 2001/12// 2001;85(12):1842–1849. doi:10.1054/bjoc.2001.2182
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1054/bjoc.2001.2182&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=11747324&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2026%2F05%2F22%2F2026.05.20.26353679.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000173373000005&link_type=ISI) 

20. 20.Fann JR, Berry DL, Wolpin S, et al. Depression screening using the Patient Health Questionnaire-9 administered on a touch screen computer. Psycho-Oncology. 2009/1// 2009;18(1):14–22. doi:10.1002/pon.1368
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/pon.1368&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=18457335&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2026%2F05%2F22%2F2026.05.20.26353679.atom) 

21. 21.Chou NA, Choi EY, Fisher HM, et al. The Validity of the Distress Thermometer in Patients with Primary Open-Angle Glaucoma. Ophthalmology Glaucoma. 2026;doi:10.1016/j.ogla.2026.02.006
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ogla.2026.02.006&link_type=DOI) 

22. 22.Churpek MM, Yuen TC, Park SY, Gibbons R, Edelson DP. Using electronic health record data to develop and validate a prediction model for adverse outcomes on the wards. Critical care medicine. 2014;42(4):841–841.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1097/CCM.0000000000000038&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24247472&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2026%2F05%2F22%2F2026.05.20.26353679.atom) 

23. 23.Roumia M, Steinhubl S. Improving cardiovascular outcomes using electronic health records. Current cardiology reports. 2014;16(2):451–451.
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24408676&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2026%2F05%2F22%2F2026.05.20.26353679.atom) 

24. 24.Neill DB. Using artificial intelligence to improve hospital inpatient care. IEEE Intelligent Systems. 2013;28(2):92–95.
    
    
25. 25.Hu Z, Melton GB, Arsoniadis EG, Wang Y, Kwaan MR, Simon GJ. Strategies for handling missing clinical data for automated surgical site infection detection from the electronic health record. Journal of biomedical informatics. 2017;68:112–120.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.jbi.2017.03.009&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=28323112&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2026%2F05%2F22%2F2026.05.20.26353679.atom) 

26. 26.Simon GE, Johnson E, Lawrence JM, et al. Predicting suicide attempts and suicide deaths following outpatient visits using electronic health records. American Journal of Psychiatry. 2018;175(10):951–960.
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=29792051&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2026%2F05%2F22%2F2026.05.20.26353679.atom) 

27. 27.Berchuck SI, Jammal AA, Page D, Somers TJ, Medeiros FA. A Framework for Automating Psychiatric Distress Screening in Ophthalmology Clinics Using an EHR-Derived AI Algorithm. Translational Vision Science & Technology. 2022;11(10)doi:10.1167/tvst.11.10.6
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1167/tvst.11.10.6&link_type=DOI) 

28. 28.Choi EY, Chou N, Fisher HM, et al. Patient Attitudes toward Distress Screening and Referral in Glaucoma Care. Ophthalmology Glaucoma. 2025;doi:10.1016/j.ogla.2025.09.004
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ogla.2025.09.004&link_type=DOI) 

29. 29.Kingma DP, Ba J. Adam: A Method for Stochastic Optimization. 2014/12// 2014;
    
    
30. 30.Raskutti G, Wainwright MJ, Yu B. Early stopping and non-parametric regression: an optimal data-dependent stopping rule. J Mach Learn Res. 2014;15(1):335–366.
    
    
31. 31.Kroenke K, Spitzer RL. The PHQ-9: a new depression diagnostic and severity measure. Slack Incorporated Thorofare, NJ; 2002. p. 509–515.
    
    
32. 32.Wu Y, Levis B, Riehm KE, et al. Equivalency of the diagnostic accuracy of the PHQ-8 and PHQ-9: a systematic review and individual participant data meta-analysis. Psychological Medicine. 2019;50(8):1368–1380. doi:10.1017/s0033291719001314
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1017/s0033291719001314&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=31298180&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2026%2F05%2F22%2F2026.05.20.26353679.atom) 

33. 33.Gómez-Gómez I, Benítez I, Bellón J, et al. Utility of PHQ-2, PHQ-8 and PHQ-9 for detecting major depression in primary health care: a validation study in Spain. Psychological Medicine. 2022;53(12):5625–5635. doi:10.1017/s0033291722002835
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1017/s0033291722002835&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=36258639&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2026%2F05%2F22%2F2026.05.20.26353679.atom) 

34. 34.Villarreal-Zegarra D, Barrera-Begazo J, Otazú-Alfaro S, Mayo-Puchoc N, Bazo-Alvarez JC, Huarcaya-Victoria J. Sensitivity and specificity of the Patient Health Questionnaire (PHQ-9, PHQ-8, PHQ-2) and General Anxiety Disorder scale (GAD-7, GAD-2) for depression and anxiety diagnosis: a cross-sectional study in a Peruvian hospital population. BMJ Open. 2023;13(9)doi:10.1136/bmjopen-2023-076193
    
    [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NzoiYm1qb3BlbiI7czo1OiJyZXNpZCI7czoxMjoiMTMvOS9lMDc2MTkzIjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjYvMDUvMjIvMjAyNi4wNS4yMC4yNjM1MzY3OS5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 

35. 35.Brauer ER, Lazaro S, Williams CL, et al. Implementing a Tailored Psychosocial Distress Screening Protocol in a Head and Neck Cancer Program. The Laryngoscope. 2021;132(8):1600–1608. doi:10.1002/lary.30000
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/lary.30000&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=34953151&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2026%2F05%2F22%2F2026.05.20.26353679.atom) 

36. 36.Jha MK, Qamar A, Vaduganathan M, Charney DS, Murrough JW. Screening and Management of Depression in Patients With Cardiovascular Disease. Journal of the American College of Cardiology. 2019;73(14):1827–1845. doi:10.1016/j.jacc.2019.01.041
    
    [FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6MzoiUERGIjtzOjExOiJqb3VybmFsQ29kZSI7czo0OiJhY2NqIjtzOjU6InJlc2lkIjtzOjEwOiI3My8xNC8xODI3IjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjYvMDUvMjIvMjAyNi4wNS4yMC4yNjM1MzY3OS5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 

37. 37.Fluss R, Faraggi D, Reiser B. Estimation of the Youden Index and its Associated Cutoff Point. Biometrical Journal. 2005/8// 2005;47(4):458–472. doi:10.1002/bimj.200410135
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/bimj.200410135&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=16161804&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2026%2F05%2F22%2F2026.05.20.26353679.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000231492300005&link_type=ISI) 

38. 38.Bujak R, Daghir-Wojtkowiak E, Kaliszan R, Markuszewski MJ. PLS-Based and Regularization-Based Methods for the Selection of Relevant Variables in Non-targeted Metabolomics Data. Frontiers in Molecular Biosciences. 2016;3doi:10.3389/fmolb.2016.00035
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.3389/fmolb.2016.00035&link_type=DOI) 

39. 39.Shapley LS. 17. A Value for n-Person Games. In: Harold William K, Albert William T, eds. Contributions to the Theory of Games, Volume II. Princeton University Press; 1953:307–318.
    
    
40. 40.Friedman J, Hastie T, Tibshirani R. Regularization Paths for Generalized Linear Models via Coordinate Descent. Journal of statistical software. 2010;33(1):1–22.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.18637/jss.v033.i01&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20808728&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2026%2F05%2F22%2F2026.05.20.26353679.atom) 

41. 41.Wright MN, Ziegler A. ranger : A Fast Implementation of Random Forests for High Dimensional Data in C++ and R. Journal of Statistical Software. 2017;77(1)doi:10.18637/jss.v077.i01
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.18637/jss.v077.i01&link_type=DOI) 

42. 42.Dorogush AV, Ershov V, Gulin A. CatBoost: gradient boosting with categorical features support. 2018/10// 2018;
    
    
43. 43.Deshields TL, Wells-Di Gregorio S, Flowers SR, et al. Addressing distress management challenges: Recommendations from the consensus panel of the American Psychosocial Oncology Society and the Association of Oncology Social Work. CA: A Cancer Journal for Clinicians. 2021;71(5):407–436. doi:10.3322/caac.21672
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.3322/caac.21672&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=34028809&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2026%2F05%2F22%2F2026.05.20.26353679.atom) 

44. 44.Fudemberg SJ, Amarasekera DC, Silverstein MH, et al. Overcoming Barriers to Eye Care: Patient Response to a Medical Social Worker in a Glaucoma Service. Journal of Community Health. 2016/8// 2016;41(4):845–849. doi:10.1007/s10900-016-0162-1
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s10900-016-0162-1&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26860278&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2026%2F05%2F22%2F2026.05.20.26353679.atom) 

45. 45.Donovan KA, Grassi L, Deshields TL, Corbett C, Riba MB. Advancing the science of distress screening and management in cancer care. Epidemiology and Psychiatric Sciences. 2020;29doi:10.1017/s2045796019000799
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1017/s2045796019000799&link_type=DOI) 

46. 46.Demmin DL, Silverstein SM. Visual Impairment and Mental Health: Unmet Needs and Treatment Options. Clinical Ophthalmology. 2020;Volume 14:4229–4251. doi:10.2147/opth.S258783
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.2147/opth.S258783&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=33299297&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2026%2F05%2F22%2F2026.05.20.26353679.atom)