## ABSTRACT

**Aims** To assess whether incorporating a machine learning (ML) method for accurate prediction of postoperative anterior chamber depth (ACD) improves the refraction prediction performance of existing intraocular lens (IOL) calculation formulas.

**Methods** A dataset of 4806 cataract patients were gathered at the Kellogg Eye Center, University of Michigan, and split into a training set (80% of patients, 5761 eyes) and a testing set (20% of patients, 961 eyes). A previously developed ML-based method was used to predict the postoperative ACD based on preoperative biometry. This ML-based postoperative ACD was integrated into new effective lens position (ELP) predictions using regression models to rescale the ML output for each of four existing formulas (Haigis, Hoffer Q, Holladay, and SRK/T). The performance of the formulas with ML-modified ELP was compared using a testing dataset. Performance was measured by the mean absolute error (MAE) in refraction prediction.

**Results** When the ELP was replaced with a linear combination of the original ELP and the ML-predicted ELP, the MAEs ± SD (in Diopters) in the testing set were: 0.356 ± 0.329 for Haigis, 0.352 ± 0.319 for Hoffer Q, 0.371 ± 0.336 for Holladay, and 0.361 ± 0.331 for SRK/T which were significantly lower than those of the original formulas: 0.373 ± 0.328 for Haigis, 0.408 ± 0.337 for Hoffer Q, 0.384 ± 0.341 for Holladay, and 0.394 ± 0.351 for SRK/T.

**Conclusion** Using a more accurately predicted postoperative ACD significantly improves the prediction accuracy of four existing IOL power formulas.

## INTRODUCTION

The estimation of postoperative intraocular lens position is essential to intraocular lens power calculations for cataract surgery. Norrby and Olsen have reported that inaccuracy in the prediction of the postoperative anterior chamber depth (ACD) is the number one source of error for postoperative refraction prediction.[1,2] In addition to its vital role in intraocular lens (IOL) formulas, the postoperative ACD is also a critical variable in ray tracing, where the uncertainty in the postoperative ACD directly affects the accuracy of the results. Methods to improve the accuracy of the prediction of postoperative ACD have been studied for decades. In first-generation formulas, the lens position was represented by a constant. Later, more and more preoperative biometric variables such as the axial length and the corneal power were added to calculate the postoperative IOL position. In 1993, Holladay first proposed the name “effective lens position (ELP)” to indicate the location of the lens as it relates to a given optical model of the eye.[3] Although the ELP was constructed to estimate the position of the IOL, practically the ELPs calculated using existing formulas (e.g., SRK/T) are not accurate estimates of the physical location of the IOL.[1,4] This is mainly because the ELPs in those formulas were formulated to account for different formula-specific assumptions and regression results.[1] In view of the limitations of the ELP in existing formulas, recently, more efforts have been devoted to constructing ELPs that better reflect the true location of the IOL.[5–9] New IOL power prediction methods have also been developed based on the new-generation ELP prediction methods, and they have shown that using a more accurately predicted IOL position helps to improve the IOL power prediction accuracy.[5]

It is so far largely unexplored whether inserting a more accurately predicted ELP into existing formulas improves refraction prediction accuracy. This is an important question because: (1) it provides a fast and efficient way to modify and improve on existing IOL formulas whose reliability has been tested extensively. (2) such research can provide supports for translating the continued improvements in accuracy in postoperative ACD prediction into better refraction predictions in published formulas. Several previous studies had modified the ELPs in existing formulas in order to achieve better refraction prediction results in certain cataract cases. Modification of ELP calculation in the Haigis formula for sulcus-implanted IOLs was reported to improve performance.[10] Kim et al. adjusted the ELP estimation in SRK/T formulas with the corneal height in post-refractive patients and achieved satisfactory accuracy.[11] It remains to be explored whether improvement of ELP estimates for in-the-bag IOL placement can improve IOL power calculations of existing formulas for general cataract patients.

Since most recently published IOL formulas (e.g., Barrett Universal II[12,13], Holladay 2, Olsen formula[14]) are either not disclosed to the public or do not have the option to customize the value of ELP during the prediction of postoperative refraction, here we applied our previously developed postoperative ACD prediction methods to a dataset of 4806 cataract surgery patients and replaced the ELP estimates in 4 existing IOL formulas: Haigis, Hoffer Q, Holladay, and SRK/T. We combined our machine learning (ML) prediction of true postoperative ACD with the original ELP estimated by each formula and substituted this updated ELP prediction for each formula. We then compared the refraction prediction performance of each formula using its original and enhanced ELP estimates. The findings reported here demonstrate that existing formulas can benefit from improved methods for predicting true postoperative ACD.

## MATERIALS AND METHODS

### Postoperative ACD prediction machine learning model

In previous work,[15] we developed a machine learning-based postoperative anterior chamber depth (ACD) prediction model, which predicts the postoperative anterior chamber depth (in mm) based on preoperative biometry. Here in the presented study, an ACD prediction machine learning model was trained using the method and dataset (847 patients, 4137 eyes) described in the previous research. The dataset was composed of the preoperative and postoperative biometry measured by the Lenstar LS900 optical biometers (Haag-Streit USA Inc, EyeSuite software version i9.1.0.0) at the University of Michigan’s Kellogg Eye Center. The postoperative ACD was defined as the distance from the front surface of the cornea to the front surface of the intraocular lens (IOL). The postoperative ACD predicted by the machine learning model is referred to as *ELP*_{ML} in this manuscript.

### Data collection

In this study, biometry records were collected using the same approach as for the development of the ML postoperative ACD prediction model.[15] The inclusion criteria were: (1) patients who had cataract surgery (CPT = 66984 or 66982) but no prior refractive surgery and no additional surgical procedures at the time of cataract surgery. (2) the implanted lens was an Alcon SN60WF single-piece acrylic monofocal lens (Alcon, USA). Each case in the dataset corresponds to one operation of a single eye with preoperative and postoperative information. The preoperative information includes the measurements of the axial length (AL), lens thickness (LT), anterior chamber depth (ACD), flat keratometry (K1), steep keratometry (K2), and the average keratometry which was calculated as . The postoperative information includes the postoperative refraction (spherical component SC and cylindrical component CC) where the time when it was recorded was closest to one month (30 days) after surgery. Since the patients were measured in a lane of 10 feet long (3.048 meters), which was shorter than the standard length of 20 feet (6 meters), the SC was adjusted for the vergence distance by adding according to Simpson and Charman’s recommendation.[16] The spherical equivalent (SE) refraction was therefore calculated as *SE refraction = (SC* − 0.1614) + 0.5*CC*. Samples that were used to train the postoperative ACD prediction machine learning model were excluded from the dataset so that the dataset better simulates unseen samples.

The dataset in total consisted of 4806 patients (**Figure 1**). The dataset was split into a training dataset used for the development of the methods and a testing dataset used for performance comparison. 80% of the patients were randomly assigned to the training set, and the rest of the patients (20%) were assigned to the testing set. For patients who had more than one associated case in the testing set (i.e., patients who had both eyes operated on), one case was randomly selected to ensure each patient had the same weight when the prediction performance was evaluated. At the end of this process, the training set had 3845 patients (5761 eyes), and the testing set had 961 patients (961 eyes).

### Linear regression model

We implemented four existing formulas (Haigis, Hoffer Q, Holladay, and SRK/T) in Python based on their publications.[17–24] The existing formulas calculated the effective lens position (*ELP*_{F}) as a function of the preoperative biometry (**Figure 1**): *ELP*_{F} *= f*_{0}*(biometry)*. The predicted ELP (*ELP*_{F}) was then used to predict the postoperative refraction: *refraction = f*_{1}*(ELP*_{F}, *biometry)*. Here, the goal was to reduce the refraction prediction error by replacing *ELP*_{F} with a different value, . Our approach involves two steps: (1) finding the theoretically most optimal ELP values, (2) modeling the most optimal ELP with *ELP*_{F} and the ML-predicted postoperative ACD, denoted *ELP*_{ML}.

In the first step, the most optimal ELP (denoted *ELP*_{BC}) was found by the standard method of back-calculating the ELP when the predicted refraction was set to equal the true refraction (i.e., *f*_{1} *(ELP*_{BC}, *biometry) = true refraction*). In other words, when , the refraction prediction errors of all patients equal zero. More details on the computation of *ELP*_{BC} can be found in **Supplementary materials**.

After the computation of *ELP*_{F}, *ELP*_{ML}, and *ELP*_{BC}, we modeled *ELP*_{BC} using a linear function of *ELP*_{F} and/or *ELP*_{ML} so as to obtain an approximation of the most optimal ELP using available variables. We compared four different approaches of approximating *ELP*_{BC} : (1) Original, : using the original *ELP*_{F}, (2) Formula LR, : using linearly adjusted *ELP*_{F}, (3) ML LR, : using linearly adjusted *ELP*_{F}, (4) Formula & ML LR, : using a linear combination of *ELP*_{F} and *ELP*_{ML}. Here *c*_{1}, *c*_{2}, and *c*_{3} are constants. Outliers with large refraction errors (i.e., *error* ≥ *mean error + 2 · standard deviation* or *error* ≥ *mean error* − *2 · standard deviation*) were excluded for each formula before establishing the linear regression model, in order to obtain better modeling results. The refraction prediction errors were calculated as *error = predicted refraction* − *true refraction*. The linear regression was performed using scikit-learn 0.20.3.

On the testing set, was calculated based on the values of *c*_{1}, *c*_{2}, and *c*_{3} obtained through linear regression. The predicted refraction was calculated as *refraction = f*_{1}*(ELP*_{F} *′, biometry)*. The mean absolute error (MAE), median absolute error (MedAE) and mean error (ME) were calculated for performance comparison.

### A-constant optimization

The A-constants for the formulas were optimized based on the training dataset so that the mean error in refraction prediction was closest to zero. The A-constants were optimized separately for the unmodified formulas and formulas with a modified ELP estimate (see **Supplementary Materials**). The optimized A-constants for the original formulas were: a0 = −0.733, a1 = −0.234, a2 = 0.217 for Haigis, ACD constant = 5.724 for Hoffer Q, surgeon factor = 1.864 for Holladay, and A = 119.089 for SRK/T (**Table S1**).

### Statistical analysis

Linear regression analysis was used to assess the significance of the correlation between *ELP*_{F}, *ELP*_{ML}, and *ELP*_{BC}. To test whether the MAE and ME of different methods were significantly different, a Friedman test followed by a post hoc paired Wilcoxon signed-rank test with Bonferroni correction was used. Statistical significance was defined as the p-value <0.05. All the above analyses were performed with Python 3.7.3.

## RESULTS

### Dataset overview

The cases in the training and testing datasets had a similar distribution according to the summary statistics shown in **Table 1**. As elaborated in **Materials and Methods**, we calculated *ELP*_{F}, *ELP*_{ML}, and *ELP*_{BC} based on the formulas and their optimized A-constants. The mean and standard deviation of the ELPs calculated based on the original formulas were summarized in **Table S2**. *ELP*_{BC} and *ELP*_{F} had similar mean values in contrast to *ELP*_{ML}.

The Pearson correlation coefficients (*R*) between *ELP*_{F}, *ELP*_{ML}, and *ELP*_{BC} were shown in **Table 2**. Three ELP-related variables were positively intercorrelated with each other. The correlation coefficients, *R*, between *ELP*_{BC} and *ELP*_{ML} were the weakest among the three pairs of variables across all formulas.

### Linear regression results on the training set

Linear regression models were established based on the training set and the *R*^{2} of alternative linear models were shown in **Table 3**. The coefficients of the fitted linear regression line are shown in **Table S3**. The mean and SD of the resulting from different models are shown in **Table S4**. For “Formula LR”, the *R*^{2} was larger than that of “ML LR” for all four formulas. For “Formula & ML LR”, the *R*^{2} was larger than that when one of *ELP*_{F} and *ELP*_{ML} was excluded from the linear combination for all four formulas.

### Refraction prediction performance comparison on the testing set

We tested the performance of four scenarios on the testing set and summarized the MAE and SD **Table 4**. The mean error (ME) and median absolute error (MedAE) were shown in **Table S5 and Table S6**. Statistical tests were used to compare the difference in the MAEs of different models (see **Materials and Methods**). Using a linear combination of *ELP*_{F} and *ELP*_{ML}, the refraction prediction results of four existing formulas were significantly improved compared to original *ELP*_{F} (statistical test results shown in **Table S7** and **Table S8**).

We further compared the MAEs of “Original” and “Formula & ML LR” among patients with short, medium, and long axial length (**Table S9**). It was observed that the short and medium axial length groups had a higher percentage decrease in MAE than the long axial length group for Hoffer Q and SRK/T. For Haigis, the medium AL group achieved higher decrease than the other two groups. And for Holladay, the long AL group achieved more decrease in MAE than the other two groups.

## DISCUSSION

In this study, we applied a previously developed machine learning method for postoperative anterior chamber depth (ACD) prediction to an unseen dataset of 4806 cataract surgery patients to assess whether it was possible to improve the performance of existing IOL formulas (Haigis, Hoffer Q, Holladay, and SRK/T) by replacing each formula’s ELP estimate.

We computed three ELP-related quantities: the machine learning-predicted postoperative ACD (*ELP*_{ML}), formula-predicted ELP (*ELP*_{F}), and a back-calculated ELP (*ELP*_{BC}) that minimized the refraction error for each eye in the dataset. They are strongly correlated with each other (**Table 2**), which indicates that (1) *ELP*_{F} and *ELP*_{ML} are both predictive of the most optimal ELP *ELP*_{BC}, (2) *ELP*_{F} and *ELP*_{ML} contain partially overlapping information, which is consistent with our expectation. *ELP*_{ML} is an estimation of the value of the true postoperative ACD. On the other hand, the *ELP*_{F} was designed by the originators of each formula to serve a similar purpose but was based on the theoretical assumptions in each formula. Our findings are consistent with observations of previous studies that the ELP estimates made by IOL formulas were numerically different from the true postoperative ACD.[9]

Using a training dataset of 3845 patients, we sought to evaluate whether the machine-predicted postoperative ACD, *ELP*_{ML}, was able to provide information that could be used to refine each formula’s predicted ELP, *ELP*_{F}. We established regression models between the *ELP*_{ML}, *ELP*_{F}, and *ELP*_{BC} to evaluate whether a linear combination of *ELP*_{ML} and *ELP*_{F} used in place of the original *ELP*_{F} could lower the refraction prediction error. Using the modified ELPs, we obtained significantly lower mean absolute errors (MAE) in refraction prediction compared to the formulas with the original ELPs on the unseen testing set (**Table 4**). Notably, the accurately predicted postoperative ACD (*ELP*_{ML}) alone did not outperform the original ELP (*ELP*_{F}) when it was inserted into the formulas (**Table 4**, row 3 compared to row 1). This is likely because the original method of calculating ELP in each formula compensates for its particular model of the eye and its associated assumptions. Our *ELP*_{ML}, however, does not have any components that compensate for the assumptions and constants in the formulas. On the other hand, *ELP*_{ML} has information about the true postoperative ACD, which it appears can beneficially alter the original ELP estimate.

In this study, the A-constants were optimized separately when *ELP*_{F} was replaced with different . The means of , as shown in **Table S4** were numerically close to those of *ELP*_{F} as shown in **Table S2**. However, in our method, the similarity between and *ELP*_{F} was not among the restrictions and goals of the optimization. The reason that and the original *ELP*_{F} have similar means might be that the other parts of each formula put restrictions on the values of ELP in order to obtain reasonable results. This could also be the reason why *ELP*_{BC} and *ELP*_{F} had similar means as shown in **Table S2**.

The presented method of replacing ELP estimates provides a simple way of improving the prediction performance of existing formulas. While it would be ideal to evaluate this method on modern formulas such as Barrett Universal II or Holladay 2, the absence of published equations for these formulas prevents such a study. As such, we studied the application of the machine learning predicted postoperative ACD in four existing formulas whose mathematical equations were published. Although it awaits to be further validated, similar results can likely be transferred to other refraction prediction methods, since many modern IOL power formulas use predicted postoperative ACD as an intermediate step for predicting postoperative refraction.

In summary, the results of this study demonstrate that a machine learning method for postoperative ACD prediction based on postoperative optical biometry can be incorporated into a variety of existing IOL power formulas to improve their accuracy in refraction prediction.

## Data Availability

Data are not publicly available.

## CONTRIBUTIONS

TL: data analysis, programming, and writing of the manuscript; JDS: data collection; NN: data collection, guidance on method development, and writing of the manuscript

## FUNDING

This work was supported by the Lighthouse Guild, New York, NY (JDS) and National Eye Institute, Bethesda, MD, 1R01EY026641-01A1 (JDS).

## COMPETING INTERESTS

None declared

## DATA AVAILABILITY STATEMENT

Data are not publicly available

## ACKNOWLEDGMENTS

None