Abstract
Barrett’s esophagus containing intestinal metaplasia predisposes to cancer, yet the majority of cases are undiagnosed. The length of a Barrett’s segment is a key indicator of cancer risk, but measuring it has so far relied on endoscopy, which is expensive and invasive. Cytosponge-TFF3 is a minimally-invasive test that identifies intestinal metaplasia for endoscopic confirmation. We report a machine learning technique to quantify the extent of intestinal metaplasia and predict Barrett’s segment length from whole-slide image tile counts automatically generated from Cytosponge-TFF3 histology slides. Utilizing data from 529 patients, our segment length prediction model achieves an average validation fold accuracy of 0.84. Applying this algorithm to an independent test set of 162 patients from a screening trial shows a precision of 0.90 for identifying short-segment disease. This advance will enable higher-risk patients to be prioritized for endoscopy while saving more than half of Cytosponge-TFF3-positive patients from endoscopy in the screening setting.
Competing Interest Statement
R.C.F is named on patents related to Cytosponge-TFF3 that have been licensed by the Medical Research Council to Covidien (now Medtronic). RCF is a founder and shareholder of Cyted Ltd. FM is a founder, director and shareholder of Tailor Bio.
Clinical Trial
ISRCTN12730505, ISRCTN68382401
Clinical Protocols
https://github.com/markowetzlab/barretts-segment-length-predictor
Funding Statement
This research was supported by Cancer Research UK (FM: C14303/A17197), a Medical Research Council programme grant (RCF: RG84369) and Cambridge University Hospitals NHS Foundation Trust. BEST2 was funded by Cancer Research UK (12088 and 16893). We would also like to thank the NIHR Cambridge Biomedical Research Centre (BRC-1215-20014) and the Experimental Cancer Medicine Centre for their support. AGB acknowledges support from a Gates Cambridge Scholarship from the Bill & Melinda Gates Foundation. FM is a Royal Society Wolfson Research Merit Award holder.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
BEST2: Ethics approval was obtained from the East of England - Cambridge Central Research Ethics Committee (number 10/H0308/71) and registered in the UK Clinical Research Network Study Portfolio (9461). BEST3: Ethics approval was obtained from the East of England - Cambridge Central Research Ethics Committee (number 16/EE/0546).
All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Footnotes
↵a These authors share senior authorship
Data Availability
The data and fully documented code to perform all of the analyses of this paper can be found at this public repository in the form of two illustrative Jupyter notebooks: https://github.com/markowetzlab/barretts-segment-length-predictor
https://github.com/markowetzlab/barretts-segment-length-predictor