Abstract
Identifying tumor DNA mismatch repair deficiency (dMMR) is important for precision medicine. We assessed tumor features, individually and in combination, in whole-exome sequenced (WES) colorectal cancers (CRCs) and in panel sequenced CRCs, endometrial cancers (ECs) and sebaceous skin tumors (SSTs) for their accuracy in detecting dMMR. CRCs (n=300) with WES, where MMR status was determined by immunohistochemistry, were assessed for microsatellite instability (MSMuTect, MANTIS, MSIseq, MSISensor), COSMIC tumor mutational signatures (TMS) and somatic mutation counts. A 10-fold cross-validation approach (100 repeats) evaluated the dMMR prediction accuracy for 1) individual features, 2) Lasso statistical model and 3) an additive feature combination approach. Panel sequenced tumors (29 CRCs, 22 ECs, 20 SSTs) were assessed for the top performing dMMR predicting features/models using these three approaches. For WES CRCs, 10 features provided >80% dMMR prediction accuracy, with MSMuTect, MSIseq, and MANTIS achieving ≥99% accuracy. The Lasso model achieved 98.3%. The additive feature approach with ≥3/6 of MSMuTect, MANTIS, MSIseq, MSISensor, INDEL count or TMS ID2+ID7 achieved 99.7% accuracy. For the panel sequenced tumors, the additive feature combination approach of ≥3/6 achieved accuracies of 100%, 95.5% and 100%, for CRCs, ECs, and SSTs, respectively. The microsatellite instability calling tools performed well in WES CRCs, however, an approach combining tumor features may improve dMMR prediction in both WES and panel sequenced data across tissue types.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
This study was funded by a National Health and Medical Research Council of Australia (NHMRC) project grant GNT1125269 (PI- Daniel Buchanan), supporting the design, analysis, and interpretation of the data. RW is supported by the Margaret and Irene Stewardson Fund Scholarship and by the Melbourne Research Scholarship. DDB is supported by an NHMRC Investigator grant (GNT1194896) and University of Melbourne Dame Kate Campbell Fellowship. PG is supported by the University of Melbourne Research Scholarship. MAJ is supported by an NHMRC Investigator grant (GNT1195099). AKW is supported by an NHMRC Investigator grant (GNT1194392). JLH is supported by the University of Melbourne Dame Kate Campbell Fellowship. OS is supported by an NHMRC Senior Research Fellowship (GNT1136119). DEO is supported by a Canadian Institutes of Health Research (CIHR) Post-doctoral Fellowship. BP is supported by a Victorian Health and Medical Research Fellowship from the Victorian Government.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
1. Human Research Ethics Committee of the University of Melbourne gave ethical approval of this work (HREC#1750748). 2. Institutional review boards at each family cancer clinic gave ethical approval of this work. 3. Institutional human ethics committee of the ACCFR site gave ethical approval of this work. 4. Institutional human ethics committee of the OFCCR site gave ethical approval of this work. 5. Human research ethics committee of the Royal Melbourne Hospital gave ethical approval of this work (HREC 12/19). 6. Human research ethics committee of the Western Hospital Footscray gave ethical approval of this work (HREC 12/19). 7. Human research ethics committee of the University of Melbourne gave ethical approval of this work (HREC#1648355). 8. Institutional human ethics committee of the Sullivan Nicolaides Pathology service in Brisbane gave ethical approval of this work.
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Data Availability
The datasets generated during and/or analyzed during the current study are available from the corresponding author on reasonable request.