PT  - JOURNAL ARTICLE
AU  - Huang, Yu-Ping
AU  - Lee, Shyh-Yuan
TI  - An Effective and Reliable Methodology for Deep Machine Learning Application in Caries Detection
AID  - 10.1101/2021.05.04.21256502
DP  - 2021 Jan 01
TA  - medRxiv
PG  - 2021.05.04.21256502
4099  - http://medrxiv.org/content/early/2021/05/07/2021.05.04.21256502.short
4100  - http://medrxiv.org/content/early/2021/05/07/2021.05.04.21256502.full
AB  - Early detection of dental caries has been one of the most prominent research topics of the last few decades. Conventional examination through visual-tactile inspection and radiography can be inaccurate and destructive to tooth structure. The development of Optical Coherence Tomography (OCT) has given dentistry an alternative, non-invasive diagnostic technique that numerous studies have shown to offer better sensitivity and specificity. The growing popularity of Artificial Intelligence (AI) has also made image-based detection and decision-making more efficient and effective. Previous studies that attempted to employ AI for caries assessment did not incorporate high-quality data and were therefore unable to produce valid and reliable results. This study highlights the importance of high-quality data and aims to address this issue by implementing an improved methodology for the AI-based automated detection and diagnosis of dental caries. A two-phase study was carried out to explore different methods for caries detection. In the first phase, a survey of experienced clinicians verified that OCT is a better imaging technique than radiography. In the second phase, Convolutional Neural Networks (CNNs) surpassed the accuracy of human clinicians. The data were preprocessed and labelled against a rigorously defined ground truth derived from Micro-CT. Statistical analysis was based mainly on the weighted Kappa coefficient. The results suggested that OCT (κ = .699, SD = .090) was more accurate than radiography (κ = .407, SD = .049), and that CNNs (κ = .860, SD = .049) scored higher than clinicians (κ = .679, SD = .113), both at the .05 significance level. The best result was achieved by ResNet-152, with a diagnostic accuracy of 95.21% and a sensitivity of 98.85%. 
The improved methodology of this study is intended to pave the way for future studies of AI applications in dentistry. Competing Interest Statement: The authors have declared no competing interest. Funding Statement: This research was partially supported by the Ministry of Science and Technology (MOST 108-2813-C-010-040-B). Author Declarations: I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained. Yes. The details of the IRB/oversight body that provided approval or exemption for the research described are given below: The protocol has been approved and supervised by the Institutional Review Board of Taipei Veterans General Hospital. After review by the Human Research Protection Center, the implementation of the protocol is approved. All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived. Yes. I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance). Yes. I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable. Yes. Data Availability: The data that support the findings of this study are available on request from the corresponding author (Lee, SY). The data are not publicly available because they contain information that could compromise research participant privacy/consent.