Abstract
Background Computer-aided detection (CAD) software analyzes chest X-rays for features suggestive of tuberculosis (TB) and provides a numeric abnormality score. However, estimates of CAD accuracy for TB screening are hindered by the lack of confirmatory data among people with lower X-ray scores, including those without symptoms. Additionally, the appropriate X-ray score thresholds for obtaining further testing may vary according to population and client characteristics.
Methods We screened for TB in Ugandan individuals aged ≥15 years using portable chest X-rays with CAD (qXR v3). Participants were offered screening regardless of their symptoms. Those with X-ray scores at or above a threshold of 0.1 (range, 0–1) were asked to provide sputum for Xpert Ultra testing. We estimated the diagnostic accuracy of CAD for detecting Xpert-positive TB when using the same threshold for all individuals (under different assumptions about TB prevalence among people with X-ray scores <0.1), and compared this estimate to age- and/or sex-stratified approaches.
Findings Of 52,835 participants screened for TB using CAD, 8,949 (16.9%) had X-ray scores ≥0.1. Of 7,219 participants with valid Xpert Ultra results, 382 (5.3%) were Xpert-positive, including 81 with trace results. Assuming 0.1% of participants with X-ray scores <0.1 would have been Xpert-positive if tested, qXR had an estimated AUC of 0.92 (95% confidence interval 0.90-0.94) for Xpert-positive TB. Stratifying X-ray score thresholds according to age and sex improved accuracy; for example, at 96.1% specificity, estimated sensitivity was 75.0% for a universal threshold (of ≥0.65) versus 76.9% for thresholds stratified by age and sex (p=0.046).
Interpretation The accuracy of CAD for TB screening among all screening participants, including those without symptoms or abnormal chest X-rays, is higher than previously estimated. Stratifying X-ray score thresholds based on client characteristics such as age and sex could further improve accuracy, enabling a more effective and personalized approach to TB screening.
Funding National Institutes of Health
Evidence before this study The World Health Organization (WHO) has endorsed computer-aided detection (CAD) as a screening tool for tuberculosis (TB), but the appropriate X-ray score that triggers further diagnostic evaluation for tuberculosis (the “CAD threshold”) varies by population. The WHO recommends determining the appropriate CAD threshold for specific settings and populations and considering unique thresholds for specific populations, including older age groups, among whom CAD may perform poorly. We performed a PubMed literature search for articles published until September 9, 2024, using the search terms “tuberculosis” AND (“computer-aided detection” OR “computer aided detection” OR “CAD” OR “computer-aided reading” OR “computer aided reading” OR “artificial intelligence”), which resulted in 704 articles. Among them, we identified studies that evaluated the performance of CAD for tuberculosis screening and additionally reviewed relevant references. Most prior studies reported areas under the curve (AUCs) ranging from 0.76 to 0.88 but limited their evaluations to individuals with symptoms or abnormal chest X-rays. Some prior studies identified subgroups (including older individuals and people with prior TB) among whom CAD had lower-than-average AUCs, and authors discussed how the prevalence of such characteristics could affect the optimal value of a population-wide CAD threshold; however, none estimated the accuracy that could be gained by adjusting CAD thresholds between individuals based on personal characteristics.
Added value of this study In this study, all consenting individuals in a high-prevalence setting were offered chest X-ray screening, regardless of symptoms, if they were ≥15 years old, not pregnant, and not on TB treatment. A very low X-ray score cutoff (qXR v3 TB score of 0.1 on a 0-1 scale) was used to select individuals for confirmatory sputum molecular testing, enabling the detection of radiographically mild forms of TB and facilitating comparisons of diagnostic accuracy at different CAD thresholds. To assess CAD performance among all X-ray screening participants capable of providing sputum, a TB prevalence of 0.1% was assumed for individuals with X-ray scores <0.1 who were not offered sputum testing. Using this symptom-neutral evaluation of CAD with expansive criteria for bacteriologic testing, we estimated an AUC of 0.92, and we found that the qXR v3 threshold needed to decrease to under 0.1 to meet the WHO target product profile goal of ≥90% sensitivity and ≥70% specificity. CAD performance decreased when a higher prevalence of TB was assumed among people with X-ray scores <0.1. Compared to using the same threshold for all participants, adjusting CAD thresholds by age and sex strata resulted in a 1 to 2% increase in sensitivity without affecting specificity.
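To make the prevalence assumption concrete, the following back-of-envelope sketch uses the counts reported in the Findings to show how many Xpert-positive individuals a 0.1% assumed prevalence implies among participants with X-ray scores <0.1, and what that does to the sensitivity ceiling at the 0.1 threshold. The function name is hypothetical, and the calculation is a deliberate simplification: it treats the 382 observed Xpert-positive results as all TB at scores ≥0.1 and ignores participants above the threshold who lacked valid Xpert results, so it is not the study's estimation procedure.

```python
def implied_positives_below_threshold(n_screened, n_at_or_above, assumed_prevalence):
    """Return (count below threshold, expected Xpert-positive count among them).

    Hypothetical helper for illustration; applies one assumed prevalence
    uniformly to everyone who scored below the sputum-testing threshold.
    """
    n_below = n_screened - n_at_or_above
    return n_below, n_below * assumed_prevalence


# Counts taken from the Findings; 0.1% is the assumed prevalence at scores <0.1.
n_below, expected_missed = implied_positives_below_threshold(52_835, 8_949, 0.001)

# Crude sensitivity ceiling at the 0.1 threshold (simplifying assumption:
# the 382 observed positives are the only positives at scores >= 0.1).
observed_positives = 382
crude_sensitivity = observed_positives / (observed_positives + expected_missed)

print(n_below, round(expected_missed, 1), round(crude_sensitivity, 3))
```

Under these simplifications the ceiling falls just short of 90%, which is in line with the statement above that the threshold would need to drop below 0.1 to reach the WHO sensitivity target; raising the assumed prevalence lowers the ceiling further.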
Implications of all the available evidence To obtain high sensitivity with CAD screening in high-prevalence settings, low score thresholds may be needed. However, countries with a high burden of TB often do not have sufficient resources to test all individuals above a low threshold. In such settings, adjusting CAD thresholds based on individual characteristics associated with TB prevalence (e.g., male sex) and those associated with false-positive X-ray results (e.g., old age) can potentially improve the efficiency of TB screening programs.
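The idea of stratified thresholds can be sketched in a few lines. The example below compares one universal threshold against thresholds that depend on age and sex, using a tiny hand-made dataset. Everything in it is hypothetical: the `Person` type, the eight records, the stratum boundaries, and the threshold values (other than the 0.65 universal threshold quoted in the Findings) are invented for illustration and are not the study's data or its fitted thresholds.

```python
from typing import Callable, NamedTuple, Sequence, Tuple


class Person(NamedTuple):
    score: float     # CAD abnormality score on a 0-1 scale
    sex: str         # "M" or "F"
    age: int         # years
    xpert_pos: bool  # reference-standard result


def sens_spec(people: Sequence[Person],
              threshold_for: Callable[[Person], float]) -> Tuple[float, float]:
    """Sensitivity and specificity when each person is flagged at their own threshold."""
    tp = fn = tn = fp = 0
    for p in people:
        flagged = p.score >= threshold_for(p)
        if p.xpert_pos:
            tp += flagged
            fn += not flagged
        else:
            fp += flagged
            tn += not flagged
    return tp / (tp + fn), tn / (tn + fp)


def universal(p: Person) -> float:
    return 0.65  # same threshold for everyone


def stratified(p: Person) -> float:
    # Hypothetical strata: stricter for older people (more false positives),
    # more lenient for younger men (higher TB prevalence).
    if p.age >= 55:
        return 0.80
    if p.sex == "M":
        return 0.50
    return 0.65


# Synthetic toy records, constructed by hand to show the mechanism.
TOY = [
    Person(0.90, "M", 30, True),
    Person(0.55, "M", 40, True),
    Person(0.70, "F", 25, True),
    Person(0.30, "F", 35, True),
    Person(0.70, "M", 70, False),
    Person(0.60, "M", 35, False),
    Person(0.20, "F", 60, False),
    Person(0.10, "M", 20, False),
]

print("universal :", sens_spec(TOY, universal))
print("stratified:", sens_spec(TOY, stratified))
```

In this toy set the stratified rule picks up one additional true positive (a younger man scoring 0.55) while trading one false positive for another, so sensitivity rises at unchanged specificity. That is the mechanism described above, shown at an exaggerated scale; the gain reported in the study was on the order of 1 to 2%.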
Competing Interest Statement
The authors have declared no competing interest.
Clinical Trial
NCT05285202
Funding Statement
This work was supported by the National Institutes of Health (grant numbers R01HL138728 [to E.A.K. and D.W.D.], T32 AI007291 and K23AI185268 [to J.S.]). The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
The study was approved by the Institutional Review Boards at the Johns Hopkins University School of Medicine and Makerere University School of Public Health. Informed consent (or assent with parental consent) was obtained from all study participants.
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
Footnotes
Assumptions about TB prevalence values among individuals with X-ray scores <0.1 were changed in figures/tables and clarified throughout the text. Additional minor changes include an updated title, funding statement, and dates of data collection.
Data Availability
The de-identified dataset of participant demographics, X-ray scores, and Xpert results used for this study and a data dictionary will be available upon reasonable request. Data sharing will be limited to non-commercial research use only. Requests should include a proposal outlining the intended use and methodology and will be subject to review and approval. Requests can be directed to ekendall@jhmi.edu.





