Estimated durations of asymptomatic, symptomatic, and care-seeking phases of tuberculosis disease

Ratios of bacteriologically-positive tuberculosis prevalence to notification rates are used to characterise typical durations of tuberculosis disease, but have not accounted for asymptomatic periods prior to careseeking. We developed novel statistical models to estimate progression from initial bacteriological-positivity including smear conversion, symptom onset and initial care-seeking and fitted them to tuberculosis prevalence survey and notification data (one subnational and 11 national datasets) within a Bayesian framework. Asymptomatic tuberculosis duration was in the range 4 - 8 months for African countries; three countries in Asia showed longer durations of > 1 year. Care-seeking typically began half-way between symptom onset and notification. Our method also estimated smear progression rates and case-detection ratios. We found evidence for higher case-detection ratios and much shorter durations of tuberculosis for people living with HIV. To eradicate tuberculosis transmission, greater gains may be achieved by proactively screening people without symptoms through active case finding interventions.


TB prevalence surveys
We analysed data from eleven national and one sub-national settings with a TB prevalence survey conducted after 2010. We excluded TB prevalence surveys that relied solely on symptom screening to decide who to sample for bacteriological testing and restricted to the population aged 15 years or older.
Summaries of prevalence survey data are shown in Table 1. The proportion of prevalent cases that was asymptomatic ranged from 30% in Malawi, to 70% in Cambodia, and the proportion of prevalent cases that were sputum smear positive varied from 23% in Vietnam to 84% in Tanzania. Definitions used for symptoms, criteria used for bacteriological testing, and bacteriological testing methods are summarized in Table 1-table supplement 1. The sub-national TB prevalence survey from Blantyre, Malawi and the national survey from Kenya [15] included individual data, allowing analysis by smear status, age, and HIV status [14]. In Blantyre, Malawi, the prevalence survey was conducted as part of a community cluster randomised trial across approximately 750,000 adults in 72 densely populated neighbourhoods of the city. For Blantyre, we aligned symptom definition with that of Kenya by considering cough of >=2 weeks, considering cough of any duration in a sensitivity analysis.

Definitions
Unless otherwise stated, we used the definition of TB symptoms and health-care seeking adopted by each TB prevalence survey. Asymptomatic TB was taken to be bacteriologically-positive pulmonary TB in those reporting no TB symptoms, and we assumed no health-care seeking for TB during this phase. Participants with prevalent TB who were already taking TB treatment were excluded. We assumed TB treatment initiation, TB confirmation, and case notification are identical events for the modelling.

TB notification data
We used TB notification data and treatment outcome data from the WHO TB database for the countries with national prevalence surveys [12][13][14][15][16][17][18][19][20][21][22]. We included new and relapse TB regardless of smear status since 2013. We applied the proportions of HIV among TB notifications by sex, age group, and smear status to disaggregate the WHO notification data. Where notification data exhibited level shifts suggestive of changes in reporting, only consistent, sequential data were used.
For Kenya, we obtained data simultaneously stratified by HIV and age group from the National Tuberculosis Programme. For Blantyre, we obtained individual-level notification data from an enhanced surveillance . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted April 4, 2021. ;https://doi.org/10.1101https://doi.org/10. /2021 system. We matched the age-groups of the Kenya data to the finest scale of WHO case notification data; for Blantyre, Malawi, with lower case counts, we used two age groups (15 to 49, and 50+ years).

Demographic data
In order to model mortality and project national numbers of prevalent cases to relate to notifications, we used age-and sex-specific background mortality rates and population sizes from 2019 World Population Prospect (WPP) data, using the mid-year population estimates [23]. For Blantyre, we rescaled the Malawi demographic data to local 2008 and 2018 population censuses [24,25].
The demographic and notification data to replicate the analysis are available at Source Data.

Estimation of TB progression and care-seeking behaviour
We developed three state transition models of TB case-detection to match available data, and fitted them to estimate parameters driving TB progression and care-seeking behaviour. Figure 1 shows that all models contain an asymptomatic phase and progress to the symptomatic phase. Model A presents a basic transition between asymptomatic/symptomatic phases; Model B divides the symptomatic phase by presence of careseeking intentions, and Model C details the conversion between smear-negative and smear-positive and careseeking by smear status. Model B is only applicable to the prevalence surveys that reported care-seeking behaviours. We did not combine Model B and C because the symptoms, smear status, and care-seeking behaviours prevalence surveys were not cross-tabulated in the published reports. We constructed a likelihood depending on these states to fit to TB prevalence and case-notification data. For Kenya and Blantyre data with Model C, we incorporated proportional hazard models relating the rate of symptom onset and the careseeking rates by smear status to covariates of age, sex, and HIV status.

Metrics calculated
For each setting, we used posterior samples to calculate: the TB incidence; the mean times from incident disease to development of symptoms (i.e. the duration of asymptomatic disease), to case-seeking initiation (where applicable), and to notification; the case detection ratio (CDR) as the ratio of incidence and notification rates; and the proportion of TB cases reaching each stage of care. Aggregated quantities were computed weighted by model-estimated incidence. We also output a joint posterior for the proportion of symptomatic cases initially smear positive and the subsequent smear-conversion rate. We report means and 95% credible intervals (CrIs).

Ethical approval
The Blantyre, Malawi TB prevalence survey received ethical approval from the College of Medicine, University of Malawi, and from London School of Hygiene and Tropical Medicine.

Duration of asymptomatic disease, time to care-seeking initiation and time to case-detection
The national estimates for duration of asymptomatic disease typically ranged around 3-8 months. However, three countries in Asia (Lao PDR, Cambodia, Philippines) showed longer durations of over one year (see Figure 2). These countries also had long total durations: 22 months for Cambodia and Philippines, and three years for Lao PDR. Only one country (Ethiopia) had a total duration lower than 12 months. In the seven countries where we could estimate the delay to care-seeking initiation, apart from the Philippines (16 months), delay varied between 1.2 (0.7 -1.9) months for ) months for Malawi. The asymptomatic phase represented between 27% and 63% of time as a prevalent case, and treatment initiation occurred between 15% and 59% of the way between symptom onset and notification (or death or self-cure). Figure 3 shows the joint posterior probability densities of smear conversion rates and the proportion of symptomatic cases that were smear-positive at symptom onset. Estimates of smear conversion rates were linearly related to the initial proportion of symptomatic cases that were smear-positive. Pooling weighted by TB notifications in 2019 (excluding Malawi and Vietnam as outliers) yielded a gradient of 0.239 years, and a pooled intercept of 53.4% smear positive (for zero smear conversion rate). Table 1 shows the empirical estimates of total duration and total asymptomatic duration based on P:N ratios. Empirical estimates of duration systematically overestimate the durations, because they implicitly assume all episodes of TB disease end in notification. Modelling self-cure and death leads to shorter estimates of duration and differences in estimated CDRs from WHO estimates. The proportions of incident TB who die or self-cure before being detected are shown in Table 1. Estimates of TB incidence are also distinct but comparable to WHO incidence estimates. We estimated rates of TB incidence change from a decline of 7.3% (95% CrI: 7.1% -7.5%) per year in Ethiopia up to an increase of 8.3% (95% CrI: 8.2% -8.4%) per year in the Philippines. Figure 4 shows care cascade estimates for the year of national prevalence surveys, showing the proportion of incident TB cases that develop symptoms, begin seeking care, initiate treatment, and finally successfully complete treatment. There were notable differences between countries, with the proportion initiating TB treatment ranging from 30% (95% CrI: 24% -38%) in Lao PDR to 83% (95% CrI: 72% -88%) in Ethiopia. Versions of these figures with cohort timings are shown in Figure 4-figure supplement 1.

Sex, age, and HIV
For Kenya and Blantyre, Malawi, we were able to examine results stratified by sex, age, and HIV-status (see Table 2). For Kenya we found total durations of 18.8 months (95% CrI: 16.1 -21.8) for men and 15.4 months (95% CrI: 13.0 -18.1) for women, and for Blantyre 9.9 months (95% CrI: 6.7 -13.8) for men and 6.6 months (95% CrI: 3.5 -10.2) for women, in line with previous work suggesting longer durations for men [3]. Adjusted results for Kenya suggested a longer asymptomatic phase in men than women; however, there were no clear differences in unadjusted results and results from Blantyre (see risk ratios in Table 2). There were also no clear patterns with respect to age. However, HIV status had a strong effect, with asymptomatic and symptomatic durations being shorter in PLHIV than those that were HIV negative. The total duration in PLHIV was 3.7 months (95% CrI: 1.9 -6.0) for Blantyre and 7.8 months (95% CrI: 5.8 -10.2) for Kenya.
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted April 4, 2021. ;https://doi.org/10.1101https://doi.org/10. /2021 We found that CDRs were consistently higher in PLHIV than in those without HIV infection, even with higher deaths rates for PLHIV with TB assumed ( Table 3).
Without HIV-specific case notification data at the national scale, Figure 5 demonstrates the relationship between the asymptomatic duration estimates and the PLHIV among case-notification. We found no clear pattern of the association.

Sensitivity Analyses
Across all mortality scenarios considered, differences in duration were around one month, except for Lao PDR ( Table 4). The highest levels of assumed mortality led to up to a 7 percentage-point lower casedetection ratio than for the lowest assumed mortality. Sensitivity analysis assuming an extremely high excess TB mortality for PLHIV ( Table 3 and Table 4-table supplement 2) led to up to 15 percentage-point lower CDRs and correspondingly higher incidence rates. However, for both Kenya and Blantyre, the case-detection ratios were still not lower than that of HIV negative populations. Considering a broader range of symptoms resulted in shorter durations of the asymptomatic phase but longer time-spent on care-seeking given the TB prevalence survey and notification data ( Table 4-table supplement 3).
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted April 4, 2021. ;https://doi.org/10.1101https://doi.org/10. /2021 Discussion By calibrating simple models of TB disease and care-seeking progression to prevalence survey and notification data, we were able to infer the typical duration of asymptomatic bacteriologically-positive TB to be around six months. However, there is large variation between settings, with longer durations of asymptomatic disease in the included Asian settings. The asymptomatic phase typically comprised around half the total time before notification. For countries that reported care-seeking history in their prevalence surveys, we were able to estimate the average timing of initial care-seeking, finding this was approximately halfway between symptom onset and ultimate diagnosis. We found limited evidence of age-dependence in overall durations, with a hint of longer durations in older age groups, but did find substantially shorter disease durations in people living with HIV. Our analysis of TB care-seeking, diagnosis and treatment outcome cascades also showed substantial differences between settings, and meaningful losses before symptom onset and care-seeking. Taken together, these findings suggest that important opportunities exist to identify people with tuberculosis earlier in their disease course through screening and community-based active case finding interventions, potentially improving individual outcomes and reducing transmission.
TB disease prior to care-seeking, including the asymptomatic phase, is beyond the reach of passive casefinding: improvements in diagnosis and retention at the clinic will not shorten these delays nor avert deaths during this phase. Cases who are truly asymptomatic can only be found by active approaches to screening based on exposure, chest X-ray or other novel rule-in tests -based in either the community or clinics. The symptomatic period prior to care seeking would be amenable to intervention by symptom-based screening approaches and potentially health-messaging or improved access to care. Understanding the duration of these phases and their contributions to transmission and mortality is therefore key to understanding the relative potential benefits of more active approaches to finding cases [27]. Although, between 25% and 50% of the total duration (and likely a higher proportion of transmission and pre-treatment mortality) occurs once those with TB have started seeking care, when improvements in passive case-finding and reductions in pretreatment loss-to-follow-up (which has been observed a 13% in Asia and 18% in African settings) can help [28]. While there is considerable consistency between settings in our results, the reasons for the longer durations of asymptomatic disease in Asian settings are not clear, and seem not to directly relate to HIV prevalence. These may include cultural factors around recognition of disease or willingness to report it, differences in strains, or differences in cough associated with smoking or air quality which may mask TB symptoms. It is noteworthy that these settings also have longer total durations of TB disease.
Our method also generates estimates of TB incidence: using literature estimates of TB disease mortality and self-cure rates, we inferred the proportions that fail to reach notification. Our incidence estimates are similar to those of WHO, which is not surprising since these are based on the same data. Our estimates have lower uncertainty, which may reflect under-estimation of uncertainty by our approach in not including uncertainty in TB mortality rates.
Our approach has quantified aspects of TB natural history that are potentially important but poorly characterised, relying on historical or anecdotal data. For example, mathematical models of TB transmission have from the outset often included progression in smear status [29]. In analysis of cross-sectional data, this smear conversion rate is confounded with the proportion of individuals who are smear-positive from very early in their disease. Our posterior shows high correlation between these parameters, but the gradient of . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted April 4, 2021. ;https://doi.org/10.1101https://doi.org/10. /2021 initial smear-positivity with respect to conversion rate is broadly consistent between settings. Our pooled estimate should be useful for modellers wishing to parametrize joint uncertainty around these features.
There has been an increased realisation that TB disease is less dichotomous and more dynamic than conventional paradigms have allowed [8]. Our findings complement work in this area by describing the dynamics through asymptomatic disease and different levels of infectiousness while symptomatic. (We have used the term asymptomatic TB as synonymous with the term sub-clinical TB used in some discussions.) The substantial fraction of time spent asymptomatic means there is the potential for this phase to make important contributions to TB transmission; the key unknown factor being the relative infectiousness of the asymptomatic phase [9]. While some evidence suggests that the natural history of TB disease may include progression and remission [8], we have assumed a one-way progression through the asymptomatic phase, smear status, and care-seeking -we were limited here by our reliance on cross-sectional data. However our transitions between these cruder states could be considered as characterising the net average transitions in a population experiencing more complex dynamics among sub-states. Similarly, there may be heterogeneity in disease-course, which we were not able to consider.
HIV-associated TB has long been known to have a more rapid TB disease progression, but data to quantify this has been limited, and most models and estimates of TB burden assume the same case detection ratio in PLHIV as among HIV-negative people. We were able to provide the first empirical estimate of this quantity. Our sensitivity analysis considered from zero excess TB/HIV mortality through to an extremely high level, and consistently found TB case-detection ratios for PLHIV that were higher than for HIV-negative people. We were not able to stratify by ART status, but increases in ART coverage and regular healthcare visits for people on ART may have helped improve TB case detection among PLHIV. We found an asymptomatic TB disease duration shorter than five months in people living with HIV, and the delays from symptom onset to case-detection also much shorter than those for HIV-negative people.
Comparing our model estimates of the delays from TB onset to notification with empirical estimates from P:N ratios, we find that empirical estimates systematically overestimate duration. This is because the empirical estimator implicitly assumes that all prevalent TB ends in notification, whereas in the model (and reality) self-cure and death are possible outcomes. These unobserved competing events add to the rate at which individuals cease being prevalent, thus shortening the prevalent duration. The discrepancy between empirical and model estimates is largest where the CDR is lowest. Under-notification, e.g. in settings with large private sectors can further overestimate durations.
The duration of symptomatic TB is necessarily shorter than the total duration. Empirical estimates of TB duration based on TB prevalence surveys that used symptoms as an entry point to bacteriological testing may therefore be biased. Where symptoms alone are used to rule-in, the empirical P:N ratio is measuring the duration of the symptomatic phase rather than the total duration of TB disease. The range of symptoms used can also affect estimated durations, with broader definitions shortening the duration of asymptomatic phase. All the surveys we used either applied bacteriological testing to everyone, or used chest X-ray (potentially OR symptoms) to rule-in for bacteriological testing.
A limitation of our approach is that it relies on self-report for defining whether individuals have symptoms or have begun seeking care. Stigma, fear of diagnosis, or expectations of treatment may all affect and influence participants' willingness to report symptoms, even if they are recognised; recognition of symptoms may itself be influenced by cultural and epidemiological factors [30,31]. Recall bias may play a role in limiting the accuracy of reports of care-seeking. We were limited in our ability to detect effects that may be real, such as age-dependence in durations, and to investigate smear-status while asymptomatic, due to the relatively small . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted April 4, 2021. numbers of TB cases found in prevalence surveys. Our projections of growth or decline rely solely on trends in notification data; where notification trends reflect changes in routine case finding performance, our projections of underlying incidence and prevalence trends may be incorrect. Finally, while most of our parameters have incorporated prior uncertainty, being treated as random variables in a Bayesian framework, we treated TB mortality and self-cure as fixed, which may under-estimate uncertainty in our estimates.
A strength of our approach is that we have adopted a rigorous Bayesian framework, which includes uncertainty in inputs and outputs and could easily be generalised for use on other national or subnational data. Unlike most approaches to P:N analyses, we do not make an assumption of equilibrium. Our assumption is the slightly more general and realistic constant rate of decline, which is an additional output.

Conclusion
Active approaches to TB screening and case-finding should be considered in high TB burden settings, where up to 4 to 12 months with infectious bacteriologically-positive disease is spent without symptoms, and a comparable time again with symptoms before care-seeking initiation.  is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted April 4, 2021 29. Dye C, Garnett GP, Sleeman K, Williams BG. Prospects for worldwide tuberculosis control under the . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted April 4, 2021 (5):1114-6.
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted April 4, 2021. ; https://doi.org/10.1101/2021.03.17.21253823 doi: medRxiv preprint White boxes are all bacteriologically-positive TB accessible to prevalence surveys, with structure matched to available data. Models are additionally stratified by sex, and for Kenya and Blantyre, Malawi by age and HIV in addition.
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted April 4, 2021. ; https://doi.org/10.1101/2021.03.17.21253823 doi: medRxiv preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted April 4, 2021. ; https://doi.org/10.1101/2021.03.17.21253823 doi: medRxiv preprint : symptomatic TB without care-seeking attempts; : symptomatic TB started care-seeking. (B) steady declining condition, which has identical percentage annual decline in prevalence, incidence rate and notification rate for all states: : overall active TB prevalence; is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted April 4, 2021. ; https://doi.org/10.1101/2021.03.17.21253823 doi: medRxiv preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted April 4, 2021. ;https://doi.org/10.1101https://doi.org/10. /2021   is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted April 4, 2021. ;https://doi.org/10.1101https://doi.org/10. /2021  . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted April 4, 2021. ;https://doi.org/10.1101https://doi.org/10. /2021  . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted April 4, 2021. ;https://doi.org/10.1101https://doi.org/10. /2021  . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted April 4, 2021. ;https://doi.org/10.1101https://doi.org/10. /2021  The values are the fraction of the incident cohort reaching each stage. The second bars may be missing if the TB prevalence survey did not report results on care-seeking behaviour.
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted April 4, 2021. ;https://doi.org/10.1101https://doi.org/10. /2021  The values are the fraction of those beginning the previous stage who reach the labelled stage. The second bars may be missing if the TB prevalence survey did not report along with the results of care-seeking behaviour. The treatment complete here were sourced from the WHO TB outcome data, including treatment completion and treatment success.
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted April 4, 2021. ;https://doi.org/10.1101https://doi.org/10. /2021  . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted April 4, 2021. ;https://doi.org/10.1101https://doi.org/10. /2021 . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted April 4, 2021. ;https://doi.org/10.1101/2021 doi: medRxiv preprint . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted April 4, 2021. ;https://doi.org/10.1101/2021 doi: medRxiv preprint . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted April 4, 2021. ;https://doi.org/10.1101/2021 doi: medRxiv preprint . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted April 4, 2021. ;https://doi.org/10.1101https://doi.org/10. /2021 *** Case-detection ratio is defined as the ratio of notified cases and the estimated incident cases in percentage . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted April 4, 2021. ;https://doi.org/10.1101https://doi.org/10. /2021  is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.