## Abstract

**Background** While pathogens often evolve towards reduced virulence, many counterexamples are evident. When faced with a new pathogen, such as SARS-CoV-2, it is highly desirable to be able to forecast the case fatality rate (CFR) into the future. Considerable effort has been invested towards the development of a mathematical framework for predicting virulence evolution. Although these approaches accurately recapitulate some complex outcomes, most rely on an assumed trade-off between mortality and infectivity. It is often impractical to empirically validate this constraint for human pathogens.

**Results** Using a compartment model with parameters tuning the degree to which symptomatic individuals are isolated and the duration of immunity, we reveal kinetic constraints where the variation of multiple parameters in concert leads to decreased virulence and increased pathogen fitness, whereas independent variation of the parameters decreases pathogen fitness. Smallpox, SARS-CoV-2, and Influenza are analyzed as diverse representatives of human respiratory viruses. We show that highly virulent viruses, such as Smallpox, are likely often constrained by host behavior, whereas moderately virulent viruses, such as SARS-CoV-2, appear to be typically constrained by the relationship between the duration of immunity and CFR.

**Conclusions** The evolution of human respiratory epidemics appears to be often kinetically constrained and a reduction in virulence should not be assumed. Our findings imply that, without continued public health intervention, SARS-CoV-2 is likely to continue presenting a substantial disease burden. The existence of a parameter regime admitting endemic equilibrium suggests that herd immunity is unachievable. However, we demonstrate that even partial isolation of symptomatic individuals can have a major effect not only by reducing the number of fatalities in the short term but also by potentially changing the evolutionary trajectory of the virus towards reduced virulence.

## Background

The rates of morbidity, mortality, and infection determine whether a pathogen is tolerated by its host and, in turn, the survival of the pathogen itself. Constraints imposed by the host behavior and pathogen biology prevent independent variation of these rates. Therefore, trends in virulence evolution can be predicted only through understanding these constraints. Although comprehensive models of the evolution of virulence capable of describing complex environments have been developed[1-5], most studies to date impose constraints assumed from first principles and lacking experimental or empirical validation[6,7]. Most commonly, a trade-off function is assumed[8,9] between the rate at which the pathogen is transmitted between hosts and the case fatality rate (CFR).

Hosts with high pathogen loads are more likely to die than those with lower loads, but they also shed the pathogen at increased rates and therefore could transmit it to a greater number of new hosts. However, this straightforward picture is complicated by a landscape of opposite evolutionary outcomes. For example, leaky vaccination has been suggested to increase virulence in malaria [10,11] and decrease virulence for diphtheria and pertussis [12], likely due to differences in the cost of toxin production [13]. This demonstrates how predictions of specific outcomes useful for informing public health intervention cannot be generalized across pathogens. The infectivity-virulence trade-off has been demonstrated for malaria[11], but not for most human pathogens and is extremely hard to validate empirically due to the impracticality of comprehensive contact tracing. Thus, models that avoid the use of this constraint as a parameter have the potential to produce useful observations and predictions.

Towards this goal, we explore a range of epidemiological outcomes for human pathogens modelled using available data on well-characterized respiratory viruses. The apparent inverse relationship between the time during which the host is asymptomatic but infectious and the mortality rate[14-22], as well as the difficulty of vaccination against lower mortality viruses, such as Influenza, relative to higher mortality viruses, such as Smallpox, imply the existence of intrinsic constraints that could link the duration of immunity and mortality. We demonstrate that, in addition to trade-offs, virulence evolution is often subject to “kinetic constraints” that prevent virulence reduction by imposing a barrier in the fitness landscape, analogous to an energetically favorable chemical reaction with a high activation energy. Virulence evolution would be thermodynamically constrained if it were impossible to both increase pathogen fitness and decrease CFR. In contrast, virulence evolution is kinetically constrained when there exists a parameter regime where pathogen fitness is higher and CFR is lower, but accessing this regime requires simultaneously modifying multiple parameters, some of which might be determined by host behavior. In other words, there is a path to decreased CFR on the fitness landscape, but it is narrow. We show that, for high CFR viruses such as Smallpox, the relationship between transmission rate and mortality is likely a kinetic constraint, rather than an actual trade-off, whereas for intermediate CFR viruses, such as SARS-CoV-2, there is a kinetic constraint between immunity and mortality. Analysis of such constraints could open avenues for prediction of epidemic outcomes and quantitative validation of such predictions.

## Methods

The introduction of a novel pathogen into a host population is likely to result in one of three outcomes. 1) The number of fatalities is large enough for the pathogen to wipe out the host population (as a result, the pathogen itself also goes extinct) (Fig. 1A). 2) The number of infections is large enough, whereas the number of fatalities is small enough, such that the pathogen creates a bottleneck in the susceptible host population. With all hosts either infected or immune, the pathogen is eliminated from the host population (Fig. 1B). 3) The number of infections is small enough such that a bottleneck in the susceptible population is avoided and long-term co-evolution with the host is possible if the number of infections is not too small and the basic reproduction number is greater than one, *R*_{0}>1 (Fig. 1C). With *R*_{0}>1, a state of stable endemic equilibrium can be reached where the fraction of the host population susceptible to infection remains constant (up to fluctuations). However, if the pathogen-associated fatality exceeds the birth-rate of the host population, the host-pathogen relationship becomes unsustainable in the long term. Such an unsustainable relationship would pose an intense selective pressure on the host population, likely resulting in the extinction of the pathogen through the modification of host behavior or the emergence of resistant hosts.

Which of these courses an epidemic follows, largely depends on the balance of four factors. 1) The frequency of host-host interaction, with or without isolation of infected individuals or prophylactic quarantine (Fig. 1D). 2) The infectivity of the pathogen, that is, the likelihood of an uninfected host to become infected after interacting with an infected host (Fig. 1E). 3) The virulence of the pathogen, that is, the likelihood that an infected host will experience symptoms or die (Fig. 1F). 4) The duration of host immunity post infection (Fig. 1G). Although the effect of tuning each of these parameters often appears obvious - for example, decreasing the frequency of host-host interaction almost always decreases the number of infections - many counterintuitive observations become apparent. For example, decreasing the frequency of host-host interaction under conditions that would otherwise lead to a bottleneck in the susceptible population (Fig. 1B) can result in stable co-evolution with the host which, avoiding pathogen extinction, ultimately increases the total number of infections sustained over time. In an effort to better delineate how the host-pathogen relationship varies across the space of these factors, we constructed the following model.

Hosts are assigned one of the four possible states: 1) immune *I*, 2) susceptible *S*, 3) asymptomatic *A*, and 4) symptomatic or “clinical” *C*. New hosts are assumed to be born susceptible at a rate *k*_{B} and a baseline death rate *k*_{D} is assumed constant across all compartments. Susceptible hosts can be infected by coming into contact with either asymptomatic or clinical hosts. Asymptomatic hosts either recover at rate *k*_{R} or progress to the clinical compartment at rate *k*_{P}. Clinical hosts either recover at rate *k*_{R} or die due to the pathogen at rate *k*_{DV}. Recovery confers immunity which is then lost at rate *k*_{L}. At endemic equilibrium, the parameter 0 ≤ α ≤ 1 can be used to represent immunity such that *k*_{L} = (1 − α)*k*_{R}(*A* + *C*)/*I*. At the beginning of the epidemic, α ≈ 1. The population is well mixed, with the exception of the clinical compartment, a fraction (1 − *β*) of which is isolated and cannot infect susceptible hosts. The rate at which susceptible hosts become infected is the triple product of the rate of contact between hosts, the fraction of the population (not isolated) which is infected, and the probability of infection upon contact:. For simplicity, we consider the product, *k*_{I} ≡ *k*_{contact}*P*(*infect*), which depends on both host behavior and pathogen biology. This yields the system of ordinary differential equations (Fig. 2A):

With two infected states (asymptomatic and clinical) and isolation, ISAC is a simple model within the range of models [23-25] that have been developed in response to the SARS epidemic.

The basic reproduction number, *R*_{0}, for this model can be derived through the construction of next generation matrices[26] (see Appendix A in Additional File 1):

Short term dynamics are determined by the *R*_{0}, value. When *R*_{0}<1, the pathogen will go extinct. When *R*_{0}>1, a wide range of dynamics are possible, depending on the parameter regime. In most cases, a unique, stable endemic equilibrium exists [27,28]; however, if the susceptible population first reaches a bottleneck (Fig. 1B), the pathogen could go extinct. At endemic equilibrium, the fraction of the total population, *N*, in each compartment, *X*, is constant:. The case with the simplifying assumption of a constant total population is presented in Appendix B (Additional File 1). More generally, *n*^{′} = *k*_{B} − *k*_{D} − *ck*_{DV} which yields:

This system can be solved to yield a fourth order polynomial with respect to *c*_{*} (we used the MATLAB symbolic toolbox[29], see Appendix C in Additional File 1) which is cumbersome enough that a numerical solution appears preferable; however, endemic equilibrium requires a constant or growing population . For human populations and pathogens, it is reasonable to assume that the birth rate is much lower than the recovery rate, , yielding the limit: .

Thus, for an endemic equilibrium to exist, either the clinical compartment has to be very small or the death rate due to the virus has to be very low compared to the recovery rate (in the opposite limit, when the birth rate is high, parameter regimes exist where neither endemic equilibrium is reached nor does either the host population or the virus go extinct). This allows us to linearize the model with respect to either the size of the clinical compartment, for pathogens with high mortality, or the ratio of the death rate to the recovery rate, for low mortality pathogens. For the high mortality case (see Appendix D in Additional File 1), the linearized model yields a unique, stable analytic solution for endemic equilibrium, given a sufficiently large *α* corresponding to at least partial immunity whenever *R*_{0} > 1[27,28]. In this case, the additional constraint *k*_{P} > *k*_{R} + *k*_{DV}, which largely holds for the pathogens considered here, is applied for convenience. For the low mortality case (see Appendix E in Additional File 1), the stability of the solution depends on the parameters, and both stability and the general solution for the critical point are calculated numerically; however, analytic forms for endemic equilibria in the stricter limit are provided.

The parameters were fit for three respiratory pathogenic viruses: Smallpox, SARS-CoV-2, and Influenza representing a range of phenotypes (Table 1). Here, *k*_{B} and *k*_{D} are fixed whereas *k*_{I} is varied. Then, *k*_{R} and *k*_{P} are fit to an estimated disease course for a host which is asymptomatic and infectious for the time *t*_{P} = 1/*k*_{P}, and symptomatic and infectious for the time *t*_{R} = 1/*k*_{R} before recovering. For simplicity, death due to infection is assumed to occur only during the symptomatic and infectious phase: . Smallpox is modelled with a CFR of 30%, mean time to recovery 1 week, and no asymptomatic and infectious period.

SARS-CoV-2 is modelled at a CFR of 1%, 1 week recovery period, and 3 days asymptomatic and infectious. Influenza is modelled at a CFR of 0.05%, 1 week recovery period, and 3 days asymptomatic and infectious. Smallpox infection is assumed to confer permanent immunity. SARS-CoV-2 and Influenza infections are assumed to confer immunity for one year on average. A constant birthrate of 2.5 births per 2 people over 100 years and death rate of one death per person over 100 years is assumed.

## Results

Endemic equilibrium is bounded within a range of (host) contact rates (Fig. 2B). When the mean time between contacts is too long and *k*_{I} is too low, , *R*_{0} <1, the pathogen goes extinct, and disease-free equilibrium is reached. Likewise, when contacts are too frequent, a bottleneck in the susceptible population occurs (Fig. 1B, here assumed to drop down to 10% of the total population), and the pathogen goes extinct. For some viruses, such as Smallpox and SARS-CoV-2, but not Influenza, a range of contact rates exists where the fraction of the infected population at endemic equilibrium is large enough so that the death rate exceeds the birth rate and the host-virus relationship is unsustainable in the long term, resulting in population decline without decreased pathogen virulence or modified host behavior. This range is very narrow for Smallpox but notably broad for SARS-CoV-2 (Fig. 2B, not shown to scale) encompassing a wider range of host behavior than endemic equilibrium. The existence of this region in the parameter space implies that herd immunity might be impossible to reach. In the middle-range of contact rates admitting endemic equilibrium (Fig. 1C), the decreased fraction of the infected population for all three examined viruses is offset by increasing CFR leading to an increased death rate. Under these model assumptions, the yearly death rate for SARS-CoV-2 is approximately 6 times that of Influenza.

To examine an expanded two-dimensional phase space, we allowed the CFR to vary from 10% to 100% for Smallpox-like viruses, with no asymptomatic spread and permanent immunity, and from 0% to 10%, for SARS-CoV-2-like and Influenza-like viruses, with asymptomatic spread and temporary immunity (Fig. 3). As the CFR increases, both threshold contact rates, corresponding to *R*_{0}*=*1 and to the bottleneck in the susceptible population, increase and the range admitting endemic equilibrium narrows. Across much of the phase space, the host-pathogen relationship is unsustainable, and at very high CFR, the total host population falls below 10% (dark red) in the initial phase of the epidemic, signaling possible extinction at short timescales. The contours within the region corresponding to the endemic equilibrium indicate the total size of the infected population and are thus proportional to the size of the viral population and virus fitness. Within this region, increasing contact rate and decreasing CFR increases the size of the infected population. At extremely high CFR, the gradient points primarily in the direction of decreasing CFR, and at low contact rates, the gradient points primarily in the direction of increasing contact rate.

Throughout most of the region in this phase space corresponding to endemic equilibrium for Smallpox-like viruses (Figure 3), moving Southwest increases the size of the infected population and suggests evolution towards decreased virulence. However, this is not the case at the Northeast corner representing viruses with extremely high CFR and extremely high contact rates (or infectivity). Such, hypothetical, viruses would have an unsustainable relationship with the host population if virulence decreased (and infected hosts were less likely to die before interacting with uninfected hosts) and are, in a sense, kinetically constrained, likely, by the host behavior. The population size of such viruses would dramatically increase if the CFR was reduced but would remain in endemic equilibrium only if contact rates or infectivity simultaneously decreased. Smallpox evolution could be similarly kinetically constrained. On the phase diagram for high CFR viruses, Smallpox, with a CFR of 30%, is located near the triple point for host populations with high contact rates where the regions of disease free equilibrium, endemic equilibrium and unsustainability meet (highlighted in Figure 3). To increase the size of the infected population, both CFR and contact rate must decrease, whereas decreasing only the CFR ultimately results in disease-free equilibrium when such a virus enters new host communities, due to a bottleneck in the size of the susceptible population.

The corresponding triple point located at the boundary between Influenza-like and SARS-CoV-2-like viruses lacks this property. For these viruses, decreasing CFR increases the size of the infected population throughout the parameter space. However, the boundary highlighted within the SARS-CoV-2 diagram (Fig. 3) represents a different kinetic constraint, this one, between immunity and mortality. While decreasing CFR below 10% minimally affects the threshold contact rates (*R*_{0}*=*1 and susceptibility bottleneck), varying the duration of immunity has a dramatic impact on the phase diagram (Fig. 4A). As the duration of immunity decreases, with α = 0.01 corresponding to approximately 1.5 years at the phase boundary, the endemic equilibrium region shrinks substantially.

As is apparent from the analytic solutions given in the Appendix (Additional File 1) and illustrated for a hypothetical virus with a CFR of zero (other parameters matching Influenza and Sars-Cov-2) and the maximum admitted contact rate (Fig. 4B), decreasing the duration of immunity dramatically increases the size of the infected population at endemic equilibrium across the parameter regimes. While typically viewed from a host-centric perspective, immunity is almost always necessary for the maintenance of endemic equilibrium and thus required for maintaining large viral populations over long time scales. Consider a SARS-CoV-2-like virus near the highlighted boundary in Figure 3. Suppose this virus acquires an adaptation enabling immunity evasion and thus decreasing the mean duration of immunity post infection. The size of the infected population will increase and, being near the boundary, the host-pathogen relationship will become unsustainable. This is another example of a kinetic constraint. Decreasing both CFR and the duration of immunity increases the size of the infected population, but maintaining a stable host-pathogen relationship and long term viral fitness requires that the reduction in CFR is proportional to the reduction in immune duration. Otherwise, the overall death rate for the host population might increase despite a reduction in pathogen virulence. Notably, decreasing the host-host contact rate moves the population farther from this boundary in the phase space and alleviates this kinetic constraint. Therefore, even if host-host contact rates cannot be reduced enough to break endemic equilibrium by reaching *R0*<1, a modest reduction can change the evolutionary trajectory of the pathogen from one of stagnant or increasing virulence to one of decreasing virulence.

One way a host population can effectively decrease the rate of infection is through the isolation of symptomatic individuals. Isolation decreases death rate at the peak of the epidemic (Fig. 4C) and can prevent an epidemic entirely by driving *R*_{0} below 1, which requires. For pathogens with substantial asymptomatic or presymptomatic spread (small *k*_{P}), this approach might not be feasible. Although isolation narrows the range of contact rates that admits endemic equilibria or unsustainability (Fig. 5A), the death rate at endemic equilibrium varies little with decreasing *β* and even increases in the case of SARS-CoV-2. Notably, however, SARS-CoV-2 is particularly sensitive to changes in *β* such that a modest decrease in *β*, that is, increased isolation, can change the long term outcome from endemic equilibrium to disease-free equilibrium. Furthermore, despite a 30-fold difference in CFR, at endemic equilibrium, Smallpox and SARS-CoV-2 have a comparable host mortality rate of approximately 0.1%/year, highlighting how pathogens with low or intermediate virulence can cause as many fatalities as highly virulent pathogens if allowed to reach endemic equilibrium.

## Discussion

We show here that virulence evolution in human pathogenic respiratory viruses can often be kinetically constrained. For Smallpox-like pathogens with high CFR in communities with frequent contact, a reduction in CFR can create a bottleneck in the size of the susceptible population, resulting in disease-free equilibrium (that is, extinction of the virus) when not accompanied by a decrease in the rate of infection. The rate of infection is determined by both the infectivity of the virus and the host-host contact rate. Under these conditions, if infectivity were internally constrained by CFR, this would not constitute a fitness trade-off and could facilitate host adaptation. On the other hand, high host-host contact rates could externally constrain the rate of infection making reduction in virulence costly to the virus. Evolution of the smallpox virus shows a steady pattern of gene losses that likely lead to increasing infectivity and virulence [30,31]. This evolutionary trend appears to be compatible with the conclusion that high CFR, Smallpox-like viruses are unlikely to evolve towards decreasing CFR due to constraints imposed by the host behavior.

For SARS-CoV-2-like pathogens with moderate CFR, evolution towards decreased virulence can be kinetically constrained by the relationship between CFR and the duration of immunity. Decreasing the duration of immunity increases the size of the infected population and the overall death rate which can make the host-pathogen relationship unsustainable. The existence of a large region of the phase space corresponding to unsustainable or kinetically constrained moderate CFR viruses implies two distinct forms of host response over two different timescales. Over long timescales, unsustainable viruses are likely to face extinction due to the elimination of the susceptible host population. Although, in principle, this could occur via extinction of the entire host population, the emergence of host resistance is likely. Over short timescales, especially for modern human populations, the emergence of an unsustainable virus, such SARS-CoV-2, may be considered societally unacceptable, leading to drastic measures that result in a major reduction in host-host contact rates. Both trends likely contribute to the paucity of moderate CFR human respiratory viruses which are subject to this kinetic constraint between immunity and virulence.

Perhaps paradoxically, immune evasion could incur a fitness cost for the pathogen and even lead to its extinction due to the host response. However, some level of immune evasion is required to maintain any state of endemic equilibrium in the case where lifelong immunity is conferred against individual strains and the duration of immunity is determined by antigenic drift rather than the decline of immunity itself. Evolution towards decreased virulence is uncertain in this case, and a better understanding of internal genomic constraints [7] could help predict the effects of immunomodulation. This is important when assessing the impact of novel or imperfect vaccination which can lead to counterintuitive results [32,33]. Diversification related to immune evasion commonly enables the maintenance of large virus populations over long time scales, as is the case for Influenza [34-36]. In the case of SARS-CoV-2, although immune evasion remains to be experimentally confirmed, diversification and host adaptation of SARS-CoV-2 have already been demonstrated [37-39]. Furthermore, products of virus genes that are specifically found in pathogenic beta-coronaviruses have been implicated in immunomodulation [40], suggesting that the virus adapts to maintain an endemic equilibrium in this way. Although the present model cannot predict whether SARS-CoV-2 will become more or less virulent, our results do suggest that its virulence evolution is kinetically constrained such that the region of the parameter space where reduced virulence could evolve is small. On the other hand, we show that even modestly decreasing the host-host contact rate can alleviate this kinetic constraint and promote virulence reduction.

During both the ongoing SARS-CoV-2 pandemic and the first Sars-Cov-1 epidemic, stringent public health measures were taken to limit transmission, extending beyond isolation of symptomatic individuals and into the quarantine of asymptomatic, and likely uninfected, contacts [17,25]. Although only a crude depiction of the nuanced dynamics underlying SARS-CoV-2 transmission, the analysis presented here suggests that isolation and quarantine are particularly effective towards changing the long-term outcome for viruses with moderate CFR and high infectivity, such as SARS-CoV-2. The phase diagram for SARS-CoV-2 is sensitive to the parameter *β* which reflects the isolation of symptomatic individuals. The evolution of the epidemic for such viruses is dominated by disease-free equilibrium or an unsustainable host-virus relationship. Endemic equilibrium is possible only in a narrow parameter range and is therefore unlikely. Nonetheless, the existence of this range suggests that herd immunity is unlikely and would amount to an extreme number of fatalities. Our analysis shows that, while highly amenable to public health intervention, without such efforts, SARS-CoV-2 can be expected to contribute to a substantially higher death toll than Influenza, comparable instead to that of Smallpox, for a protracted period.

## Conclusions

Human respiratory epidemics often evolve under kinetic constraints. These constraints can prevent the reduction in virulence for both high and moderate CFR viruses. The incorporation of these constraints can assist in the interpretation of classical model results for epidemics where some parameters, such as host-host contact rate, are unknown. We show that SARS-CoV-2 is unlikely to reach a state of endemic equilibrium; however, the potential for such equilibrium implies that herd immunity is likely unachievable. At equilibrium, moderate CFR viruses can cause as many fatalities as high CFR viruses with both SARS-CoV-2 and Smallpox leading to death of about 0.1% of the population per year. However, even partial isolation of symptomatic individuals can have a major effect not only by reducing the number of fatalities in the short term but also by potentially changing the evolutionary trajectory of the virus towards reduced virulence. Such simple public health interventions can dramatically decrease the forecasted cost of the virus over both the short and long term.

## Data Availability

NA

## Declarations

### Ethics approval and consent to participate: Not applicable. Consent for publication

Not applicable.

### Availability of data and materials

Not applicable.

### Competing interests

The authors declare that they have no competing interests.

### Funding

NDR, YIW, and EVK are supported by the Intramural Research Program of the National Institutes of Health (National Library of Medicine).

### Authors’ contributions

NDR, YIW, and EVK conceived of and designed the study; NDR implemented the mathematical model; NDR, YIW, and EVK analyzed the results; NDR and EVK wrote the manuscript that was read and approved by all authors.

## Acknowledgements

The authors thank Koonin group members for helpful discussions.