## ABSTRACT

COVID-19 has circled the globe, rapidly expanding from an outbreak in China to a pandemic in a matter of weeks. Lacking an effective vaccine or proven, pharmaceutical therapy, countries have enacted an unprecedented series of non-pharmaceutical interventions to combat this disease. However, the potential for variation in free-living virus survival to modulate the efficacy of these interventions remains unclear. Using an empirically determined understanding of SARS-CoV-2 natural history and detailed, country-level case data, we elucidate how variation in free-living virus survival influences key features of COVID-19 epidemics. Our findings suggest that COVID-19’s basic reproductive number and other key signatures of outbreak intensity are modulated by transmission between infected individuals and the environment. In addition, country-level outbreaks vary in the degree to which environmental transmission appears to modulate COVID-19 dynamics. Summarizing, we propose that informed models of “sit and wait,” environmental transmission are essential in emerging outbreaks, as they highlight how variation in environmental transmission may explain observed differences in disease dynamics from setting to setting, and can inform public health interventions.

## INTRODUCTION

Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2), the etiological agent of coronavirus disease 2019 (COVID-19), has caused one of the most devasting pandemics of the last century. The complex set of epidemiological characteristics defining COVID-19 outbreaks presents a number of challenges for controlling this disease. As a consequence, countries have achieved varying levels of success in reducing transmission and protecting vulnerable populations, often with dramatic variation from setting to setting in the epidemic growth rate, intensity, and/or severity. Developing a mechanistic understanding of how SARS-CoV-2 is transmitted in different settings is thus essential for guiding ongoing and future non-pharmaceutical interventions.

The basic reproductive number (1–3), fatality rate (4–5), incubation period (4, 6–8), transmission interval (9), prevalence of super-spreading events (10–11) and other relevant aspects of COVID-19 epidemiology that provide a mechanistic window into how SARS-CoV-2 is transmitted in different settings. However, one feature of SARS-CoV-2 transmission that was validated in laboratory settings (12), but whose epidemiological role remains uncertain, is SARS-CoV-2 free-living survival. Specifically, while several laboratory and epidemiological findings have suggested that environmental transmission may play a role in some settings (11, 13-14), none have fully investigated how this route of transmission may influence outbreaks.

In this study, we identify key laboratory and epidemiologically validated parameters associated with SARS-CoV-2 environmental transmission. We then integrate these findings into a mechanistic transmission model of SARS-CoV-2 to evaluate the potential for variability in environmentally-mediated transmission to explain variability in COVID-19 outbreak intensity. This framework includes parameters corresponding to the transmission of the virus from both presymptomatic/asymptomatic and clinical (symptomatic) carriers of SARS-CoV-2, and the possibility that susceptible hosts can acquire infection through environmental reservoirs. We examine how outbreak dynamics are influenced by differences in SARS-CoV-2 free-living survival that include values corresponding to empirical values for survival on various abiotic surfaces (e.g. plastic, copper, steel, cardboard) (12).

Using confirmed case data from seventeen countries from around the world with large outbreaks, we find evidence that the role of environmental SARS-CoV-2 transmission on COVID-19 epidemics can vary from setting to setting. Our findings highlight the need to incorporate the potential for environmental transmission into COVID-19 non-pharmaceutical intervention plans and forecasting models. Specifically, we propose that environmental transmission—including the particulars of how free-living virus survives on different abiotic surfaces—should be a greater focus in emerging infectious disease outbreaks, as undertreatment of this route can obfuscate essential properties of how the disease is spreading and potential avenues for intervention.

### A *Waterborne*, *Abiotic*, and other *Indirectly Transmitted* (WAIT) model of SARS-CoV-2 transmission

Several models have been engineered to explore aspects of COVID-19 dynamics. For example, models have been used to investigate the role of social distancing (2, 15), social mixing (16), the importance of undocumented infections (17), the role of mobility in the early spread of disease in China (18), and the potential for contact tracing as a solution (19). Only a few notable models of SARS-CoV-2 transmission incorporate features of indirect or environmental transmission (13-14, 19), and none consider the dynamical properties of viral free-living survival in the environment. Such a model structure would provide an avenue towards exploring how variation in free-living survival influences disease outbreaks. Environmental transmission models are aplenty in the literature and serve as a theoretical foundation for exploring similar concepts in SARS-CoV-2 transmission (20–29).

Here, we parameterize and validate an SEIR-W model: Susceptible (*S*), Exposed (*E*), Infectious (*I*), Recovered (R), and WAIT (*W*) model. Here *W* represents the environmental component of the transmission cycle during the early stage of the SARS CoV-2 pandemic. This model is derived from a framework previously developed called the “WAIT” modeling framework—which stands for *Waterborne, Abiotic,* and other *Indirectly Transmitted*—that incorporates an environmental reservoir where a pathogen can sit and *wait* for hosts to interact with it (30–31).

## METHODS

### Building the SEIR-W model framework for SARS-CoV-2

Here *W* represents the environmental component of the early stage of the SARS CoV-2 pandemic (Figure 1). In the context of SARS CoV-2, this environmental compartment refers to surfaces that people may have contact with on a daily basis, such as doorknobs, tables, chairs, or mail packages. The *W* compartment of our model represents the fraction of these environmental reservoirs that house some sufficiently transmissible amount of infectious virus. We emphasize that the *W* compartment is meant to only represent surfaces that are common sites for interaction with people. Thus, inclusion of the *W* compartment allows us to investigate the degree to which the environment is infectious at any given point, and its impact on the transmission dynamics of SARS CoV-2. We will sometimes use the term “environment” and “surfaces” or “objects” interchangeably.

Model parameters are described in detail in Table 1. The system of equations in the proposed mathematical model corresponding to these dynamics are defined in equations 1–6:

### Infection trajectories

Our model also deviates from the traditional SEIR form by splitting the infectious compartment into an *I _{A}*-compartment (

*A*for asymptomatic), and an

*I*-compartment (

_{S}*S*for symptomatic). The former represents an initial infectious stage (following the non-infectious, exposed stage), from which individuals will either move on to recovery directly (representing those individuals who experienced mild to no symptoms) or move on to the

*I*-compartment (representing those with a more severe response). Finally, individuals in the

_{S}*I*-compartment will either move on to recovery or death due to the infection. This splitting of the traditional infectious compartment is motivated by mounting evidence of asymptomatic transmission of SARS CoV-2 (17, 33-36). Thus, we consider two trajectories for the course of the disease, similar to those employed by (15): (1)

_{S}*E*→

*I*(

_{A}→*R*and (2)

*E →*(

*I*→ (

_{A}*I*(

_{S}→*R*(

*or death*). More precisely, once in the

*E*state, an individual will transition to the infectious state

*I*, at a per-person rate of ε. A proportion

_{A}*p*will move from

*I*to the recovered state

_{A}*R*(at a rate of

*p*ω). A proportion (

*1 - p*) of individuals in the

*I*state will develop more severe systems and transition to

_{A}*I*(at a rate of (

_{S}*1 - p*) w). Individuals in the

*I*state recover at a per-person rate of

_{S}*v*or die at a per-person rate

*μ*. In each state, normal mortality of the individual occurs at the per-person rate

_{S}*μ*and newly susceptible (

*S*) individuals enter the population at a rate

*μN.*The important differences between these two trajectories are in how likely an individual is to move down one path or another, how infectious individuals are (both for people and for the environment), how long individuals spend in each trajectory, and how likely death is along each trajectory.

### Interactions between the environment and people

The model couples the environment and people in two ways: (1) people can deposit the infectious virus to surfaces (infecting the environmental reservoirs), and (2) people can become infected by interacting with these surfaces (infecting the people). Surfaces infect people through the *β _{w}* term (equations 1 and 2), a proxy for a standard transmission coefficient, corresponding specifically to the probability of successful infectious transmission from the environment to a susceptible individual (the full rate term being

*β*Hence, the

_{w}W·S).*β*factor is defined as the fraction of people who interact with the environment daily, per fraction of the environment, times the probability of transmitting infection from surfaces to people. The factor

_{w}*β*(where

_{w}*W*is the fraction of surfaces infected) represents the daily fraction of people that will interact with the infected portion of the environment and become infected themselves. The full term

*β*is thus the total number of infections caused by the environment per day.

_{w}W·SIn an analogous manner, we model the spread of infection *to* the environment with the two terms *σ _{A} I_{A}-(1 - W) / N* and

*σ*representing deposition of infection to the environment by asymptomatic individuals, in the former, and symptomatic individuals, in the latter. In this case,

_{S}I_{S}-(1 - W) / N*σ*(and analogously for

_{A}*σ*) gives the fraction of surfaces/objects that interact with people at least once per day, times the probability that a person (depending on whether they are in the

_{S}*I*or the

_{A}*I*compartment) will deposit an infectious viral load to the surface/object. Thus,

_{s}*σ*I

_{A}_{A}/ N and

*σ*(where

_{s}I_{s}/ N*N*is the total population of people) represent the daily fraction of the environment that interacts with asymptomatic and symptomatic individuals, respectively. Lastly, the additional factor of

*(1 - W)*gives the fraction of surfaces/objects in the environment that have the potential for becoming infected, and so

*σ*(and analogously for

_{A}I_{A}-(1 - W) /N*I*gives the fraction of the environment that becomes infected by people each day. We use

_{S})*W*to represent a fraction of the environment, although we could have just as well multiplied the

*W*equation by some value representing the total number of surfaces/objects in the environment (expected to remain constant throughout the course of the epidemic, assuming no intervention strategies).

### Parameter values estimation

Table 1 displays information on the population definitions and initial values in the model. Tables 2 and 3 contain the fixed and estimated values and their sources (respectively). Because this model iteration is relatively underexplored with regards to COVID-19, we have worked to justify its use in various ways. The model’s estimated parameters are based on model fits to 17 countries with the highest cumulative COVID-19 cases (of the 181 total countries affected) as of 03/30/2020, who have endured outbreaks that had developed for at least 30 days following the first day with ≥10 cumulative infected cases within each country (37) (See Supplemental Tables S1 – S3). In addition, we compare country fits of the SEIR-W model to fits with a standard SEIR model. Lastly, we compare how various iterations of these mathematical models compare to one another with regards to the general model dynamics. For additional details, see the Supplemental Information.

### Estimation of fixed parameters

There are 6 fixed parameters, 6 fitted parameters, and one parameter (*ω*) dependent on the values of one of the fixed parameters (*η*) and one of the fitted parameters (**ε**). These fixed parameters are *η*, *μ*, *μ*_{S}, *v*, *k,* & *p.* The first, *η*, is the incubation period) (6, 38), and we assume that the expected time in the *E* state (1/**ε**) and the expected time in the *I _{A}* state (1/ω) sums to

*η*, i.e.

*η*= 1/

**ε**+ 1/ω. Fixing

*η*constrains one of the two parameters,

**ε**or ω, and the other can be fitted; we choose to fit

**ε**and therefore constrain ω. The second fixed parameter

*μ*, the normal death rate, was calculated by taking the reciprocal of the average life expectancy (in days) of the 17 countries sampled, weighted by population size. We calculated a value of 80.3 years, based on data from individual countries (39). The third parameter

*μ*

_{s}is the sum of the normal death rate and an additional death rate due to a more severe form of the infection. We assumed a death rate of 3.8% (38) and that death follows after initial symptoms between 3 and 4 weeks (38). Thus

*μ*

_{s}=

*μ*+ 0.038/(3.5

^{*}7), where we use the average of 3 and 4 weeks and we convert to days with the factor of 7. The fourth fixed parameter

*v*, the recovery rate once in the symptomatic state, was assumed to be the reciprocal of the average of 3 and 6 weeks (the range of recovery times) (4, 38) times the fraction of individuals in the symptomatic state that do not die, i.e. 1 - 0.038, so

*v*= (1 - 0.038)/(4.5

^{*}7). The fifth fixed parameter

*k,*the rate of viral decay in the environment, is the reciprocal of the average time that SARS-CoV-2 is expected to survive in the environment across a set of physical surfaces, based on the survival times across a handful of materials (19). The sixth fixed parameter

*p*, the fraction of individuals in the

*I*state that move on to recovery without experiencing severe symptoms, was taken to be 0.956 (2).

_{A}We fit our model variations to the *daily new cases data* provided (See: Supplemental Information) starting on the day when there were ≥10 cumulative infected cases in that region. We choose the starting point of 10 cumulative cases in order to allow the outbreak to settle into a more consistent doubling time while also providing enough of an early-on window to capture the dynamics relevant to the and force of infection estimations.

We calculate the number of daily new infections in our model by numerically integrating the influx rate of new symptomatic infections over the course of a single day (i.e. ∫ *(1 - p) ω I _{A}* dt). We perform this calculation for each of 30 consecutive days and fit these values to the daily new cases data. We use the influx rate of symptomatic infections, as opposed to the total rate of new infection (including asymptomatic individuals), as we expect that the large majority of reported cases in the early COVID-19 outbreak to be symptomatic. And we expect that—in the outbreaks—almost all asymptomatic cases go unreported (17).

### Initial conditions

For each country, we use the first cumulative count that is ≥10 as a proxy for the initial number of *active* symptomatic cases *I _{s0}.* We can justify this by proposing that, given that the doubling time is expected to fall between 3 and 6 days (40) then the exponential growth rate parameter of the infection (

*r in exp*(

*rt*)) would fall between 0.231 days

^{-1}and 0.116 days

^{-1}respectively. And, assuming 1 initial infected individual, the time to reach 10 cases for the former rate would be about 10.0 days (~log(10)/0.231) and the time for the latter would be 19.8 days (~log(10)/0.116). Thus, since recovery of symptomatic individuals, which takes between 3 and 6 weeks (36, 38) exceeds this interval, we expect that at the point when 10 cases have accumulated, all cases are still active. Lastly, in fitting the data to our model, we initialize all fitting parameters to a value of 1.5, in whatever units are appropriate for that parameter (expected to be close to the true value for most of the fitting parameters). As an estimate for the initial number active

*asymptomatic*cases, we take

*I*= I

_{A0}_{S0}. That is, we expect that there are approximately as many asymptomatic cases as symptomatic cases early on. This assumption appears to be consistent with empirical findings. For example, data from the Diamond Princess cruise liner (11, 41), where all passengers were tested, revealed that approximately half of positively-testing cases were asymptomatic. Lastly, we assumed that the initial number of exposed individuals was approximately , based on the supposition that each of the initially infectious individuals

*(I*will have exposed the infection to approximately other individuals. We take the value of in this case to be 2.5, based on prior studies (15, 17). The

_{A0}+ I_{S0})*R*population is assumed to be 0 in the early stage of the outbreak, given the (average) 3 to 6-week recovery delay of COVID-19. The

*W*population is assumed to be 1%.

### Basic reproductive ratios

We can express the (eq. 7) in a form that makes explicit the contributions from the environment and from person-to-person interactions. In this way, the full is observed to comprise two sub-components: one the number of secondary infections caused by a single infected person through person-to-person contact alone (*R _{p}*) and the other is the number of secondary infections caused by exchanging infection with the environment (

*R*). where

_{e}*R*and

_{p}*R*are defined in equations 8a and 8b.

_{e}Note that when *R _{p}* = 0, the reduces to

*R*and when

_{e}*R*0, the reduces to

_{e}=*R*. Thus, when person-to-person transmission is set to zero, the consists only of terms associated with transmission from the environment, and when transmission from the environment is set to zero, the consists only of infection directly between people. When both routes of transmission are turned on, the two -components combine in the manner in equation 7.

_{p}While *R _{e}* represents the component of the formula associated with infection from the environment, the square of this quantity represents the expected number of

*people*who become infected in the two-step infection process: people → environment → people, representing the flow of infection from people to the environment, and then from the environment to people. Thus, while

*R*gives the expected number of people infected by a single infected person when the environmental transmission is turned off, gives the expected number of

_{p}*people*infected by a single infected person by way of the environmental route exclusively (no direct person-to-person transmission).

Elaboration on formulas 8a-b—and associated derivation-discussions—appear in the Supplemental Information.

## RESULTS

### Evidence for environmental transmission in different countries

Using the Akaike information criterion (AIC), SEIR models with an environmental compartment (SEIR-W) provide a strong relative fit to country incidence data. As discussed in the **Methods**, we compared the performance of models with (SEIR-W) and without (SEIR) environmental transmission across multiple countries to assess the role of environmental transmission in different contexts. Using the fitted parameters provided in Tables 1–3, and S1-S3, we calculate AIC values for the two mechanistic models: the standard SEIR model and the SEIR-W model. Table S4 displays the summary of the AIC values for each model-type fit to the first 30 days after the first day with total counts ≥10. In 10/17 countries (including 9/11 European countries), the SEIR-W model provided a better fit to the country data. In Figure 2, we display the comparative individual country fit results for 4 of the countries with the fastest 30-day case growth rates—Spain, Italy, Iran, and Switzerland. The SEIR-W variant provides a better fit (significantly lower AIC score) than the standard SEIR model for all of these. Note that, as features of independent country epidemics are myriad and difficult to disentangle, several aspects independent of the model structure could explain the superior fit of the SEIR-W models. Results for additional country fits can be found in the Supplemental Information, Figure S1.

### Environmental transmission modulates COVID-19 epidemiology

Partial Rank Correlation Coefficient (PRCC) analyses for the four examined features of the outbreak—(i) , (ii) total number of infected individuals after 30 days, (iii) time to peak number of infected individuals, and (iii) size of peak number of infected individuals. Figure 3 demonstrates the PRCC calculations for all four of these outbreak characteristics. For , we observe that the model was strongly sensitive to several aspects related to virus transmission— *β _{A},* β

_{S}, β

_{w}—as well as the rate at which asymptomatic individuals develop symptoms (ω), the rate of recovery (

*v*) and SARS-CoV-2 free-living survival rate (

*k*).

One can also observe how some parameters are better suited to modify the peak of the infection, such as the recovery rate (*v*; which includes in it the swiftness of diagnosing and treating the virus). Others modulate the timing of the peak, such as *ε*, the rate of leaving the “exposed” compartment (or equally well, the reciprocal of the average time spent in the exposed compartment). Note that across all features, the fraction of cases that move along the “mild” route (p), from *E→I _{A}→R*, has a powerful influence on all factors.

### The SARS-CoV-2 comprises person-to-person and environmental transmission

In the **Methods**, we described how the is composed of two sub- components, corresponding to different infectious interactions: person to person (*R _{p}*), person to environment and environment to person (

*R*). Tornado plots were constructed that demonstrate how the -components have their own architecture and sensitivity (Figure 4).

_{e}In Figure 5, we observe how variation in free-living survival (1/*k*) influences four characteristics of a SARS-CoV-2 outbreak: , total number of infected individuals after 30 days, time to peak number of infected and symptomatic individuals, and maximum number of symptomatic individuals in the first 30 days. Note the annotations on the figure that highlight where the empirically-determined survival times of SARS-CoV-2 on a range of surfaces (copper, plastic, cardboard, stainless steel) (12). Also note that the quantitative relationships between 1/*k* and various outbreak features are slightly different. For example, the increases more gradually across a wider range of free-living survival values than some of the other features (Figure 5).

### Surface composition modulates outbreak dynamics

Figure 6 (simulations) depict the results of “surface world” simulations, where the *k* values correspond to those from a 2020 study highlighting the survival of SARS-CoV-1 and SARS-CoV-2 on different physical surfaces (12). The summary of these simulations (Figure 6) highlights that the surface composition of a setting has a meaningful impact on several features of outbreak dynamics. “Copper world” (Figure 5a) takes the longest amount of time (88.4 days) to rise to the peak number of infected-symptomatic individuals, indicating an outbreak which is slower to develop. Relatedly, the values are much different in the different “surface world” scenarios: The “copper world” simulation has an of 2.4, while the plastic “surface world” simulation has an of 3.18 (Figure 7 and Table S5). In addition, the total number of individuals infected after 30 days of the outbreak, and the total number dead after 30 days are both significantly lower in the “copper world” setting (Figure 7 and Table S5). The peak value of infected individuals is not dramatically different across “surface worlds.” That is, while many features associated with severity differ greatly across “surface world” settings, we observed significantly less variation in the peak of the epidemic as compared with the time to the peak of the epidemic (Table 4). Maybe the most noteworthy of the differences is the vast disparity in the number of deaths in the first 30 days of the outbreak, where the “plastic world” setting has more than 30 times the number of deaths as the “copper world” scenario (1,814 vs. 55, respectively; Figure 7 and Table S5).

## DISCUSSION

### In any emerging infectious disease, the degree of environmental transmission should be the focus of early inquiry

In this study, we introduce a model for SARS-CoV-2 transmission that rigorously incorporates environmental transmission, labeled SEIR-W. We demonstrate that the SEIR-W model is often superior to SEIR models with regards to fitting the empirical data for the early trajectory of outbreaks across the world. That the SEIR-W model fits certain country data relative to others may be the consequence of many characteristics of an epidemic (e.g. quality of data, testing capacity), none of which represents anything meaningful about the mechanism of an outbreak. However, given that features of environmental transmission can influence central properties of disease dynamics, we should consider the possibility that variation in environmental transmission may contribute to variation in aspects of disease dynamics. The model is strongly sensitive to several aspects related to virus transmission, including the rate at which both symptomatic and asymptomatic individuals transmit infection to susceptible hosts, the rate at which asymptomatic individuals develop symptoms, the rate of recovery (*v*) and SARS-CoV-2_decay rate (*k*).

Deconstructing the basic reproductive number into subcomponent reveals the role of environmental transmission. By deconstructing the basic reproductive number into components, we can better understand how variation in the —by setting, time, or geography—may reside in how these contexts are driven by environmental transmission. Many of these effects may be (as they are in this study) localized to one component of the , labeled *R _{e}* in this study. Notably, the

*R*component is highly sensitive to the transmission interaction between people and the environment (

_{e}**β**

_{w}), and the decay rate of virus in the environment (

*k*). Interestingly, the

*R*is relatively robust to the rate of infectious virus shed into the environment from the asymptomatic infected individuals (the parameter called

_{e}*σ*in this model).

_{A}### SARS-CoV-2 dynamics in different “surface world” settings resemble essentially different outbreaks

Analysis of the and its subcomponents highlights that many aspects of outbreak dynamics are sensitive to the parameter associated with environmental decay rate (*k* in the model presented in this study). Analysis of hypothetical settings purely comprising substances of a certain kind (“surface world”) fortifies the significance of free-living survival on physical surfaces and environmental transmission in outbreak dynamics. While our findings cannot speak to the outbreak dynamics in any particular setting in the real world, they do reveal that the surface composition of a setting can significantly influence the behavior of an outbreak. For example, The SARS-CoV-2 in the “plastic world” simulation is over 1.3 times the in the “copper world” simulation . Many other differences between these outbreaks come as a consequence of the different values. For example, the “plastic world” simulation reaches a peak number of symptomatic infectious individuals almost 1.7 times faster than the “copper world” simulation, and kills over 30 times more people in the first 30 days (1,814 deaths in “plastic world” vs. 55 deaths in the “plastic world”).

These differences are so significant that they might be naively interpreted as completely different outbreaks early on. Note, however, that the maximum value of the infected-symptomatic populations are roughly equivalent across “surface worlds,” and so the influence of SARS-CoV-2 survival on physical surfaces (mediated by difference in free-living survival) doesn’t affect all aspects of outbreak dynamics equally.

As of April 15, 2020, the scientific community remains in the fact-finding phase of SARS-CoV-2 biology and COVID-19 understanding. A significant source of fear and speculation in the pandemic involves the plausibility that SARS-CoV-2 has undergone local adaptation in certain settings, translating to different epidemiological properties. While there is no currently convincing molecular or clinical support for local adaptation in SARS-CoV-2, our findings highlight how easy it is to conflate an environmental (or ecological) difference for a genetic one: the same virus, spreading in populations of identical size and behavior, differing only in the composition of physical surfaces where the virus can be transmitted through the environment, can have values between 2.4 and 3.18, with early death rates 30 times apart.

These disparities may underlie the difficulty in predicting outbreaks of a single type from setting to setting, a property that has recently been captured by a concept called permutation entropy (43). Even more, settings composed of certain physical surfaces (plastic-like in our model) may be associated with phenomenon resembling a “superspreading” event, where individual variation in contagiousness can drive unusually large numbers of infections (44). Perhaps a better understanding of how, and on what surfaces, viral populations survive may one day improve the predictability of outbreak trajectories.

### Public Health Implications

Our findings suggest that the effect of social distancing in ameliorating pandemics can be amplified by limiting interactions between susceptible individuals and surfaces, rather than solely interactions between individuals. Said differently, it is not enough that individuals remain separated from other individuals during a pandemic like COVID-19, but also that individuals remain protected from surfaces where other infectious individuals may have interacted. Highlighting the importance of measures like mask wearing in stemming COVID-19 outbreaks.

## Data Availability

Data are either available or the source is referenced in the main text and supplemental information. The code used for the analyses in this study is publicly available on Github: https://github.com/OgPlexus/Copperland

## DATA AND CODE AVAILABILITY

Data are either available or the source is referenced in the main text and supplemental information. The code used for the analyses in this study is publicly available on Github: https://github.com/OgPlexus/Copperland

## ACKNOWLEDGMENTS

The authors would like to thank W. Turner, L. Smolin, S. Ramachandran, and D. Weinreich for seminar invitations where iterations of the ideas in this manuscript were discussed. ALM would like to acknowledge NIH U54GM115677 for funding support.