Abstract
In response to the outbreak of SARS-CoV-2 many governments decided in 2020 to impose lockdowns. Although the package of measures which constitute such lockdowns differs between countries, it is a general rule that contacts between people, and especially in large groups of people, are avoided or prohibited. The main reasoning behind these measures is preventing that healthcare systems become overloaded. As of 2021 vaccines against SARS-CoV-2 are available, but these do not guarantee 100% risk reduction and it will take a while for the world to reach a sufficient immune status. This raises the question whether and under which conditions events like theater shows, conferences, professional sports events, concerts and festivals can be organized. The current paper presents a COVID-19 Risk taxation method for (large scale) events. This method can be applied to events to define an alternative package of measures replacing generic social distancing.
1. Introduction
Most of the available literature estimates the reproduction number R0 of Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is between 2-4 (1, 2). For Western Europe the R0 is estimated on 2.2 (3) and the Dutch National Institute for Public Health and the Environment (RIVM) estimated a range between 2 and 3 (4). The probability of an infected person infecting another person peaks during the first week of illness, after which this probability drops significantly (5). Based on data from Wuhan the incubation time was estimated on average days and minimal 4 days (6). Later studies based on more data indicated that the average incubation time was between 5.2 and 6.65 days and can be up to 14 days (7). The probability that an individual gets infected can be estimated based on the R0, the duration that contagious people can infect others and the prevalence. When R0 is 2.2, the duration is 7 days and the prevalence is 0,77% (this is the highest prevalence during the test events) then the probability that a person infects another person is 1.04 10−4 per hour (based on a population of the Netherlands of 17 million people).
Social distancing, to reduce the number of contacts, is the main measure to reduce the transmission of SARS-CoV-2 (8). To support social distancing, governments all over the world took measures, resulting in various lockdowns to reduce the number of contacts between people and the amount of people that can gather in a group.
As a consequence, large scale events were generally prohibited. At the same time, Dutch event-organizers affirmed that organizing events on the basis of social distancing would be economically detrimental. The question there is if an alternative package of measures can be defined instead of generic social distancing at these events, while requiring that the average individual risk of becoming infected at an event is equal to the infection risk for staying at home. In this study we distinguish between four types of events which can be representative for almost all events:
Type I: Indoor, passive (theater show or conference),
Type II: Indoor, active (concert or dance events),
Type III: Outdoor, active (public sports events),
Type IV: Outdoor, active festival (festivals).
An alternative for physical or biological models is a data driven analysis. Based on available data of infections and contacts among people in this period a model can be developed which relates the risk of infection to the number of contacts and other measures which can be implemented. Because governments directly implemented numerous measures to support social distancing and avoid large groups, the normal social contact data (9) do not apply because the behavior of people had been forced to change.
2. Methodology SARS-CoV-2 Risk taxation Model (SCRTM)
2.1 Methodology
In this study a SARS-CoV-2 risk taxation method was developed based on National Institute for Public Health and the Environment (RIVM) and Municipal health services (GGD) collected data sets during the SARS-CoV-2 pandemic. The method resulted in the average risk of infection (Ri) per hour, whereby Ri is a function of the prevalence (P) and the number of contacts per hour at a given location. We distinguish two contact-classes: up to 1,5m for droplets (C1) and up to 10m for aerosols (C2). The risk can further be reduced by:
Testing prior to an event which results in a reduction of P with factor FT.
Additional measures can be taken to improve ventilation for indoor events and reduce the exposure to aerosols by FY,x. x is the level of ventilation for C2. For x= 0 locations are ventilated as standard, for x= 1 indoor locations are very well ventilated and when x= 2 locations are outdoor.
Smart logistics at an event to reduce contacts and gathering of large groups of people. Therefore, specific data were collected at test events.
Personal protection measures such as mouth-nose masks. This reduces the risk with factor FM.
The impact of vaccination, which reduces the risk of transmission with a factor FV. These measures resulted in an average individual risk of infection per hour, given by:
The parameters A1 and A2 are transmission-coefficients for exposure to droplets and aerosols. These transmission-coefficients were based on data-analyses. The expected number of infections at an event can be defined by s and is given by the product of Ri, the duration of the event t in hours and the number of people at the event (N):
2.2 Transmission coefficients A1 and A2
The number of positive SARS-CoV-2 tests in the Netherlands are reported by the RIVM on a weekly base, and include the probable locations where people got infected. Examples of these locations are at home, at work, at home while having visitors (friends or family), leisure, but also schools, catering industry and elderly houses and events or parties. The measures taken by the government strongly influence the locations people frequently visit and their number of contacts.
The transmission coefficients A1 and A2 have been estimated based on national (RIVM) and regional (GGD-Amsterdam) data over the period 15 September – 13 October 2020 and between 14 October and 15 December 2020 using linear regression via Least-squares Minimization. On 13 October additional measures were implemented by the Dutch Government, leading to the final lockdown on 15 December 2020.
One of the first estimates was the risk per location (see table 1). As a consequence of the stay-at-home advice, most people, in absolute numbers, became infected at home. When corrected for the number of people present and duration of contact, the risk of getting infected at these locations can be standardized and the risk estimates be compared with other locations. The hours while sleeping (8 hours) are excluded.
Data used for the least-squares regression of A1 and A2 based on RIVM and the Survey.
In the second step, we estimated the number of contacts between people at a location. To do so, we combined data from a specifically developed questionnaire with statistical data (10). We focused on the locations at home, at work, visitors at home and leisure, for the other locations the amount of available data was not sufficient. In the questionnaire we asked people to estimate:
The time spent at a certain location.
The number of people in a range of 10m.
The number of contacts in a distance < 0,5 m, between 0,5 and 1,5 and between 1,5 and 2,0 m for less than a minute, between 1 and 15 minutes and more than 15 minutes.
The proportion of the time which was indoor, indoor and well-ventilated or outdoor at the location.
In the third step, we performed the data-analyses using linear regression via Least-squares Minimization on function (1) to determine transmission factor A1 and A2. The data gathered from step 1 and 2 was used as a training set (see table 2). FT and FV are assumed to be 0 because pre-testing and vaccines were not available. For each class of contacts, we assumed an average duration (30 seconds for the class < 1 minute, 8 minutes for contacts within 1 and 15 minutes and 30 minutes for contacts more than 15 minutes. For all indoor locations we assumed standard ventilated locations Fy,O = 0, for well ventilated indoor Fy,1 = 0.5, for outdoor events Fy,2 = 1. This resulted in a probability per hour that a person is infected given a contagious person in a distance of 1.5m (A1 = 6.376 10−3 per hour) or 10m (A2 = 0.986 10−3 per hour).
Data used for the least-squares regression of A1 and A2 based on RIVM and survey data on the number of contacts/hour within a certain contact category, per setting.
The complete role of virus-laden droplet and aerosol transmission is poorly understood (11). Table 3 shows sensitivity analyses showing that the impact of other choices in the distances of 1,5m and 10m or for Fy is limited (given a prevalence of 0.5%). For the reference situation C1(for a radius of 1.5m) is 5 contacts per hour for the low contact event and 12.5 for the high contact event. C2 (with a radius of 10m) is 10 contacts for the low contact event and 30 for the high contact event. The sensitivity analysis shows that the impact on the results is within a bandwidth of a factor 0.5.
Sensitivity analyses for ventilation and distance for contact classes. FV, FT and FM are all 0
3 Validation of the model
The model determines an average risk for infection. The data concerning SARS-CoV-2 infections gathered at the test events can be used to validate the model. The model outcome is based on a skewed probability distribution. For example, consider a large-scale event where 1.000 people will join the event given a prevalence of 0.75%. Without pre-testing 7.5 contagious persons would have attended the event, when FT = 0.95 on average 0.38 person would have been contagious at the event and would thus be a source. Also the number of contacts per individual person will vary or some will have many close contacts and others will see only a few people. Therefore, it is expected that many events will be organized with no or limited number of infections, and a few with many infections.
A model validation would need a large dataset during the SARS-CoV-2 pandemic. This large dataset is expected to cover the skewed probability distribution including events with no infections and events with many infections. Such a database, however, is not available. Data from media and literature of superspreader events are biased as these always attract more attention. For loss of life modelling for natural hazards the limited availability of data also leads to difficulties for conducting model validation. For example, the loss of life models for river and storm surge flooding in The Netherlands is based on the 1953 flood and Katrina in the US (12, 13). However, despite the limited validation, the model is still used to define the safety standards for Dutch levees which implies an investment program of multiple billions of euros (14).
Although a perfect validation is not possible, the available data can be used for a first validation of the model.
2.3.1 Internal validation: reproduction of infections at different settings
First, the performance of the model can be checked by the reproduction of infections for the settings at work, visitors and leisure time. Given the value for A1 and A2, the prevalence and contacts C1 and C2 (data for at home is not gathered in the survey) and ventilation, the number of infections is estimated (see figure 1). The infections “at work” are overestimated in the model, while infections in the setting of “at home with visitors” are underestimated. An explanation can be that people received more visitors than they admitted to in the survey or than allowed within the prevailing COVID-19-rules (two visitors per day until mid-October, and one per day afterwards). Overall, we concluded that the outcomes support the results of the model.
Number of infections based on internal validation by reproduction of infections at work, visitor at home and leisure.
2.3.2. External validation: test-events
The test-events organized by Fieldlab phase 1 can be used for a second validation. The test events occurred while the prevalence was high and a lockdown was still in place. These test events have been organized to measure C1 en C2 given the implemented measures. Very short, “passing” contacts of less than 10 seconds were not taken into account because these “passing contacts” are assumed not to be significant with regard to transmission. At the test events, the generic measures for social distancing were not in place. Instead, so called “Fieldlab measures” were taken, such as the separation of groups of people in bubbles. During the test events, several packages consisting of variation in occupation rate, catering, routing, instructions on when to wear masks and indoor/outdoor situations have been tested.
Persons with COVID-19 (like) symptoms were banned from participation. All visitors and crew needed a negative PCR test taken within 48 hours before the event in order to attend. As the PCR test may pick up low viral loads such as in cases of persons who recently recovered from COVID-19, the ratio of positive tests is higher than the ratio of asymptomatic people only.
All Fieldlab participants and crew were asked to get tested on day five after the event, a request that was followed by more than 80%. In addition, all positive cases related to a Fieldlab event, identified by the regional Health Care Services (GGD) were included in the data set.
Infections identified after an event, included cases infected just before or after the pre-test or had a PCR test around the cut-off of the PCR, thereby varying in outcome. People can be infected at the event, but also at other locations.
The input for the model is defined as:
While in the post-event test results, the crew is also taken into account, the calculated risk for infections applies to visitors only.
FT = 0,95, was set based on expert judgment and the sensitivity of a PCR test (15), combined with the restriction of testing within 48 hours before the event.
Ventilation of all indoor places is according to the building codes, Fy,O = 0, all locations have been checked in advance. For all outdoor events we took into consideration that festival tents, and roofs at a stadium are not completely outdoor, we assumed Fy,2 = 0.9.
The effectivity of masks was estimated by medical experts. While the effectivity under in-vitro conditions can be high (16), a low estimate was used for their effectiveness during in-vivo events, in order to consider the impact of how masks are used. During events, masks were not used while eating or drinking, not while being stationary, were inadequately used, ill-fitting or not used at all. Therefore, we set FM = 0,05 when the mask is only used when people are seated, and FM = 0.1 when the mask is also used while people move. During type 2 and 4 events, FM was set at 0, as masks compliance was extremely low to non-existent.
In Table 4 the results of the test events have been summarized per type of event and compared to the model results. The detailed results of all events and measures are described elsewhere (17).
Comparison model results with realization at test events
The risk at these locations or settings can, given the actual or expected prevalence be used to define a threshold to decide about the permission for events and the required measures. In figure 2, such an example is given. The figure shows the expected number of infections per 100.000 people per hour as a function of the prevalence and possible measures. Given a threshold of, for example 1 or 2 acceptable infections per hour per 100.000, the model can be used to design the events in such a way that the risk is acceptable.
Risk of infections as a function of the prevalence for different packages of measures, related to a possible reference.
Depending on the package of measures (as testing, maximum occupation rate, ventilation) the risk at an event without social distancing can be reduced to a risk which is equal or less while staying at home during the lockdown in the Netherlands. A second comparison can be made between the risk at an event and having visitors at home. The risk of visitors at home is about four times higher than the risk at home.
4. Discussion
The number (and risk of) infections at the event are estimated with the model based on the prevalence and measures at the test-events. The model results can be compared with the confirmed infections after the events.
As the number of events was limited related to the expected probability distribution, the data analysis would have profited from a higher number of events. Still, the data can be used to check if the model is plausible. We believe that we included nearly all infected people at the event because we combined the information of regular testing procedures and the after-event tests.
In the model, we assumed an average prevalence for the Netherlands. Our assumptions were actually corroborated by the pre-event test results, which were in the range of the nationally reported prevalence of SARS-CoV-2. However, it might be that the prevalence among the visitors was higher than the assumed average. A first argument is that more young people attended the test events, and these age groups contribute relatively more than elderly (as > 60 years old) to the positive PCR tests (18). A second argument is that the test events were during the lockdown, and the non risk-averse people which attend these events also might have more other activities. Because of these other activities it can be expected that the source of infections for some of the cases identified after the event may be unrelated to the event.
At day 5 after the event, visitors and crew were tested, but our model exclusively calculates the risk for visitors, as contact data of the crew were not measured. During the events, the crew attempted to keep their distance from the participants, and wore masks continuously.
In general, we found more PCR-positive people in the pre-event than in the post-event testing. As the PCR test may pick-up low viral loads, some of the people testing positive, especially in the pre-tests group, may have been recovered from COVID-19, consequently resulting in a higher positive test ratio. This in part explains the positivity rate in the pre- and post-event testing. However, in the after test, also people with COVID-19 symptoms were included. Only for (one of) thetype 4 events, the ratio of positive tests in the after test was higher than in the pre-test. This corresponds with higher numbers of infections at the event.
In the post-event tests, 14 cases have been identified where persons were possibly infected at the events. Other positive tests were excluded during interviews as participants had known contacts with SARS-CoV-2 positive cases at their home or unrelated events around the same time. Even the 14 cases are only “possible” cases, as infections could have occurred in other places and situations at any time from around the pre-test, the day of the event, or even the after the event. If we use the Dutch average infection risk during the event as a control group we can estimate the number of infections which could be expected outside the event. The time spent at the event is about 4-10% of the time a period of, for example, three days where people were at different locations exposed to the risk of infection. If we also consider the risk at other locations, one or two of the possible infections could be (on average) related to the events.
During the eight test-events, four persons were confirmed to be infected during the event. Confirmed infections are those infections which can be related to each other for example by sequencing or because of proven contacts with other positive tested people during the event. Two infections occurred while travelling home with a contagious person, which caused two infections at the type IV event. These are not infected at the event (and part of the model) but these are related to the event.
A last remark about the model can be made with regard to the data which is used to estimate A1 and A2. The contacts used to train the model were gathered during a period of a (partial) lockdown. Large scale events were already prohibited or regulated. This could cause an underestimation of the risk in dynamic settings.
Based on the available but scarce information, the current risk taxation model results in plausible results, the model cannot be rejected as being invalid. The test events show that the Fieldlab measures at the events, which replace generic social distancing, reduce the risk to a level below the threshold level (which was the risk at home). The model can be extended with the risk for loss of life and hospitalization using the relation with the age of people. However, if more data is available, the model can be improved because of the larger training set. If more data of especially type IV and maybe type II events are available, research can be done on an additional factor for increased transmission at dynamic events. Therefore, it is recommended to collect more data about infections and events while the COVID-19 pandemic continues.
Data Availability
All data produced in the present work are contained in the manuscript