## Abstract

Testing for SARS-CoV-2 in the United States is currently targeted to individuals whose symptoms and/or jobs place them at a high presumed risk of infection. An open question is, what is the share of infections that are undetected under current testing guidelines? To answer this question, we turn to COVID-19 testing data from Iceland. The criteria for testing within the Icelandic medical system, processed by the National University Hospital of Iceland (NUHI), have also been targeted at high-risk individuals, but additionally most Icelanders qualify for voluntary testing through the biopharmaceutical company deCODE genetics. We use results from Iceland’s two testing programs to estimate the share of infections that are undetected under standard (NUHI) testing guidelines. Because of complications in the deCODE testing regime, it is not possible to estimate a single value for this this undetected rate; however, a range can be estimated. Our primary estimates for the fraction of infections that are undetected range from 88.7% to 93.6%.

## COVID-19 Testing in Iceland

Iceland has conducted one of the most extensive testing regimes in the world on a per-capita basis. The first SARS-CoV-2 test was administered on January 31, and the first case was confirmed on February 27. Almost 21,000 Icelanders—about 6 percent of the island nation—have been tested for SARS-CoV-2 as of April 2.

Guðbjartsson et al (2020) describes the testing programs in Iceland in detail. In brief, Iceland has two ongoing testing programs. The first is through the regular healthcare system, which targets individuals exhibiting severe symptoms (cough, fever, muscle ache, and shortness of breath) and/or at high risk of infection because of close contact with a diagnosed individual or having returned from “high-risk areas” defined by the Directorate of Health. On March 19, the Directorate of Health broadened the criteria to include anyone returning from abroad and any symptomatic individual.^{1} Samples are taken at local clinics or the National University Hospital of Iceland (NUHI) and analyzed by the hospital’s microbiology department.

The second is a free voluntary program, with testing done by the biopharmaceutical company deCODE Genetics. The deCODE tests, which began on March 13, are open to individuals who have no or few symptoms, are not under quarantine, and have not been tested through NUHI. Individuals are placed under quarantine if they have been in close contact with someone diagnosed with COVID-19 or have been traveling, initially in high-risk areas—mostly ski areas in Italy and Austria—and later anywhere abroad. Quarantines last 14 days or until the individual develop symptoms and are tested positive for SARS-CoV-2 by NUHI.^{2} At its peak, about 10,000 Icelanders were under quarantine.

The presence of these two testing programs makes it possible to draw inferences about the rate of infections in Iceland that are undetected under the NUHI testing guidelines. Because the NUHI testing guidelines are similar to those recommended by the Centers for Disease Control and Prevention (CDC) in the United States,^{3} an estimate of the rate of undetected infections can inform epidemiological modeling in the United States. Estimates from the Icelandic population complement other estimates of the rate of undetected cases among the infected (Li et. al. (2020), Mizumoto et. al. (2020), Nishiura et. al. (2020), Russell et al. (2020); see Qui (2020)).

## Methods

We are interested in the fraction of the infected who are not eligible for NUHI testing (the “undetected rate”). Expressed as a probability, this rate is Pr(*NE* = 0<*I* = 1), where *NE* = 1 if the individual is eligible for NUHI testing (NUHI-eligible) and 0 otherwise (NUHI-ineligible), and *I* = 1 if the individual is infected with SARS-CoV-2 and 0 otherwise. Denote this rate by *θ*. Using Bayes Law, the undetected rate is related to the rates of infection among the NUHI-eligible and the NUHI-ineligible:
where *f* ^{NE} = Pr (*I* = 1|*NE* = 1) is the positive testing probability among the NUHI-eligible and *f* ^{NI} = Pr (*I* = 1|*NE* = 0) is the positive-testing probability among the NUHI-ineligible. Note that the denominator in (1) is the total infection rate.

If the (a) fraction of the population that were NUHI-eligible on a given day were known and if (b) deCODE testing was a random sample of the NUHI-ineligible, then the first expression in (1) could be evaluated directly using data on the NUHI positive testing rate, Pr(*I* = 1|*NE* = 1), and the rate of infection among the NUHI-ineligible, Pr(*I* = 1|*NE* = 0), could be estimated from the deCODE testing data. Neither condition (a) nor (b) hold, however, and we address these issues in turn.

The source of problem (a) is that, while the number of NUHI tests on a given day is known, the number of individuals who would qualify for a NUHI test is not. If all those who qualify for a NUHI test are referred to NUHI for testing, then they will get tested at some point, but on any given day only some of those eligible are tested.

Our first approach to addressing the NUHI stock/flow issue (a) is to estimate the probability of NUHI-eligibility, Pr (*NE* = 1), as the fraction of the Icelandic population tested during an 8-day window, where the 8 days aligns with the data window reported in Guðbjartsson et al (2020). We refer to this as the “eligibility window” method.

Our second approach to addressing problem (a), which we refer to as the “odds ratio” method, is to use the laws of probability and algebraic manipulation to eliminate the probability of NUHI-eligibility from the expressions above. Doing so, however, introduces a new parameter, *λ*_{0} = Pr (*NE* = 1|*I* = 0), which is the baseline rate of cough/cold/flu symptoms that are severe enough to qualify for a NUHI test, even though the individual turns out not to be infected. This baseline symptomatic rate can be estimated using historical data on flu cases in Iceland. It is shown in the appendix that the Bayes Law expression (1) and its counterpart for the NUHI-eligible imply,

Equation (2) express the undetected rate, *θ*, in terms of *f*^{NE}, *f*^{NI}, and *λ*_{0}, so if those three terms can be estimated, then *θ* can be estimated using (2).

The term *f*^{NE} can estimated directly from NUHI testing data, and the term *λ*_{0} can be estimated by the proportion of Icelanders who see a doctor about flu-like symptoms, see Appendix Section B for more details.

Both the eligibility window method (Equation (1) and the odds ratio method (Equation (2)) require an estimate of *f* ^{NI}, the fraction of infections among the NUHI-ineligible. If the deCODE sample were a random sample of the NUHI-ineligible population, this would be estimated by the NUHI positive testing rate. However, the deCODE sample has two complications: it was voluntary, and those in quarantine were excluded. The next two sections discuss these issues turn, and show how the deCODE positive testing rate can be used to bound *f*^{NI}that accounts for the complications in the deCODE testing sample. For the eligibility window method, we thereby estimate a lower bound (a conservative estimate of *θ*), and for the odds ratio method we are able to estimate both a lower and upper bound.

### Selection into deCODE testing

Because deCODE testing was voluntary during the time period analyzed here, it could overrepresent those who suspect they have the virus but are NUHI-ineligible. Approximately 44% of individuals tested by deCODE exhibit cold/flu symptoms (Guðbjartsson et al, 2020, Table 1), much higher than the historical prevalence of cold/flu symptoms in Iceland. We will refer to these 44% as “mildly symptomatic.” As a result, the deCODE positive testing rate could overestimate the true fraction of infected among the NUHI-ineligible.

To address this problem, note that *f*^{NI}is a weighted average of the infection rates among the mildly symptomatic and the asymptomatic. Under the assumption that the asymptomatic and mildly symptomatic deCODE subsamples are randomly drawn from the asymptomatic and mildly symptomatic populations, respectively, then *f*^{NI} can be estimated as the weighted average of these two (known) deCODE positive testing rates. Although the rate of mild symptoms among the NUHI-ineligible is not observed, we bound it below by *λ*_{0} (the rate of severe flu symptoms in a normal March) and bound it above by the 44% of mild symptoms among the deCODE testing volunteers. See Appendix Section B for details.

### Exclusion of quarantined individuals

Individuals in quarantine were excluded from deCODE testing and while some of the quarantined qualified for NUHI testing, some did not. Infections among those quarantined but NUHI-ineligible are therefore not detected by either the NUHI or deCODE testing. Under the assumption that the infection rate among the NUHI-ineligible quarantined does not exceed the infection rate of among NUHI-eligible, it is possible to use the laws of probability to bound the overall rate of infection among the NUHI-ineligible in terms of the infection rate among the NUHI-eligible and the deCODE rate (infections among the NUHI-ineligible who are not in quarantine). See Appendix Section B for details.

## Data

We use data released by Icelandic public health authorities on the COVID-19 pandemic and on normal flu seasons.

The Icelandic Directorate of Health releases daily counts of new confirmed cases and tests conducted on its COVID-19 information website.^{4} We use these data where possible. However, infection rates for individuals with no vs. mild symptoms are only available in the results by Guðbjartsson et al (2020), who report infection and test counts by symptom status for deCODE testing between March 13 and 19. We therefore focus on this time period when calculating parameters for the bounds. Only the most updated quarantine counts are available on the COVID-19 website, so we use daily announcements by the Icelandic Department of Civil Protection and Emergency Management to find past quarantine totals in the bound analysis.

To estimate the underlying prevalence of mild and severe flu-like symptoms, we use data from the Directorate of Health’s website on influenza containing weekly counts of individuals who come to clinics or emergency rooms with flu-like symptoms for every flu season from the 2009-10 through the 2017-18 flu season (the most recent season available).^{5}

Appendix Section B contains the details on the data and construction of estimates.

## Results

The results are summarized in Table 2. The primary estimate of the lower bound using the eligibility window method uses the infection rate among the asymptomatic from the deCODE tests and assumes that none of the NUHI-ineligible quarantined are infected; this yields an undetected rate of 88.7% (95% confidence interval 83.9% to 93.5%). The second line re-estimates the lower bound using the overall deCODE positive testing rate, which includes the mildly symptomatic; because the mildly symptomatic have a somewhat higher deCODE positive testing rate than the asymptomatic, the estimate of the population infection rate is somewhat larger and the undetected fraction is somewhat larger, 92.5%. As discussed, the NUHI testing eligibility guidelines were broadened after March 19, and using data postdating this broadening results in slightly lower estimates of the undetected rate, approximately 85%.

For the odds ratio estimate, we estimate *λ*_{0} by the maximum weekly caseload for flu-like symptoms in the Icelandic medical system for all weeks from 2009 to 2017. This highest-caseload week occurred in November 2009 at the peak of the H1N1 pandemic and was 0.63%. The resulting estimated range is consistent with the estimates from the eligibility window method, with a range of 88.9% to 93.6%.

## Discussion

The fraction of infections that are undetected under standard current testing guidelines is differs from the asymptomatic rate, which is the fraction of infected who are asymptomatic. Because the NUHI-ineligible include those who are mildly symptomatic, it is not surprising that the estimated nondetected rate exceeds estimates of the asymptomatic rate in the literature (Mizumoto et. al. (2020), Nishiura et. al. (2020)). Our estimate of the undetected rate is comparable to Li et. al (2020)’s 86% estimate for Wuhan.

One threat to the validity of these estimates is that they assume that the deCODE testing of the asymptomatic is a reliable estimate of the rate of infection among the asymptomatic in the Icelandic population. Because deCODE test recipients did not meet the quarantine requirements (returning from high-risk countries or close contact with a confirmed COVID-19 individual), this assumption seems plausible; it cannot be evaluated with existing data, however. Data to provide unbiased estimates of the undetected rate, and the overall rate of infection in the population, need to come from random testing. Such trials are commencing in Iceland, Germany, and Norway. If a random testing design excludes those eligible for tests under medically-based guidelines and/or those in quarantine, the combined data sets could be analyzed using the methods laid out here.

## Data Availability

All data are publicly available. We use counts of tests and confirmed COVID-19 infections from the Icelandic Health Directorate's COVID-19 information website. We also use counts of cases reporting flu-like symptoms from the influenza website of the country's Health Directorate. We use quarantine numbers from the Department of Civil Defense's announcement page.

## Appendix

### A. Derivations

*Derivation of Equation* (2).

Let *P*(*NE* = 1<*I* = 0) = *λ*_{0} and *θ* = *P*(*NE* = 0<*I* = 1). By Bayes Law,
and

Taking the ratio of odds ratios, we obtain that which yields and by rearranging,

*Bounds for θ*.

The bounds for *θ* arise because of the overrepresentation of mildly symptomatic in the (voluntary) deCODE sample and the exclusion of quarantined individuals from the deCODE sample, some of whom are not NUHI-eligible. The available information, however, provides bounds on *f*^{NI}. We make the following assumptions:

*λ*0≤*P*(*C*=1|*Q*=0,*NE*=0)≤*P*(*C*=1|*deCODE volunteer*)*P*(*I*=1|*NE*=0,*Q*=1)≤*P*(*I*=1|*NE*=1)

where *C* = 1 indicates whether an individual has mild symptoms and *Q* = 1 indicates whether an individual is in quarantine. Note that (*Q* = 0, *NE* = 0) corresponds to deCODE eligibility.

By the law of total probability, and

Note that *p*^{C} ≡ *P*(*I* = 1<*NE* = 0, *Q* = 0, *C* = 1) and *p*^{A} ≡ *P*(*I* = 1<*NE* = 0, *Q* = 0, *C* = 0) are identified from Guðbjartsson et al. (2020) and are estimated by the deCODE positive testing rates for the symptomatic and asymptomatic, respectively.

We consider each term individually. Bayes Law and the fact that 0 ≤ *P*(*NE* = 0<*Q* = 1) ≤ 1 yields that
and
by the complement rule. Next, Assumption 1 generates
and

Combining these results with assumption 2, the implied bounds for *f*^{NI} are
and

Evaluating (2) at the extremes of the range for *f*^{NI} and taking the resulting extremes for *θ* yields the identified set for *θ*.

### B. Data Construction

See the text and Appendix A for definitions of the various terms.

Population: Statistics Iceland estimates that the population of Iceland was 364,134 on January 1, 2020.

^{6}*f*^{NE}: From the Directorate of Health’s counts, we calculate the total count of NUHI infected cases and divide by the total count of NUHI tests from March 13 through March 19, to match the deCODE testing dates.*λ*_{0}: We use Directorate of Health weekly influenza count data to estimate*λ*_{0}by the greatest one-week fraction of Icelanders reporting flu symptoms.^{7}This period occurred in November 2009 during the H1N1 pandemic and was 0.63%. To bound the proportion mildly symptomatic, we use a lower value of*λ*_{0}which is the average rate of reported flu symptoms in the two-week period around March 15 from 2011-2018, which is 0.17%.*P*(*I*<*C*= 1,*NE*= 0,*Q*= 0),*P*(*I*<*C*= 0,*NE*= 0,*Q*= 0): Under the assumption that deCODE randomly tests*within*the deCODE-eligible symptomatic and asymptomatic groups, then these terms are estimated by the deCODE positive testing rates within these two groups as reported in Table 1 of Guðbjartsson et al (2020) for deCODE testing from March 13 through March 19.*P*(*C*= 1<*NE*= 0,*Q*= 0): As a lower bound, we use the lower bound of*λ*_{0}of 0.17%. As an upper bound, we use the proportion symptomatic from Guðbjartsson et al (2020), 44%, which reflects the assumption that mildly symptomatic individuals will be oversampled in the voluntary deCODE testing.*P*(*Q*= 1): The COVID-19 website only reports the most updated count of individuals under quarantine, but we require the proportion not quarantined*P*(*Q*= 0) at the time of testing. We use the number of quarantined individuals on March 19, the last day of deCODE testing reported in Guðbjartsson et al (2020), from the report of the Department of Civil Defense on March 20.^{8}*P*(*NE*= 1) (for use in bounds calculation, odds ratio approach): We sum the number of NUHI tests from March 13 through March 19 from the Health Directorate count data to match the timing of the deCODE testing and the quarantine count figure in the bound analysis, then divide this sum by the population of Iceland.

## Footnotes

↵

^{1}https://www.landlaeknir.is/um-embaettid/greinar/grein/item39194/Skilgreind-ahaettusvaedi---Defined-high-risk-areas↵

^{2}Individuals with a positive test, whether quarantined or not at the time of testing, must enter a 10-day isolation with more stringent restrictions than those for quarantine. See https://www.landlaeknir.is/servlet/file/store93/item39065/GA.%20Instructions%20for%20the%20general%20public%20on%20home-based%20quarantine%2002.04.2020.pdf and https://www.landlaeknir.is/servlet/file/store93/item39066/KSJ%20Instructions%20for%20the%20general%20public%20on%20home-based%20isolation%2004042020.pdf.↵

^{3}https://www.cdc.gov/coronavirus/2019-nCoV/hcp/clinical-criteria.html↵

^{4}English: https://www.covid.is/data Icelandic: https://www.covid.is/tolulegar-upplysingar↵

^{5}Only in Icelandic: https://www.landlaeknir.is/servlet/file/store93/item32967/influensulik.einkenni.uppfaersla.a.vef.okt2017.AG.xls_%C3%A1n%20myndar%20og%20aldursdreifingu.xls↵

^{6}https://px.hagstofa.is/pxen/pxweb/en/Ibuar/Ibuar_mannfjoldi_1_yfirlit_yfirlit_mannfjolda/MAN00000.px/table/tableViewLayout1/?rxid=cdf68733-56cd-41c8-8801-d73ca7ded1cd↵

^{7}In Icelandic: https://www.landlaeknir.is/servlet/file/store93/item32967/influensulik.einkenni.uppfaersla.a.vef.okt2017.AG.xls_%C3%A1n%20myndar%20og%20aldursdreifingu.xls↵

^{8}In Icelandic: https://www.almannavarnir.is/utgefid-efni/stoduskyrsla-koronaveira-covid-19-20032020/?wpdmdl=24791