Abstract
Background The outbreak of COVID-19 caused by a novel coronavirus (termed SARS-CoV-2) has spread to over 120 countries around the world. Currently, real-time quantitative PCR (RT-qPCR) is used as the gold standard for the diagnosis of SARS-CoV-2 infection. However, the positive rate of RT-qPCR assays of pharyngeal swab samples is reported to be 30-60%. More accurate and sensitive methods are urgently needed.
Method We established a digital PCR (dPCR) protocol to detect SARS-CoV-2 on 194 clinical pharyngeal swab samples, including 103 suspected patients, 75 close contacts and 16 supposed convalescents.
Results The limits of blank (LoBs) of the dPCR assays are about 1.6, 1.6 and 0.8 copies/reaction for the ORF1ab, N and E genes. The limit of detection (LoD) is 2 copies/reaction. The overall accuracy of dPCR is 95.5%. For the febrile suspected patients, the accuracy of SARS-CoV-2 detection was significantly improved from 28.2% to 87.4% by dPCR. For close contacts, the suspect rate was greatly decreased from 21% to 1%. In addition, quantification of the virus load for convalescents by dPCR showed that a longer observation in the hospital is needed for aged patients.
Conclusion dPCR could be a confirmatory method for suspected patients diagnosed by RT-qPCR. Furthermore, dPCR is more sensitive and suitable for low virus load specimens from both patients under isolation and those under observation who may not be exhibiting clinical symptoms.
1 Introduction
In late December 2019, a number of pneumonia cases were reported in Wuhan, Hubei Province, China. The disease was officially named Coronavirus disease (COVID-19) by the World Health Organization (WHO) and had spread to 129 countries around the world as of March 14, 2020 (1, 2). The pathogen causing the outbreak was identified as a novel coronavirus (termed SARS-CoV-2), belonging to the family Coronaviridae, order Nidovirales, all of which are enveloped, non-segmented positive-sense RNA viruses (3, 4). According to the WHO and the Chinese Center for Disease Control and Prevention (CDC), the current gold standard for the diagnosis of SARS-CoV-2 infection is real-time fluorescent quantitative PCR (RT-qPCR). However, RT-qPCR is reported to have low positive rates for throat swab samples (5), and 3% of patients had negative RT-qPCR test results at initial presentation while chest CT indicated typical features of viral pneumonia (6). To identify and hospitalize COVID-19 patients in time, more sensitive and accurate tests are required.
Digital PCR (dPCR) is a technology that partitions nucleic acid molecules across a large number of smaller reactions and acquires amplification data from each partition at the end point based on fluorescence intensity (7-9). Quantification is performed by applying Poisson statistics to the proportion of negative partitions to account for positive partitions that initially contained more than one target molecule. dPCR can offer greater precision than qPCR and is far simpler to use for copy number quantification due to the binary nature in which partitions are counted as positive or negative. Additionally, dPCR is more tolerant of PCR inhibition than qPCR because of partitioning and because it is an end-point measurement and consequently less dependent on high PCR efficiency (10, 11).
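The Poisson correction described above can be illustrated with a minimal Python sketch. The function names and the droplet counts are hypothetical and are not taken from this study; the sketch only shows how the mean copy number per partition follows from the fraction of negative partitions.

```python
import math


def copies_per_partition(n_positive: int, n_total: int) -> float:
    """Mean target copies per partition (lambda) from the negative fraction.

    Under Poisson statistics the probability that a partition is empty is
    exp(-lambda), so lambda = -ln(negative fraction).
    """
    if n_total <= 0 or not 0 <= n_positive < n_total:
        raise ValueError("need 0 <= n_positive < n_total")
    negative_fraction = (n_total - n_positive) / n_total
    return -math.log(negative_fraction)


def copies_in_read_partitions(n_positive: int, n_total: int) -> float:
    """Estimated number of target copies across all partitions that were read."""
    return copies_per_partition(n_positive, n_total) * n_total


# Hypothetical example: 150 positive droplets out of 15000 accepted droplets.
lam = copies_per_partition(150, 15000)
estimate = copies_in_read_partitions(150, 15000)
print(f"lambda = {lam:.5f}, estimated copies = {estimate:.1f}")
```

The estimate is slightly larger than the raw count of positive droplets because some positive droplets are expected to hold more than one molecule.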
In this study, we established a one-step RT-dPCR method for detection of the open reading frame 1ab (ORF1ab), nucleocapsid protein (N) and envelope protein (E) genes of SARS-CoV-2. Moreover, we compared RT-qPCR and RT-dPCR on 194 clinical samples and found that RT-dPCR can significantly improve the sensitivity and accuracy of COVID-19 diagnostics.
2 Materials and methods
2.1 Ethics statement
Data collection of cases and close contacts was determined by the National Health Commission of the People’s Republic of China to be part of a continuing public health outbreak investigation and was thus considered exempt from institutional review board approval. The analysis was performed on existing samples collected during standard diagnostic tests, posing no extra burden to patients.
2.2 Clinical samples
Respiratory samples were obtained during February and March 2020 from hospitalized patients or close contacts tested by Beijing CDC (BJCDC), Wuhan CDC (WHCDC) and a government-designated clinical test laboratory (Wuhan considering laboratory for medical test, KXR). RNA was extracted from clinical specimens using the MagMAX-96 viral RNA isolation kit (Thermo Fisher Scientific). The typed avian influenza virus RNAs (A/H3N2 virus and influenza B/Victoria virus) were available at Wuhan CDC. RNA extracts containing human coronaviruses HCoV-229E and HCoV-OC43, provided by BJCDC, were tested in all three assays.
2.3 One step reverse transcription dPCR
The primer and probe sequences for detecting the N and ORF1ab gene targets of SARS-CoV-2 published by the Chinese Center for Disease Control and Prevention (CDC) were used in this study (12). For detecting the E gene target, the primers and probe recommended by the World Health Organization (WHO) were used (13). The 20 μL reaction mixture comprised 5 μL of One-Step RT-ddPCR Supermix, 2 μL of reverse transcriptase, 1 μL of 300 mM DTT, 1 μL of primer and probe mixture and 11 μL of RNA template. Each reaction mix was converted to droplets using the QX200 Droplet Generator (Bio-Rad, USA), transferred to a 96-well plate, heat sealed and amplified in a GeneAmp System 9700 thermal cycler (Applied Biosystems, USA). The thermal cycling conditions were as follows: 45 °C for 10 min (reverse transcription); 95 °C for 5 min; and 40 cycles of 95 °C for 15 s and 58 °C for 30 s. The cycled plate was then transferred to the QX200 Droplet Reader (Bio-Rad, USA) and analyzed using the QuantaSoft droplet reader software (V1.7.4, Bio-Rad, USA).
2.4 Limit of blank (LoB) and limit of detection (LoD) of dPCR
To establish the limit of blank (LoB) (14), 60 blank measurements were obtained from 3 blank mutant samples on three days. To determine the limit of detection (LoD), 70 to 76 measurements from 4 samples with low concentration (1 to 3 copies/reaction) were used according to CLSI guideline EP17-A (15).
2.5 RT-qPCR
Three different commercial RT-qPCR kits (Huirui from Shanghai Huirui Biotechnology Co., Ltd, BioGerm from Shanghai BioGerm Medical Biotechnology and Daan from Daan Gene Co., Ltd) were used for the detection. Briefly, a 25 μL reaction containing 7.5 μL of PCR reaction buffer, 5 μL of primer and probe mixture and 5-11 μL of RNA was prepared. Thermal cycling was performed at 50 °C for 15 min for reverse transcription, followed by 95 °C for 5 min and then 45 cycles of 95 °C for 10 s and 55 °C for 45 s in an ABI 7500 RT-PCR thermocycler.
3 Results
3.1 Dynamic Range of the dPCR assay
The linear range was investigated by varying the mean copy number per droplet, denoted as λ (16). The precision or relative error of dPCR is related to λ because dPCR relies on Poisson statistics to account for droplets containing multiple molecules (17). The upper limit of linearity was 7.8 copies/partition, tested with the N gene assay. To determine the lower limit of all three assays, serial dilutions of each RNA transcript were prepared (Table S-1). The measured targets matched the anticipated values in each tested interval. Good linearity (0.93 < slope < 1.02, R² ≥ 0.9997) between the measured RNA target and the prepared value was observed over the range from approximately 10⁴ to 10⁰ copies/reaction (Fig. 1). Reactions containing a mean of 60 E, 66 N or 11 ORF1ab copies fulfilled the criterion for an LoQ with a CV lower than 25%.
3.2 Establishment of LoB and LoD for dPCR assay
Sixty blank measurements obtained from 6 blank samples on five days were analyzed to determine the LoB. As the distribution of the 60 blank measurements is skewed (Figure S-1), the LoB was estimated nonparametrically as the 95th percentile of the measurements. The 15 highest blank values for each target are displayed in Table S-2. The 95th percentile corresponds to the 57.5th ordered observation (= 60 × 0.95 + 0.5) (15). Linear interpolation between the 57th and 58th observations yields LoB estimates of 1.6, 1.6 and 0.8 copies/reaction for E, ORF1ab and N, respectively.
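The nonparametric LoB estimate can be sketched in Python as follows. This is a minimal illustration under the assumption that the rank is computed as n × 0.95 + 0.5 (57.5 for 60 blanks) and that fractional ranks are resolved by linear interpolation between neighboring ordered values; the function name and the blank values are hypothetical and are not the measurements of Table S-2.

```python
def nonparametric_lob(blanks, percent=95):
    """Percentile of blank results at rank n * percent / 100 + 0.5 (1-based)."""
    ordered = sorted(blanks)
    n = len(ordered)
    if n == 0:
        raise ValueError("no blank measurements")
    rank = n * percent / 100 + 0.5      # 57.5 for n = 60 and percent = 95
    lower = int(rank)                   # 1-based index of the lower neighbor
    fraction = rank - lower
    if lower < 1:
        return ordered[0]
    if lower >= n:
        return ordered[-1]
    low_value = ordered[lower - 1]
    high_value = ordered[lower]
    # Linear interpolation between the two neighboring ordered observations.
    return low_value + fraction * (high_value - low_value)


# Hypothetical blanks: 56 results of zero plus four low non-zero results.
blanks = [0.0] * 56 + [0.8, 1.2, 2.0, 2.4]
print(round(nonparametric_lob(blanks), 2))
```

With these made-up values the 57th and 58th ordered observations are 0.8 and 1.2, so the interpolated estimate lies halfway between them.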
To determine the LoD of the ORF1ab gene assay, 76 measurements were performed on five samples in 3 different runs on three different days to ensure that the total assay variation was reflected. The distribution of the 76 measurement results from the low-concentration samples is not Gaussian (Fig. S2A), so nonparametric statistics were used according to guideline EP17-A. Consequently, the LoD was determined to be 2 copies/reaction, the lowest level of material at which the β-percentile is 5%.
To determine the LoD of the N and E assays, 83 measurements of the E assay on 5 samples and 71 measurements of the N assay on 4 samples were performed in 4 different runs. As with the ORF1ab gene, the distributions of the 71 measurements for the N gene and the 83 measurements for the E gene are not Gaussian (Fig. S2B and S2C), so nonparametric statistics were used. Consequently, the LoD was determined to be 2 copies/reaction.
3.3 Specificity testing
The specificity of the assays for the ORF1ab and E genes was tested in a previous report. To further validate the specificity of all assays, influenza virus samples were collected. All assays were tested on human clinical samples at Wuhan CDC and the National Institute of Metrology, China. All tests returned negative results (Table S3).
3.4 Comparison between RT-qPCR and RT-dPCR on febrile suspected patients
A total of 103 pharyngeal swabs were collected from febrile patients suspected of SARS-CoV-2 infection; the relevant information is listed in Table 1. Among the 103 specimens, 81 (P1 to P81) were tested at KXR with the H&R qPCR kit and 7 (P82 to P88) were tested at WHCDC with the Daan qPCR kit. The criteria claimed by the H&R kit manufacturer are: Ct ≤ 35, positive; Ct > 39.2, negative; and 35 < Ct < 39.2, equivocal. The criteria of the Daan qPCR kit are: Ct > 40, negative; Ct ≤ 40, positive; and equivocal if only one gene has Ct ≤ 40 and the other gene shows no amplification. According to these criteria, 14 positive, 25 negative and 49 suspected SARS-CoV-2 infections were reported by qPCR.
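The H&R kit thresholds quoted above can be written as a small decision function. This is only a sketch for a single Ct value: the function name is hypothetical, and two cases not covered by the stated criteria are handled by assumption (a Ct of exactly 39.2 is treated as equivocal, and a missing Ct, meaning no amplification, is treated as negative).

```python
from typing import Optional


def classify_hr_ct(ct: Optional[float]) -> str:
    """Classify one Ct value using the H&R kit thresholds quoted in the text."""
    if ct is None:
        # Assumption: no amplification signal is reported as negative.
        return "negative"
    if ct <= 35:
        return "positive"
    if ct > 39.2:
        return "negative"
    # 35 < Ct < 39.2 is equivocal; Ct == 39.2 is placed here by assumption.
    return "equivocal"


for value in (30.0, 37.5, 40.0, None):
    print(value, classify_hr_ct(value))
```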
For dPCR, three targets are tested in parallel, and a positive result should meet the following criterion: the quantification of any one of the three gene targets is ≥ 2 copies/reaction. If no positive droplet is detected in the FAM channel but positive droplets are detected in the VIC channel, indicating that the human reference control RNase P is positive (18), the sample can be judged negative. If 0 < result < 2, the result should be regarded as equivocal and needs further checking. According to these criteria, 44 out of 49 suspects and 17 out of 25 negatives were corrected to positive by dPCR, and the positive rate increased significantly. No positive droplet was detected for 6 negatives, and copy numbers below the established LoD were quantified for 7 suspected infections, due to either no virus being sampled or an ultra-low virus load in these specimens.
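The dPCR decision criteria above can be summarized in a short Python sketch. The function name, the dictionary layout and the "invalid" outcome are assumptions made for illustration: the text does not say how a sample with neither target droplets nor a positive RNase P control is reported.

```python
LOD_COPIES_PER_REACTION = 2.0  # LoD established for all three assays


def classify_dpcr(copies_by_target: dict, reference_positive: bool) -> str:
    """Apply the three-target dPCR criteria to one sample.

    copies_by_target maps a target name (ORF1ab, N, E) to copies/reaction;
    reference_positive is True when the RNase P control shows positive
    droplets in the VIC channel.
    """
    highest = max(copies_by_target.values())
    if highest >= LOD_COPIES_PER_REACTION:
        return "positive"      # any one target at or above 2 copies/reaction
    if highest > 0:
        return "equivocal"     # 0 < result < 2, needs further checking
    if reference_positive:
        return "negative"      # no target droplets, human control detected
    # Assumption: without a positive reference control the run is not interpretable.
    return "invalid"


# Hypothetical sample with one target above the LoD.
print(classify_dpcr({"ORF1ab": 0.0, "N": 2.4, "E": 1.1}, True))
```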
Fifteen samples (P89 to P103) were tested at BJCDC with the BioGerm qPCR kit and the assays recommended by the Chinese CDC. Ct values were not available, and only negative or positive results were reported. A single positive gene target was taken to indicate SARS-CoV-2 positivity based on parallel tests with a commercial kit and the Chinese CDC assays. Therefore, these 15 samples were reported positive by BJCDC. Eight samples that were qPCR negative for ORF1ab tested positive by dPCR, showing the high sensitivity of dPCR for ORF1ab. Only 3 samples were negative for ORF1ab, and these could be complemented by the E gene target.
Among the 103 specimens, 29 positive, 25 negative and 49 suspected results were reported by RT-qPCR. However, 61 samples, including 17 negative and 44 suspected by qPCR, were confirmed to be positive by dPCR; thus, a total of 90 patients tested positive for SARS-CoV-2 nucleic acid and could be diagnosed with COVID-19. All 103 patients were confirmed to have SARS-CoV-2 infection according to a follow-up survey. The accuracy of SARS-CoV-2 detection was significantly improved from 28.2% to 87.4% (Fig. 2).
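The two accuracy figures follow directly from the counts reported in this section, as the short Python check below shows. The variable names are illustrative. Because all 103 patients were later confirmed as infected, accuracy here equals the fraction of patients reported positive.

```python
total_patients = 103

# qPCR positives: 14 from the H&R and Daan kits plus 15 reported by BJCDC.
qpcr_positive = 14 + 15
# Samples corrected to positive by dPCR: 17 qPCR negatives and 44 suspects.
dpcr_corrected = 17 + 44
dpcr_positive = qpcr_positive + dpcr_corrected

qpcr_accuracy = qpcr_positive / total_patients
dpcr_accuracy = dpcr_positive / total_patients

print(f"qPCR: {qpcr_positive}/{total_patients} = {qpcr_accuracy:.1%}")
print(f"dPCR: {dpcr_positive}/{total_patients} = {dpcr_accuracy:.1%}")
```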
3.5 Comparison on close contacts and convalescent
Seventy-five specimens were collected from contacts and close contacts. Forty-eight specimens from contacts were reported negative based on qPCR tests by BJCDC on Feb 6 and were confirmed by dPCR on Feb 7 (Table S4). According to a follow-up survey, all of them were in good health and isolation was lifted after 14 days.
Twenty-seven specimens (Table 2 and Fig. 3) were tested at WHCDC by qPCR with a kit from Daan Gene on March 2, 4 and 6. According to the qPCR results, 10 positive, one negative and 16 suspected cases were reported. It is very difficult to detect SARS-CoV-2 nucleic acid in close contacts because of the low virus load at the early stage. However, 15 out of 16 equivocal results and the one negative result were determined to be positive by dPCR. The suspect rate was significantly decreased from 21% to 1% according to dPCR. Apart from 5 patients who could not be tracked, the remaining 10 dPCR-positive contacts were confirmed as SARS-CoV-2 infected patients based on a follow-up survey.
Furthermore, among the 16 specimens corrected by dPCR, 6 persons (P14, P18-21 and P23 in Table S5) were directed to secondary testing following an initial negative test 2 to 10 days earlier. Based on the qPCR results, further isolation and observation were still needed, as the test result was suspect or negative and no clinical symptoms were observed. However, based on dPCR, all six patients could be diagnosed with COVID-19 and treatment could be started earlier. This indicates that dPCR is more sensitive and suitable for low virus load specimens from patients under isolation and observation without clinical symptoms, which is in agreement with a very recent online report (19).
Additionally, 16 pharyngeal swabs were collected from convalescent patients (Table 3). Twelve positive, 3 suspect and 1 negative were reported by qPCR. However, all 16 patients were determined to be positive by dPCR, indicating that all of them still needed to be observed in hospital. The correlation between age and the viral RNA copy number was analyzed (Fig. 4). Interestingly, except for P15, the virus copy number was much higher with increasing age, which indicates that a longer observation in the hospital is needed. We set thresholds of 15, 20 and 25 copies/reaction for ORF1ab, N and E, respectively. The ORF1ab, N and E gene copy numbers were higher than their thresholds for 100% of patients older than 60 and 75% (6 out of 8 patients) of those older than 55 (the median).
4 Discussion
RT-qPCR, as the standard method for the diagnosis of SARS-CoV-2, plays an important role in this outbreak, though a low positive rate has been reported (5). A number of factors could affect RT-PCR testing results, including sample collection and transportation, RNA extraction and storage, and proper performance of the kit (20). More recently, more than 145 RT-qPCR kits have been developed by in vitro diagnostic (IVD) manufacturers in China (21). Among the qPCR kits, those with low sensitivity would cause a high false negative rate or a high equivocal rate. For equivocal results it is necessary to conduct a retest, but due to the daily burden of thousands of incoming samples it is often impossible to do a same-day retest. The testing laboratory should initially report a result based on a single test, while secondary sampling for a later retest does not need to be sent to the same laboratory. Therefore, the availability of a highly sensitive and accurate confirmatory method is of particular importance for the diagnosis of SARS-CoV-2 in this outbreak.
Currently, besides RT-qPCR, other methods such as next generation sequencing (NGS) and immunological detection of IgM and IgG could be used as confirmatory methods for the diagnosis of COVID-19 according to the latest guideline of Diagnosis and Treatment of Pneumonitis Caused by SARS-CoV-2 (trial seventh version) published by the National Health Commission (22). Applying multiple methods would reduce the false negative rate. However, nucleic acid testing is still considered the gold standard, as it is the most direct way to detect the presence of the virus. Thus, the digital PCR method established in this study could be a powerful complementary method because it can significantly improve the positive rate for suspected patients. Furthermore, it is very sensitive to the very low virus load in close contacts and suitable for monitoring the change of the virus load in convalescent patients. An additional advantage of quantifying the SARS-CoV-2 copy number by dPCR is that comparisons can be made between different dates and different laboratories, as absolute quantitation of targets by dPCR provides high concordance between sites, runs and operators (14, 23, 24). In contrast, it is not possible to compare Ct values between different runs or different machines. Thus, dPCR is an ideal method for measuring the change of virus load in convalescent patients.
CONCLUSIONS
This work demonstrates that dPCR significantly improves the accuracy and reduces the false negative rate of SARS-CoV-2 diagnostics, and that it could be a powerful complement to the current RT-qPCR. Furthermore, dPCR is more sensitive and suitable for low virus load specimens from patients under isolation and observation who may not be exhibiting clinical symptoms.
Data Availability
All relevant data are available upon request.
ASSOCIATED CONTENT
Research Funding
Fundamental Research Funds for Central Public welfare Scientific research Institutes sponsored by National Institute of Metrology, P.R. China (31-ZYZJ2001/AKYYJ2009)
Acknowledgments
We would like to thank Academy of Military Medical Sciences for providing the purified virus RNA for method validation, Wuhan considering laboratory for medical test for conducting the dPCR measurement and BioRad, China for donating the one step RT-dPCR mastermix.