Abstract
Key Features
Virus Watch is a national community cohort study of COVID-19 in households in England and Wales, established in June 2020. The study aims to provide evidence on which public health approaches are most effective in reducing transmission, and investigate community incidence, symptoms, and transmission of COVID-19 in relation to population movement and behaviours.
28,527 households and 58,628 participants of age (0-98 years, mean age 48), were recruited between June 2020 - July 2022
Data collected include demographics, details on occupation, co-morbidities, medications, and infection-prevention behaviours. Households are followed up weekly with illness surveys capturing symptoms and their severity, activities in the week prior to symptom onset and any COVID-19 test results. Monthly surveys capture household finance, employment, mental health, access to healthcare, vaccination uptake, activities and contacts. Data have been linked to Hospital Episode Statistics (HES), inpatient and critical care episodes, outpatient visits, emergency care contacts, mortality, virology testing and vaccination data held by NHS Digital.
Nested within Virus Watch are a serology & PCR cohort study (n=12,877) and a vaccine evaluation study (n=19,555).
Study data are deposited in the Office of National Statistics (ONS) Secure Research Service (SRS). Survey data are available under restricted access upon request to ONS SRS.
Why was the cohort set up?
The United Kingdom Research and Innovation (UKRI) Medical Research Council (MRC) & the Department of Health and Social Care National Institute for Health and Care Research (DHSC NIHR) funded Virus Watch in April 2020 under the COVID-19 Rapid Response Call 2. During the early stage of the pandemic in the UK, (February/March 2020), data on COVID-19 was largely collected in hospital settings. Our aim was to bring together an experienced team of respiratory infectious disease epidemiologists to rapidly establish a national community cohort study of COVID-19 in households living in England and Wales that built upon our experience from Flu Watch - a community cohort designed to estimate community burden of influenza and influenza-like illness (1). The DHSC/UKRI awarded additional funding to the study (under the Rapid Response Initiative call) in August 2020 to recruit larger numbers of minority ethnic and migrant populations when it became increasingly apparent that these groups were underrepresented in research studies whilst experiencing greater risk of hospitalisation and death from COVID-19.
Virus Watch aims to provide evidence on which public health approaches are most likely to be effective in reducing the spread and impact of the virus and investigates community incidence, symptom profiles, and transmission of COVID-19 in relation to population movement and behaviour (2).
The main research questions we set out to address were: what is the rate of infection; what is the rate of infected people experiencing symptoms; what is the rate of people seeking healthcare; what are the hospitalisation and mortality rates associated with COVID-19, at different points in time and within different population groups. We wanted to describe COVID-19 symptoms and their severity and to understand people ‘s behaviour in terms of infection prevention as well as movement, travel, activities and social contact. Understanding how these outcomes differ by ethnicity, migration and deprivation, and what risk factors may explain any differences, constituted some of our key objectives.
In addition, we wanted to understand how negative consequences of the COVID-19 pandemic and public health control measures affect economic circumstances and mental health. We had a particular focus on people from minority ethnic and migrant backgrounds and how access to primary care for COVID-19 varied among these groups compared to the White British population and what factors might explain this.
After vaccines became available in the UK in December 2020, further funding was provided by the DHSC for undertaking serological testing for a subset of the Virus Watch cohort. This program ran from February 2021 – April 2022 and was designed to assess the effectiveness and impact of COVID-19 vaccines on both symptomatic and asymptomatic infections and on transmission. We wanted to compare the effectiveness of vaccines in different population subgroups and assess the duration of protection, correlates of protection and immunity against emerging strains.
Who is in the cohort?
As of the 7th of July 2022, 58,628 participants aged 0 - 98 years (mean age 48 years) from 28,527 households had enrolled into Virus Watch (see Table 1).
Virus Watch is a prospective household community cohort study. Recruitment methodology was adapted throughout the study (see Table S1) to ensure the target number of participants from the general population was reached. We also aimed to recruit a sufficiently large sample of participants from minority ethnic backgrounds to investigate infection risk and impact of the pandemic on specific groups of interest. Postcards, leaflets and adverts used to recruit participants were designed to inform individuals about the study and direct them to our website http://ucl-virus-watch.net/ where they could self-select into the study.
We used the Royal Mail Post Office Address File to generate a list of sampled residential addresses to send Virus Watch recruitment postcards to. The initial sample design was a single-stage stratified probability sample. Within each region, residential addresses were sorted by quintiles of Index of Multiple Deprivation (IMD), within quintiles by local authorities, postcodes and address (2). We also delivered invitation leaflets to letterboxes in residential areas around our blood taking clinics to ensure we could recruit our target number of participants into the laboratory subcohort. Enrolled participants were invited to join the laboratory subcohort if they lived within 5km (urban areas) or 10km radius (rural areas) of the nearest blood taking clinic for serological sampling.
We worked with nine of the 15 NIHR Local Clinical Research Networks (LCRNs) across England to send SMS messages from General Practitioner (GP) clinics to their patient lists with a link to the study website inviting participants to take part. To boost recruitment of minority ethnic participants into the cohort, we sent targeted letters based on ethnicity via 90 GP clinics from 9 LCRNs that included a £20 voucher incentive per household to sign up.
Digital invitations were shared via trusted networks including patient advocacy group websites, Twitter, Facebook and WhatsApp. We also undertook a paid advertising campaign via Facebook.
Study participants were emailed and asked to share an invite with their family and friends. Throughout the pandemic, multiple newspaper articles, radio and TV appearances of the study team also contributed to public-facing exposure and recruitment.
How often have the participants been followed up?
After signing up to the study and completing a baseline survey for every member of the household, a nominated household study lead completed a weekly online illness survey, and monthly surveys (from Dec 2020) about pandemic-relevant sociodemographic and clinical factors.
Participants in the laboratory subcohort were invited to either attend a clinic in their local area or schedule a home visit to have their blood taken on 2 occasions (Oct 2020 - Jan 2021 and again between May - Aug 2021). Participants that were unable to visit a clinic and could not receive a home visit were asked to provide a finger prick sample (in clinic or self-collected at home). Between Oct 2020 and May 2021, participants also posted self-administered nasal swabs for PCR assays of SARS-CoV-2 if they experienced any of the following symptoms for 2 or more days; fever, cough or loss or change of sense of taste or smell. The design of the laboratory subcohort, including the sample collection algorithm and specific symptoms of interest has been published in the study protocol (2).
Adults taking part in the vaccine evaluation subcohort posted self-collected finger prick capillary blood microsamples monthly between Feb-Aug 2021, and every other month from Sep 2021-Mar 2022.
For participants living in England, linkage of Virus Watch to data held by NHS Digital (Hospital Episode Statistics, Death Registrations, National Immunisation Management System (NIMS), COVID Vaccine Adverse Events Data, virological surveillance data) will take place quarterly during the study and for up to 5 years after the end of the study (until 30 September 2026).
Retention of participants in the study has decreased over time. Many participants dropped out of the study and failed to complete any weekly surveys after enrolling but among those who completed at least one weekly survey, engagement was high. In the first 6 months of the study approximately 75% of enrolled participants were regularly completing the weekly illness surveys (Figure 2). Over the course of the pandemic the proportion of participants who were lost to follow-up steadily increased and by May 2022 the proportion still regularly completing surveys had reduced to around 50% of enrolled participants. Participants self-select into the study and are free to stop participating at any time. We used a 75% survey completion rate for weekly surveys as a cut-off to compare the characteristics of high responders and low responders. Participants who completed less than 75% of all possible weekly surveys were more likely to be younger, from an ethnic minority background and living in London (Table 2).
Missing demographics from the baseline survey were requested via a short one-off survey of 5312 study households (n=14,166 participants) in February 2021. This provided 857 (16%) household addresses, the sex of 3,985 (28%) people, and the ethnicity of 3,819 (27%) people.
For records with missing sex data after linkage to NHS Digital data, we assigned gender using the probability of given names being male or female based on US names from 1930-2015 (https://data.world/howarder/gender-by-name) (5). Sex was reported by 49,255 participants, and following gender matching by name, a further 8,552 classifications of sex were inferred. The accuracy of this technique was tested on the 49,255 complete records and found to be 99.82%. Notably, this method fails to account for the minority of individuals who are intersex/other non-binary gender identities and should be interpreted with this in mind.
Data linkage to NIMS for immunisation records up to 23rd December 2021 yielded an additional 11,221 records for Dose 1 of COVID-19 vaccine that were not self-reported, 12,593 for Dose 2, 14,009 for Dose 3, 41 for dose 4 and 12 for Dose 5. We anticipate future data linkage to increase the number of matched missing records for doses 4 & 5. Linkage to the ONS mortality dataset held by NHS Digital, identified 153 participants that had died (up to November 2021) compared to 59 reported deaths by household or family members.
Study participant data were linked to the national ‘Pillar 2 ‘ COVID-19 community testing programme by NHS Digital, which provided an additional 291,067 test results (negative and positive results). We asked participants to self-report only positive test results and the first subsequent negative result via the weekly surveys; consequently only one result was recorded per week per participant. This likely explains the difference between the high number of tests recorded via Pillar 2 compared to our study. Linking to the Second Generation Surveillance System (SGSS) which contains results of testing performed in hospital patients and health and care workers and is held by NHS Digital, provided 5,345 additional test results.
What has been measured?
Participants completed detailed study questionnaires online via REDCap database tools hosted on the secure UCL Data Safe Haven (6). Demographic, clinical, and socioeconomic data were collected at baseline for each household member, as well as any previous COVID-19 illness. Symptoms, activities, COVID-19 test results and vaccinations were reported in weekly illness surveys and monthly surveys asked questions on a broad range of topics including behaviours, mental health and disability tailored to the pandemic phase. Table 3 summarises the survey data collected from Virus Watch participants, additional data acquired from linkage via NHS Digital, biological data collected from participants in the laboratory and vaccine efficacy subcohorts and geolocation data from an optional movement tracker app sub-study.
As of 7th December 2022, 1,619,300 weekly surveys had been submitted with 21,299,348 person days of follow up. 351,751 individual responses to monthly surveys were received, 190,993,995 GPS coordinates. From laboratory substudies we collected 10,974 full serological samples, 107,708 finger prick samples and of these, 4,972 live virus neutralisation activity of capillary microsamples were tested.
What have we found?
Virus Watch aimed to provide evidence on the transmission and impact of COVID-19 and aimed to estimate key epidemiological measures including: the incidence of PCR-confirmed COVID-19; incidence of hospitalisation among PCR-confirmed COVID-19 cases; incidence of respiratory infection symptoms, including COVID-19 disease case definitions; and secondary household attack rates. Other outcomes of interest included investigating the effectiveness and impact of control measures including testing, isolation, social distancing and vaccine effectiveness against asymptomatic and symptomatic infections.
Early in the pandemic, we took an active decision to avoid duplication of effort in reporting the incidence of infection and hospitalisation once the ONS COVID-19 Infection Survey and the UK Health Security Agency dashboard data were established and chose to focus on investigating: the symptoms of COVID-19 and COVID-19-like illness; risk factors for, and behaviours associated with infections and vaccination; and immunity against COVID-19. We have published a summary of key findings on the study website https://ucl-virus-watch.net/?page_id=1323 and a full list of publications and pre-prints is available at: https://ucl-virus-watch.net/?page_id=1388.
More deprived communities have been disproportionately impacted by the health, social and economic effects of the COVID-19 pandemic. Greater day-to-day exposure to people outside of their household and/or support bubble (e.g. lesser ability to work from home, greater dependence on public transport, etc) may be driving higher infections, hospitalisations and deaths in deprived areas. To explore this we used participant reported data on daily activities during 3 weekly periods in late November 2020, late December 2020 and mid-February 2021 (7). During the final week of November and the December holiday period (23-27 Dec 2020), participants living in more deprived areas were more likely to: leave their house to go to work or school; use public transport; share a car with a non-household member; visit an essential shop; and have close contact with a non-household/support bubble member than participants living in the least deprived areas. Participants living in more deprived areas were not more likely to undertake social and entertainment activities or visit non-essential shops and services. Our findings suggest that differences in essential daily activities are likely to be contributing to higher infection rates in more deprived regions. These differences are likely to reflect circumstances that constrain individual choice, e.g. car ownership, ability to work from home and disposable income. There was no observed difference in activities that are more likely to reflect individual decision making, such as attending non-essential shops or social and entertainment activities.
Work investigating anti-spike SARS-CoV-2 antibody levels among Virus Watch participants who had received COVID-19 vaccines provided key insights into antibody waning and protection against breakthrough infections (8–11). In one analysis following the vaccine rollout in the UK, we measured antibody levels in almost 9000 study participants who had received two doses of ChAdOx-S1 or BNT162b2 vaccine at 3 weeks after the second dose and 20 weeks after the second dose (9). Antibody levels dropped at the same rate for both vaccines, but peak levels were much higher following the BNT162b2 vaccine. We found those with lower antibody levels were at increased risk of infection. We also showed that peak anti-S levels are higher post-booster than post-second dose, but that levels are projected to be similar after six months for BNT162b2 recipients. While peak antibody levels post-second dose were substantially lower for ChAdOx-S1 than BNT162b2 recipients, no differences in post-booster antibody levels by primary course type were observed (Figure S1). The magnitude and trajectory of post-booster anti-S response was similar across age groups and by clinical vulnerability status. Higher peak anti-S levels post-booster may partially explain the increased effectiveness of booster vaccination compared to two-dose vaccination against symptomatic infection with the Omicron variant.
What are the main strengths and weaknesses?
Following up whole households longitudinally, rather than being limited to individuals is a unique strength of Virus Watch. The cohort has data on all ages including children and adolescents and is broadly representative of the UK population on geographic spread and deprivation (Table 1), with some exceptions. A particular focus was placed on engaging participants from minority ethnic groups with recruitment methods and research question prioritisation being guided by our study advisory group composed of community leaders, policy experts and charity representatives. Study surveys and data collection methodologies were developed based upon 6 years of experience running large national surveillance cohorts of pandemic and seasonal influenza (12)and we used validated questionnaires wherever feasible (e.g. PHQ-9 and GAD-7 to assess depression and anxiety within the cohort).
The study dataset has been linked to national Pillar 2 testing (PCR & lateral flow testing data through national Test Trace and Isolate Programme), the national vaccine register as well as hospitalisations and deaths. We have collected detailed demographic information, clinical co-morbidity data and have stored serum from participants in the laboratory subcohort at 2 time points, as well as longitudinal serum micro-samples from the vaccine evaluation subcohort. Virus Watch is one of the few longitudinal studies to present quantitative spike and nucleocapsid antibody test data among adults and qualitative (positive/negative) serology among children, together with detailed vaccination information and clinical comorbidities The cohort has several important limitations. Households self-selected into the study after receiving an invitation via multiple routes, biassing the sample towards participants with an interest in COVID-19 and health research. Households with more than six members were not eligible for the study due to the limitations of the REDCap survey infrastructure, and people living in institutional settings such as care homes, university halls of residence and boarding schools were not eligible to participate, limiting the generalisability of findings for these groups. Households also needed to have either a mobile telephone, tablet or computer with an internet connection, a valid email address and at least one household member who could read and respond in English to complete regular surveys.
Several demographic groups are underrepresented in the study. The cohort is older (mean age = 48 years), with a greater proportion of people in the 45–64-year age group when compared to the general population. Some ethnic groups are also under-represented, notably the Black and Other Asian groups (Table 1). Our ability to disaggregate data into more granular categories of ethnicity is limited due to the small number of people in these groups enrolled in the study. Retention of participants has decreased significantly over the 2 years the study has been running. As restrictions have been lifted and interest in the pandemic has waned among the general public, around half of the participants enrolled have stopped regularly completing study surveys. Participants who have disengaged are more likely to be younger, from an ethnic minority background and living in London, limiting statistical power and likely biasing analyses using more recent study data.
Can I get hold of the data? Where can I find out more?
Given the sensitive content in our dataset (information on health, income and household characteristics) for this study, we cannot publish data at the individual level, publicly. We are sharing individual record level data (excluding any data or variables originating from linkage via NHS Digital) on the ONS SRS (DOI: 10.57906/s5f5-nq13). The data are available under restricted access and can be obtained by submitting a request directly to the SRS. We regularly share results and updates on the study via a “Findings so far” section on our website - https://ucl-virus-watch.net/
Data Availability
Given the sensitive content in our dataset (information on health, income and household characteristics) for this study, we cannot publish data at the individual level, publicly. We are sharing individual record level data (excluding any data or variables originating from linkage via NHS Digital) on the ONS SRS (DOI: 10.57906/s5f5-nq13). The data are available under restricted access and can be obtained by submitting a request directly to the SRS. We regularly share results and updates on the study via a "Findings so far" section on our website - https://ucl-virus-watch.net/
Ethics approval
This study has been approved by the Hampstead NHS Health Research Authority Ethics Committee. Ethics approval number – 20/HRA/2320.
This study uses NHS HES (Admitted Patient Care, Outpatient Bookings, Emergency Care Dataset and Critical Care), ONS mortality, Vaccination (NIMS), and COVID-19 testing data (Pillar 2 and SGSS) that were provided within the terms of a data-sharing agreement (DARS-NIC-372269-N8D7Z-V1.6) to the researchers by the Health and Social Care Information Centre (NHS Digital). The data do not belong to the authors and may not be shared by the authors, except in aggregate form for publication. Data can be obtained by submitting a data request through the NHS Digital Data Access Request Service.
Author contributions
Contribution Authors
Conceptualisation and study design AH, EF, JK, PH, EN, IC, VL, RAM, TC, AMJ, SM, JG, RG, AR, RWA
Project administration AH, EF, JK, VN, SB, TB, AA, PH, LW, WLEF, CG, PP, MSh, AMDN, EN, MSp, RWA
Data management VN, WLEF, SB, AMDN, CG, TB, RWA
Writing–original draft preparation JK, TB
Writing–review and editing All authors
Funding
This work was supported by the Medical Research Council [Grant Ref: MC_PC 19070] awarded to UCL on 30 March 2020 and Medical Research Council [Grant Ref: MR/V028375/1] awarded on 17 August 2020. The study also received $15,000 of advertising credit from Facebook to support a pilot social media recruitment campaign on 18th August 2020. The antibody testing undertaken by the vaccine evaluation subcohort was supported by funding from the Department of Health and Social Care from Feb 2021 - March 2022.
From 1 May 2022 Virus Watch received funding from the European Union (Project: 101046314). Views and opinions expressed are however those of the author(s) only and do not necessarily reflect those of the European Union or the European Health and Digital Executive Agency (HaDEA). Neither the European Union nor the granting authority can be held responsible for them.
This study was supported by the Wellcome Trust through a Wellcome Clinical Research Career Development Fellowship to RWA [206602] and a Clinical PhD Fellowship to AA [206441/Z/17/Z] IB is supported by an NIHR Academic Clinical Fellowship. SB and TB are supported by an MRC doctoral studentship (MR/N013867/1). Research at UCL Great Ormond Street Institute of Child Health led by PH and LW is supported by the NIHR Great Ormond Street Hospital Biomedical Research Centre.
Conflicts of interest
ACH serves on the UK New and Emerging Respiratory Virus Threats Advisory Group. AMJ is Chair of the Committee for Strategic Coordination for Health of the Public Research.
Footnotes
(joint first author)
(joint last author)
List of acronyms
- DHSC
- Department of Health and Social Care
- GP
- General Practitioner
- HES
- Hospital Episode Statistics
- IMD
- Index of Multiple Deprivation
- LCRN
- Local Clinical Research Network
- MRC
- Medical Research Council
- NHS
- National Health Service
- NIHR
- National Institute for Health and Care Research
- NIMS
- National Immunisation Management Service
- ONS
- Office for National Statistics
- PCR
- Polymerase Chain Reaction
- SGSS
- Second Generation Surveillance System
- SRS
- Secure Research Service
- UKRI
- UK Research and Innovation