Abstract
Objectives To estimate the evolution of the prevalence of SARS-CoV-2 virus infection among residents aged 18 years or over in the municipality of São Paulo.
Methods This is a population-based household survey conducted every 15 days, between June and September 2020 and January and February 2021. In total, 11 phases were performed. The presence of antibodies against SARS-CoV-2 was identified in venous blood using a lateral flow test, Wondfo Biotech. In the last phase, it was combined with an immunoenzymatic test, Euroimmun. Participants also answered a semi-structured questionnaire on sociodemographic and economic factors and social distancing measures. Prevalence estimates and 95% confidence intervals were estimated according to the region, Human Development Index, sex, age group, ethnicity, education, income and variables associated with risk or prevention of the infection. To compare the frequencies among the categories of each variable, the chi-square test with Rao Scott correction was used, considering a 5% significance level.
Results In total, 23,397 individuals were interviewed and had their samples collected. The estimated prevalence of antibodies against SARS-CoV-2 ranged from 9.7% (95%CI: 7.9-11.8%) to 25.0% (95%CI: 21.7-28.7). The prevalence of individuals with antibodies against SARS-CoV-2 was higher among black and pardo people, people with lower schooling, people with lower income and among residents of regions with lower Human Development Index. The lowest prevalences were associated with recommended measures of disease protection. The proportion of asymptomatic infection was 45.1%.
Conclusion The estimated prevalence of SARS-CoV-2 infection was lower than the cumulative incidence variation, except for the last phase of the study. The differences in prevalence estimates observed among subpopulations showed the social inequality as a risk of infection. The lower prevalence observed among those who could follow prevention measures reinforce the need to maintain the social distancing measures as ways to prevent SARS-CoV-2 infection.
Introduction
In December 2019, the World Health Organization (WHO) received a notification of pneumonia outbreak in Wuhan, Hubei Province, People’s Republic of China. The identification of the etiological agent occurred quickly, a new coronavirus called SARS-CoV-2. On January 30, 2020, WHO declared the disease outbreak caused by the SARS-CoV-2 virus – COVID-19 – as a Public Health Emergency of International Concern (PHEIC), the highest WHO alert level according to the International Health Regulations (IHR). On March 11, 2020, WHO declared the outbreak of COVID-19 as a global pandemic1.
In Brazil, the Ministry of Health (MoH) declared the COVID-19 as a Public Health Emergency of National Concern (PHENC) on February 03, 20202. The first case was diagnosed on February 26, 2020 and on March 20, 2020, MoH declared the community transmission of COVID-19 in the national territory3.
From the date of the first case until March 28, 2021, the disease was detected in 12,490,362 people and caused 310,550 deaths, the incidence rate was 5,943.6 per 100,000 inhabitants and the mortality rate was 147.8 per 100,000 inhabitants4.
In the Municipality of São Paulo (MSP), until March 26, 2021, 2,390,256 cases of acute respiratory infection (ARI) in residents were notified in the report system (e-SUS Notifica), of which 609,380 (25.5%) were confirmed for SARS-CoV-2 infection. In the Epidemiological Surveillance Information System of ARI (SIVEP Gripe), 161,758 cases of severe acute respiratory syndrome (SARS), residing in the MSP were reported, of which 90,049 (55.7%) were confirmed for COVID-19. Among the COVID-19 cases, 21,051 (3%) evolved to death.
Given the epidemiological situation of COVID-19 in the MSP, on March 23, 2020 the city council adopted strategies to diminish the disease transmission, establishing voluntary quarantine with the closure of non-essential services5. On May 29, 2020, the Decree No. 59,473, gradually reopened some non-essential services6, therefore, it became extremely important to know the serological situation of the population regarding COVID-19 infection to support decision-making.
This serial serological survey was designed to represent people with 18 years or older in the MSP. Population-based data was needed to direct strategies to combat the pandemic and evaluate the effects of COVID-19 actions to prevent and control the disease. In this sense, our study aimed to estimate the prevalence of SARS-CoV-2 virus infection in adults with 18 years or older, living in the MSP; estimate the proportion of asymptomatic individuals with positive tests and describe the evolution of the prevalence of SARS-CoV-2 virus infection over time.
Methods
This is a serial serological survey to estimate the prevalence of SARS-CoV-2 infection in the MSP.
Study area
The MSP, the capital of the state of São Paulo, has a Human Development Index (HDI) of 0.805 and an estimated population in 2020 of 11.9 million inhabitants, of which 9.2 million were 18 years or older. The municipality presents a large economic and social disparity, reflecting heterogeneities in the education, income and housing. The Gini Index in 2010 was 0.64537.
For the planning of healthcare actions, the municipality is divided into six regions: North, Central, West, Southeast, East and South, 27 Technical Health Supervisions, 472 primary healthcare units (PHU) and their respective coverage areas (CA) (Figure 1).
Figure 1 shows the social and economic disparities between and within the regions of the MSP.
Sample methodology
In 2020, a pilot study was performed in June and seven cross-sectional studies were repeated periodically in different samples, every 15 days, from June to September. In 2021, four other studies were repeated in January and February. In each phase, a stratified probabilistic sample with a simple casual sampling was used within each stratum8.
For a prevalence ranging from 5% to 20%, the sample size was determined so that, the estimates were produced with coefficients of variation lower than 15% for the MSP and lower than 30% for the regions, with the exception of the Central and West regions, which have a smaller number of PHU. This process results in eight dwellings per stratum. The sample size was increased to 12 dwellings to maintain the accuracy of the estimates affected by the non-response (closed households or refusals to participate in the study). In the Central and West regions, the sample size per CA-PHU was also increased to 15 to also compensate the smaller number of PHU in the region (Table S1, supplementary material).
Statistically, this is a single stage sampling (the dwelling is the sample unit), however, operationally, the selection was made in two stages; in the second stage, the resident was selected in the household using the last birthday method. The selected person collected material for laboratory analyses and was interviewed. If the selected property was closed or the selected resident was not at home at the time of the visit, the PHU’s team should return up to twice.
Database
The addresses of dwellings were accessed from a database composed by three different types of records updated between the study phases: a) residential property taxpayers (IPTU) of 2020; b) hydrometers of sanitation company (SABESP) of 2017; c) Family Health Strategy (FHS). The distribution of dwellings by record was unequal in the municipality. Some areas did not have representativeness from one of the records or the number was insufficient in the sample. The number of dwellings in each stratum was selected from the database, proportionally to the number of dwellings in each record.
Testing and questionnaire
Testing was performed using the SARS-CoV-2 Antibody test® (Wondfo Biotech, Guangzhou, China) that detects IgM/IgG antibodies against SARS-CoV-2, without discriminating the type of immunoglobulin. In Brazil, the test is distributed by MoH under the name One Step COVID-2019 Test® and the legal manufacturer is Celer Biotecnologia S/A9.
The test is based on the principle of lateral flow immunochromatographic for the detection of IgG/IgM antibodies against SARS-CoV-2 in human blood, serum or plasma10.
In this survey, venous blood samples were used to obtain the serum, since the validation study of the OneStep Wondfo Test indicated an increase of sensitivity with serum sample in comparison with capillary blood obtained by digital puncture. Pellanda et al11 estimated a sensitivity of 84.8% and a specificity of 99.0% by assessing the results of four validated studies.
In the last phase, to increase the sensitivity and the specificity of virus detection, the samples were also processed using an immunoenzymatic test (anti-SARS-CoV-2 ELISA) from EUROIMMUN, which uses the S1 spike protein as an antigen for the detection of IgG antibodies against SARS-CoV-2 in serum12,13.
The official laboratory results were reported to the study participants. Participants were interviewed with a semi-structured questionnaire. The information collected were: sex, age, schooling, ethnicity (self-reported), family income, household size, symptoms potentially related to COVID-19, healthcare service use, previous SARS-CoV-2 test, contact history with suspected or confirmed cases of COVID-19. Also, the interviewees were questioned about their work regime, the social distancing adopted, facemask use, visit to non-essential places and public transportation use.
Data analysis
Data were included in a standardized electronic form in FormSUS/DATASUS platform, version 3.014. Data processing and analysis were performed using the statistical packages R and STATA version 13.
Indeterminate results were classified in the analyses as negative.
Data were analyzed considering five regions in the MSP: Central-West, East, North, Southeast and South. Prevalence estimates were weighted according to the sampling design.
Prevalence with 95% confidence intervals (CI) were estimated according to the region and HDI of the MSP, sex, age, ethnicity, education, income and presence of symptoms of the individual, risk factors, recommended measures for prevention and control of the disease and social distancing. Also, the proportion of asymptomatic infections were calculated.
For statistical data analyses, four phases were randomly chosen from the 11 phases performed in this study.
Rao Scott chi-square test was used to compare the frequencies between the categories of each variable, considering a 5% significance level.
Ethical aspects
The study was approved by the Brazilian’s National Ethics Committee (CAAE 32947920.3.0000.0008). Blood sample were collected and the individual were interviewed only after written informed consent from all participants.
Results
Out of 63,372 selected individuals with 18 years or over, residing in the five regions of the municipality of São Paulo (MSP), 23,397 (36.9% participation rate) were interviewed and collected samples.
Table 1 shows the prevalence estimates of antibodies against SARS-CoV-2 in the MSP and their respective 95% CI for each phase of the study. All values found were within the range of CI variation of the previous phases, except in the last phase, when two laboratory tests were used to increase the sensitivity and the specificity of virus detection.
Table 2 shows the prevalence estimates of SARS-CoV-2 infection in phases 1, 4, 7 and 11. The results of phase 11 with only the rapid test (11a) and with the addition of the ELISA test (11b) are presented separately.
The prevalence estimates of this study did not follow the evolution of the accumulated cases of SARS-CoV-2 infection in the MSP, except for the last phase as shown in Figure 2A.
The design effect (deff) varied between the study phases, from 1.93 in phase 7 to 3.09 in the last phase; in phase 1, the deff was 2.71 and in phase 4; 2.95.
The coefficients of variation of prevalence estimates for the municipality were below 15% in all phases and below 30% in all regions, except for the Central-West region.
There was a significant difference between the regions of the municipality of São Paulo in phases 4 and 7 of the study. The Central-West region presented the lowest estimates, whereas the Southern region presented the highest.
Table 2 also presents the prevalence estimates of SARS-CoV-2 infection, according to demographic and clinical characteristics. Regarding age, the estimated prevalences varied between the phases of the study, in phase 1, the highest prevalence was found in the age group from 50 to 64 years; in phase 4, from 18 to 34 years; in phase 7, from 35 to 49 years and in phase 11, from 18 to 34 years. Only in phase 7, the difference among age groups was significant.
In all phases, women presented the highest prevalence, especially in phase 11, when the difference was statistically significant. The estimated prevalence of black and pardo people was the highest in all phases. There was a significant difference in ethnicity in all phases.
The highest prevalence estimates were found in the groups with the lowest education levels. Table 2 shows significant differences between the education levels in phases 1, 4 and 7.
The proportion of asymptomatic individuals among the positive cases ranged from 35.3% (95%CI: 29.5-41.6) in phase 7 to 45.1% (95%CI: 38.2-52.2) in phase 11. In phase 1, the proportion was 43.6% (95%CI: 37.4-50.0) and in phase 4, 40.5% (95%CI: 31.8-49.9). As shown in Table 2, the prevalence of cases without symptoms increased throughout the study phases.
Table 3 shows the prevalence estimates of SARS-CoV-2 infection, according to socioeconomic factors and recommended measures adopted, for phases 1, 4, 7 and 11.
Prevalence estimates were inverse to the categories of family income that were distributed in A/B (≥ R$ 8,641.00), C (R$ 2,005.00 to R$ 8,640.00) and D/E (≤ R$ 2,004.00). Moreover, HDI ranges were used to estimate the regional differences of the city, range A (0.84 to 0.95) had the lowest prevalence estimates whereas range C (0.62 to 0.73) presented the highest estimates. The difference among HDI ranges and family income categories were significantly different, except in phase 11 (Table 3).
Regarding the work situation of the interviewees, the study showed that those in telework had lower prevalence when compared to the other categories, especially in phases 1 and 4 (3.9% and 4.4%, respectively). Likewise, those who did not use public transportation and practiced social distancing presented the lowest prevalence estimates. The difference was significantly in phases 1 and 11.
The difference between the categories for mask use, “always”, “most of the time” and “sometimes” was significant in phase 11 (24.4%).
Table S2 (supplementary material) shows the reasons for non-response in each phase of the study. In 62.6% (39,169) of the selected dwellings it was not possible to conduct the interview and sample collection. The non-response rate presented an increase trend throughout the study.
Discussion
The results showed that most people in MSP are still susceptible to the SARS-CoV-2 virus even with the increasing number of SARS-CoV-2 infections during the pandemic. The prevalence of individuals with positive results was higher among the black and pardo people when compared with the white people. Also, the prevalence was inversely associated with the education level and the income category of the individual and with the HDI of the primary healthcare unit coverage area (CA-PHU) of the selected dwelling. The lowest prevalences were associated with recommended protection measures against coronavirus disease. In 2020, several studies in Brazil and around the world aimed to estimate the prevalence of SARS-CoV-2 infection in their populations15-21. The results of different prevalence studies should be carefully compared, since they depend on the moment of the pandemic evolution and the laboratory test used22.
In our study, the estimated prevalence of antibodies against SARS-CoV-2 in the MSP ranged from 9.7% (95%CI: 7.9-11.8) in phase 1 to 16% (95%CI: 13.1-19.3) in phase 10; in phase 11, when two laboratory tests were used, the estimate was 25% (95%CI: 21.7-28.7). In Brazil, Hallal et al.20 conducted a large study involving two serological surveys in 133 sentinel cities in all Brazilian states, they used the same laboratory test of our study with finger prick blood samples and estimated prevalences between 0% and 25%. In the state of Espírito Santo, Gomes et al.22 conducted a population-based serial cross-sectional study in 11 municipalities, they found a prevalence of 2.1% using the Celer Technologies Inc test in a finger prick blood sample. In the municipality of São Paulo, a study conducted in six administrative districts found a low prevalence (4.7%) using the MAGLUMI 2019-nCoV chemiluminescence immunoassay test21.
In the present study, the highest prevalence estimates were initially found in the age group from 50 to 64 years, which is a vulnerable population most affected at the beginning of the pandemic. In the following phases, the younger age groups were the most prevalent, which is the economically active population that most circulates in the city. This fact may be associated with the economic recover of the municipality with the reduction of restrictions and reopening of commercial establishments6.
Similar to other studies, there was no difference between the prevalence of men and women 16,17,20.
Disparities in prevalence estimates by ethnicity and level of education are consistent with the inequalities associated with demographic, socioeconomic and risk factors for the disease transmission in the studied municipality, as shown by Tess et al.21 and Rosenberg et al.15.
The prevalence of SARS-CoV-2 infection was inverse to the individual’s income and to the HDI of CA-PHU, therefore, the lower the income and HDI in the CA-PHU, the higher the prevalence of SARS-CoV-2 infection. Similar to the studies of Bermudi et al.23 and Menezes et al.24, vulnerable populations with lower income and poor housing conditions have a higher risk of virus transmission and difficulties to acess healthcare for diagnosis and treatment.
The proportion of individuals that have not reported symptoms related to COVID-19 since the beginning of March 2020 among those with positive test for the SARS-CoV-2 virus varied among the study phases. The proportion of up to 45.1% was high when compared with other studies. The proportion of asymptomatic individuals with 18 years or older reported in seroprevalence studies in England18 and in a USA city (Chelsea)25 were 32.2% and 24.7%, respectively. Similar to this study, Tess et al.21 found a high proportion of asymptomatic individuals in residents of six districts of the MSP (45.3%).
The risk associated with the work situation was lower among individuals working from home when compared to those who work at the office, this results were consistent with other seroprevalence studies of SARS-CoV-226,27.
The use of masks was a protective measure against SARS-CoV-2 infection, similar to that found in studies performed in the state of Maranhão26 and in China28.
This study has some limitations. It included the high rate of non-response, the address or the dwelling selected in the database were not identified (29.0%) or the selected individual refused to receive the work team or participate in the study (26.7%), since venous blood samples were collected instead of finger prick in order to increase the sensitivity of the test, as shown by Hallal et al.20 and Tess et al.21 The large number of teams (471) composed of employees of the primary healthcare units and responsible for data collection may have contributed to divergences in the approach of individuals to undergo the interview and the sample collection. The design effect (deff) observed for the prevalence estimates for the municipality and for the regions was higher than expected for stratified samples, possibly due to the large number of strata in the study8. The high deff contributed to the low accuracy of the estimates; therefore, future studies should review the sample design. Individuals under 18 years of age were not included in the sample. The use of a second test in phase 11 enhanced the sensitivity of the test. ELISA test detected 53.8% more positive cases than the rapid test. Possibly, the prevalence estimates may have been underestimated in phases 1 to 10.
Serological tests may present false-negative results in the first days of the infection, therefore, it has little diagnostic value for acute cases. The proportion of negative cases among symptomatic individuals with sample collection performed up to the 14th day of the date of symptoms onset decreased throughout the study, from 18.7% in phase 1 to 10.1% in the last phase. The probability of selecting individuals during the first 14 days of infection in the sample was decreasing and the recall bias of the date of symptoms onset was 16.4%.
In conclusion, the prevalence variation of SARS-CoV-2 infection was lower than the variation of the cumulative incidence rate, except for the last phase of the study. The differences in prevalence estimates according to protective measures against SARS-CoV-2 infection reinforce the need to maintain social distancing, mask use and telework in all age groups and social classes.
Sequential phases will allow the monitoring of the pandemic evolution and will verify the effectiveness of the current recommended protection measures for the population. Further studies should consider vaccinated people and the influence of vaccination on the population.
Data Availability
The data that support the findings of this study are available on request from the corresponding author, SMS/COVISA. The data are not publicly available due to restrictions of their containing information that could compromise the privacy of research participants.