PT - JOURNAL ARTICLE
AU - Leandro Pereira Garcia
AU - André Vinícius Gonçalves
AU - Matheus Pacheco Andrade
AU - Lucas Alexandre Pedebôs
AU - Ana Cristina Vidor
AU - Roberto Zaina
AU - Ana Luiza Curi Hallal
AU - Graziela De Luca Canto
AU - Jefferson Traebert
AU - Gustavo Medeiros de Araujo
AU - Fernanda Vargas Amaral
TI - ESTIMATING UNDERDIAGNOSIS OF COVID-19 WITH NOWCASTING AND MACHINE LEARNING – EXPERIENCE FROM BRAZIL
AID - 10.1101/2020.07.01.20144402
DP - 2020 Jan 01
TA - medRxiv
PG - 2020.07.01.20144402
4099 - http://medrxiv.org/content/early/2020/07/02/2020.07.01.20144402.short
4100 - http://medrxiv.org/content/early/2020/07/02/2020.07.01.20144402.full
AB - Background: Brazil has the second-largest number of COVID-19 cases worldwide. Even so, underdiagnosis in the country is massive. Nowcasting techniques have helped to compensate for underdiagnosis, and recent advances in machine learning offer opportunities to refine nowcasting. This study aimed to analyze the underdiagnosis of COVID-19, through nowcasting with machine learning, in a state capital in the South of Brazil. Methods: The study has an observational ecological design. It used data from 3916 notified cases of COVID-19, from April 14 to June 2, 2020, in Florianópolis, Santa Catarina, Brazil. We used a machine-learning algorithm to classify cases that had no diagnosis yet, producing the nowcast. To analyze underdiagnosis, we compared the data without nowcasting against the median of the nowcasted projections, both for the entire period and for the last six days before data extraction, the interval from symptom onset to diagnosis. Results: The number of new cases over the entire period, without nowcasting, was 389. With nowcasting, it was 694 (95% UI 496-897,025). In the six-day period, the number was 19 without nowcasting and 104 (95% UI 60-142) with nowcasting.
The underdiagnosis was 37.29% over the entire period and 81.73% in the six-day period. Conclusions: Underdiagnosis was more critical in the six days before data collection, the interval from symptom onset to diagnosis, than over the entire period. Nowcasting with machine learning techniques can help to estimate the number of new cases of the disease. Competing Interest Statement: The authors have declared no competing interest. Funding Statement: This work has not received any financial support. Thus, there is no funding interest in the study design, data collection, data analysis, data interpretation, writing of the manuscript, or in the decision to submit the manuscript for publication. The content is solely the responsibility of the authors and does not necessarily represent the official views of the funding sources. Author Declarations: I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained. Yes. The details of the IRB/oversight body that provided approval or exemption for the research described are given below: This project was submitted to the Ethics in Research with Human-Beings Council at the Federal University of Santa Catarina to guarantee alignment with Resolution No. 466/2012 of the National Health Council of Brazil. The research project was approved under CAE No. 33374820.2.0000.0121/2020. We used exclusively secondary and anonymized databases. All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived. Yes. I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov.
I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance). Yes. I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable. Yes. Anonymous scripts and all databases used are available at: https://github.com/lpgarcia18/underdiagnosis_of_covid_19_cases_in_brazil
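
The abstract reports underdiagnosis as a percentage but does not state the formula. One plausible reading, offered here only as an assumption, is the share of nowcasted cases missing from the observed count: (nowcast median - observed) / nowcast median. The sketch below applies that reading to the six-day figures in the Results; the function name `underdiagnosis_pct` is hypothetical and does not come from the authors' repository.

```python
def underdiagnosis_pct(observed: int, nowcast_median: float) -> float:
    """Percentage of nowcasted cases absent from the observed count.

    Assumed definition (not stated in the abstract):
    100 * (nowcast_median - observed) / nowcast_median
    """
    if nowcast_median <= 0:
        raise ValueError("nowcast_median must be positive")
    return 100.0 * (nowcast_median - observed) / nowcast_median


# Six-day period from the Results: 19 observed cases, nowcast median 104.
six_day = underdiagnosis_pct(19, 104)
print(f"{six_day:.2f}%")  # prints "81.73%"
```

This reproduces the 81.73% reported for the six-day period. Applied to the rounded full-period totals (389 and 694), the same arithmetic gives about 43.95% rather than the reported 37.29%, so the authors may have derived the full-period figure differently.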