TY - JOUR T1 - A comparative study of SIR Model, Linear Regression, Logistic Function and ARIMA Model for forecasting COVID-19 cases JF - medRxiv DO - 10.1101/2021.05.24.21257594 SP - 2021.05.24.21257594 AU - Saina Abolmaali Y1 - 2021/01/01 UR - http://medrxiv.org/content/early/2021/05/25/2021.05.24.21257594.abstract N2 - Starting February 2020, COVID-19 was confirmed in 11,946 people worldwide, with a mortality rate of almost 2%. A significant number of epidemic diseases including human Coronavirus display patterns. In this study with the benefit of data analytic, we develop regression models and a Susceptible-Infected-Recovered (SIR) model for the contagion to compare the performance of models to predict number of cases. first, we implement a good understanding of data and perform Exploratory Data Analysis (EDA). Then, we derive the parameters of the model from the available data corresponding to the top 4 regions based on the history of infections and the most infected people as of the end of August 2020. Then models are compared and further research are introduced.Competing Interest StatementThe authors have declared no competing interest.Funding StatementThis research was not fundedAuthor DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:There was no IRB needed.All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.Yesor this research we have used GitHub data repository managed by Johns Hopkins University which contains daily time series summary tables, including confirmed, deaths and cases infected for more than once per day. Daily data of the influenced individuals are very helpful for data scientists. All data are from the daily case report, retrieved from: https://github.com/CSSEGISandData/COVID-19. https://github.com/CSSEGISandData/COVID-19 ER -