Covid-19 Cases in India: A Visual Exploratory Data Analysis Model

Covid-19 outbreak was first reported in Wuhan, China. The deadly virus spread not just the disease, but fear around the globe. On January 2020, WHO declared COVID-19 as a Public Health Emergency of International Concern (PHEIC). First case of Covid-19 in India was reported on January 30, 2020. By the time, India was prepared in fighting against the virus. India has taken various measures to tackle the situation. In this paper, an exploratory data analysis of Covid-19 cases in India is carried out. Data namely number of cases, testing done, Case Fatality ratio, Number of deaths, change in visits stringency index and measures taken by the government is used for modelling and visual exploratory data analysis.


Introduction
The new panic in town, the novel Coronavirus disease, abstracted as COVID-19 was first reported as an unexpected outbreak in Wuhan, the capital city of Hubei Province in the People's Republic of China. Within a few weeks, the disease emerged contagious and spread to different parts of the world. The stupendous containment measures taken by the Chinese government to arrest the spread of the virus got washed down the drain. Being victims of the panic and lack of sustainability issues, countries around the globe went to lock down.
A rising infectious disease alarms as rapid spreading, putting in danger the health of a large community, the weak being most vulnerable. Immediate action at the root level is required to tackle the virus [1][2][3]. The initial case of Covid-19 in India was reported on January 30, 2020 in Kerala, a southern state in India. The Kerala government accepting advices from the central boosted up its health and hygiene department and bought in vivid containment, isolation, tracking and many other containment measures to arrest the community spread. On India's case fatality rate is one among the lowest in the world at 2.41% as of 23 July and is steadily declining.
Lakshadweep is the only region which has not reported a case. India's recovery rate positions at 63.18% as on 23 July 2020. On 10th June, India's recoveries surpassed active cases for the first time [19][20][21]. In this paper an exploratory data analysis and regression analysis is conducted for the cases in India by considering number of cases, testing done, Case Fatality ratio, Number of deaths, change in visits stringency index and measures taken from January to July 2020.

Data and Methods
Data for this analysis were collected from various sources. The Government of India bulletins publishes the daily details of the condition of Covid-19 situation. Apart from that author uses data from World Health Organization (WHO), 'Our World in Data', John Hopkins Corona Resource centre [15][16][17]. Tweets and Facebook corona updates of Health minister were also used for the study. Media reports around the world were also a source of data. All these sources were used to collect data. MATLAB 2020a is used for the exploratory data analysis. The raw data was cleaned using the software. The platform used for the analysis is Windows 7(64 bit). The regression analysis is done using R. All the data collected and the code generated in MATLAB and R is made available in Github (https://github.com/callmejhe1/Covid-19-India). Following equations were used for the analysis. Linear regression is conducted using the equations . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint

Measures Taken by the country
The outbreak was pronounced as epidemic in many states and union territories, where Epidemic Diseases Act, 1897 have been implemented, leading to the closure of educational and commercial institutions. All tourist transit visas were temporarily suspended, as most of the confirmed cases were mainly from outside [18]. State governments took various methods to contain the spread of the virus inside their local bodies.
In preparation to tackle the pandemic cases various measures and decisions were taken by the state governments as well as Central governments. Throughout the world, the impact of COVID-19 pandemic has been extraordinary, and industries, Small and medium enterprises (SME) in specific, are stressed to maintain and keep up their trade. The government announced intensive measures to ease the affected businesses and taxpayers. The government has asked employers not to dismiss their employees or decrease salaries during the pandemic phase. Additionally, the finance ministry has been immensely working on an economic package to mitigate the impact of coronavirus on the Indian economy. the government announced a complete waiver of introduced a special refund and drawback disposal drive for priority processing and disposal of all pending refund and drawback claims. The measures taken by the government during the pandemic worth appreciation. It provided the much-needed assistance to the industries and eases trade at large.

New Covid-19 Cases
The first case of Covid-19 in India was reported on 30 January in Kerala. There were only three cases by  Case fatality ratio(CFR) of India is found to be falling progressively and is 2.25% in July end. This CFR is lowest in the world. Figure 2 represents the stacked chart representing the new cases reported and new deaths due to Covid-19. The CFR was at 3.33% during mid-June 2020. On July 24, the number of cases reached near 15 lakhs. The worst affected state in India is Maharashtra and Tamilnadu. Tamilnadu saw a slow spread of the virus. Andra Pradesh, Delhi and Karnataka had crossed 1 lakh cases.
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted September 23, 2020. . https://doi.org/10.1101/2020.09.11.20193029 doi: medRxiv preprint . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted September 23, 2020. . https://doi.org/10.1101/2020.09.11.20193029 doi: medRxiv preprint

Case Fatality Rate and Test per Million
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review) preprint
The copyright holder for this this version posted September 23, 2020. . https://doi.org/10.1101/2020.09.11.20193029 doi: medRxiv preprint Case fatality ratio (CFR) is the ratio of Confirmed deaths and confirmed cases. The CFR and Test per million is shown in the figure 5. Case fatality ratio(CFR) of India is found to be falling progressively and is 2.25% in July end. This CFR is lowest in the world. CFR of mid-June was 3.33%. Continuous increase in the recovery cases were seen due to the effective measures taken by the centre and state. The recovery rate increased sharply to 64% in July end with respect to 53% in the mid-June 2020.

Stringency Index
Stringency index is a score assigned to countries by University of Oxford researchers about the measures taken to fight against the pandemic. The higher score means stricter the country's measures were. The figure 6 shows the stringency Index during the period. It can be seen that the stringency Index becomes 100 during the lockdown period. The World Health Organization (WHO) and United Nations (UN) have applauded response and action taken by India as 'comprehensive and robust'. Termed lockdown restriction as 'aggressive but vital'.
The Oxford Covid-19 Government response Tracker (OxCGRT) marked India's actions to fight against the virus.
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review) preprint
The copyright holder for this this version posted September 23, 2020. . https://doi.org/10.1101/2020.09.11.20193029 doi: medRxiv preprint Figure 6 Stringency Index  is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review) preprint
The copyright holder for this this version posted September 23, 2020. . https://doi.org/10.1101/2020.09.11.20193029 doi: medRxiv preprint Figure 8. Scatter plot matrix and linear regression fit models Figure 9 The prediction fit model

Change in visitors
Due to the Covid-19 pandemic many restrictions were imposed in the country. As a result the movement of citizens were also affected. The figure 10 shows the change in visitors at retail and recreation, grocery, . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review) preprint
The copyright holder for this this version posted September 23, 2020. . https://doi.org/10.1101/2020 pharmaceutical stores, parks, residential, transit station and workplaces. During the period of the pandemic. The zero percentage line represents the normal visits. The parallel plot for the Covid-19 pandemic is shown in the figure 11. Variation of all the elements can be noted from the figure.

Conclusion
An exploratory data analysis was conducted on the Covid-19 cases in India. Data were collected from various sources for the period of January to July 2020. .Data was cleaned and arranged in order to do the analysis.
MATLAB 2019a and R were used for the analysis. India took stringent measures for fighting against the pandemic. Countrywide lockdown, campaigns, contact tracing, containment zones were introduced for the same.
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted September 23, 2020. . https://doi.org/10. 1101/2020 The Oxford Covid-19 Government response Tracker (OxCGRT) marked India's actions to fight against the virus and gave a score of 100. Case fatality ratio (CFR) of India is found to be falling progressively and is 2.25% in July end. This CFR is lowest in the world. The CFR was at 3.33% during mid-June 2020. The confirmed cases in India crossed 15 lakhs by July end. Number of tests and laboratories were increased progressively. The recovery rate is also found to be climbing up. The visual exploratory data analysis done in this manuscript gives a detail view of the 6. Daily COVID-19 bulletin. "PIB India (@PIB_India) on Twitter". Retrieved 3 June 2020 -via Twitter . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted September 23, 2020. . https://doi.org/10.1101/2020  . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted September 23, 2020. . https://doi.org/10. 1101/2020