Skip to content

Exploratory data analysis in Microsoft SQL Server of covid data from Feb 2020 to Oct 2022

Notifications You must be signed in to change notification settings

Brijesh403/SQL-Covid_Exploratory_Data_Analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 

Repository files navigation

SQL-Covid_Exploratory_Data_Analysis

Exploratory data analysis in Microsoft SQL Server of covid data from Feb 2020 to Oct 2022


Table of Contents

  1. Data
  2. Objective
  3. Processing and Cleaning
  4. SQL
  5. Conclusion and Insights
  6. Visualization

Exporatory data analysis of the covide data to practice SQL in Microsoft SQL Server.

Data

Data source : Public data from Ourworldindata.com Data Source

Edouard Mathieu, Hannah Ritchie, Lucas Rodés-Guirao, Cameron Appel, Charlie Giattino, Joe Hasell, Bobbie Macdonald, Saloni Dattani, Diana Beltekian, Esteban Ortiz-Ospina and Max Roser (2020) 
- "Coronavirus Pandemic (COVID-19)". 
Published online at OurWorldInData.org. 
Retrieved from: 'https://ourworldindata.org/coronavirus' [Online Resource]

The date range : February 2020 to October 2022 (61 MB data)

File Type: csv format

File Content: CSV file consist of 70 columns -which contain information related to Covid numbers reported, GDP, Populations etc. Number of rows are 2,23,788 from excel files.

Objective

Covid was on of the worst pendemic which hid the world it did impacted many nation in may ways to understand the impact on diffrent country using SQL in Microsoft SQL Server

Processing and Cleaning

  • To simplyfy the process of data we stored the source file and made a copy of data.
  • Divide the data in to two groups of
    • Covid_Death_Table
    • Covid_Vactination_Table

SQL

  • SQL storing, manipulating and retrieving use in project
    • Joins
    • Aggregations
    • RegEx
    • CASE statment

Conclusion and Insights

The coronavirus disease continues to spread across the world following a trajectory that is difficult to predict. Covid is still not over and if not properly handled can occure more severe sinarios in future.

  • The future predictions and trend analysis is shown in below Vizualization.
  • According to this analysis the senario of COVID 19 could be resolve around May 2023.

Visualization

To see the Visualization please refer the link below: (https://public.tableau.com/app/profile/brijesh.vaghela/viz/ExploratoryDataAnalysisCOVID/EDACOVID)

About

Exploratory data analysis in Microsoft SQL Server of covid data from Feb 2020 to Oct 2022

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published