tidycseeCOVID19

PLEASE NOTE: CHECK THE DATA BEFORE USING

I have noticed some inconsistencies in the data that are probably related to how I parse the original records. I have not had time to fix these issues.

tidycseeCOVID19 is a data package that aims to provide a curated, tidy version of the JHU CSEE COVID19 daily report data.

Installation

You can install the latest version of the data package from github:

remotes::install_github("klucar/tidycseeCOVID19")

Example

Example of how to load data and create a globe-histogram:

library(tidycseeCOVID19)
data("covid19_daily")
covid19 <- tibble::as_tibble(covid19_daily)

library(ggmap)
library(threejs)
library(tidyverse)
library(tidycseeCOVID19)

data("covid19_daily")
covid19 <- tibble::as_tibble(covid19_daily)

# This probaby isn't the right way, but just get a plot
# out for now.
plot_data <- covid19 %>%
  group_by(Latitude, Longitude) %>%
  summarise(Confirmed_sum = sum(Confirmed))

# Plot the data on the globe
globe <- globejs(lat = plot_data$Latitude,
                 long = plot_data$Longitude,
                 val = 20* log10(plot_data$Confirmed_sum),
                 color = 'red',
                 pointsize = 0.5,
                 atmosphere = TRUE)

globe

Data Transforms

The raw data files have become more consistent recently, but the early data files had many different formats. This dataset has done the following to the data:

Removed Ship information into separate column, keeping Country where available.
Converted Update date/time strings into R dttm format.
Normalized / Standardized Country names.
Separated City, State entries into separate columns
Backfilled Lat/Long data into early entries
Expanded US State abbreviations and other abbreviations.

HELP!

I made my best guess as to what was the right thing to do when normalizing everything. I'm not a world geography expert so if you find errors, please let me know. Here's some things I'm not sure of:

China Mainland Provinces: Some are really close in spelling, not sure if these are separate provinces or typos.
Not sure if Guinea and Ecuatorial Guinea are the same country.
There's a lot of hand-jamming some values, let me know of any inconsistencies you find.

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
R		R
data-raw		data-raw
data		data
.Rbuildignore		.Rbuildignore
.gitignore		.gitignore
.gitmodules		.gitmodules
CoronaGlobe.PNG		CoronaGlobe.PNG
DESCRIPTION		DESCRIPTION
LICENSE		LICENSE
NAMESPACE		NAMESPACE
README.md		README.md
tidycseeCOVID19.Rproj		tidycseeCOVID19.Rproj

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

tidycseeCOVID19

PLEASE NOTE: CHECK THE DATA BEFORE USING

I have noticed some inconsistencies in the data that are probably related to how I parse the original records. I have not had time to fix these issues.

Installation

Example

Data Transforms

HELP!

About

Releases

Packages

Languages

License

klucar/tidycseeCOVID19

Folders and files

Latest commit

History

Repository files navigation

tidycseeCOVID19

PLEASE NOTE: CHECK THE DATA BEFORE USING

I have noticed some inconsistencies in the data that are probably related to how I parse the original records. I have not had time to fix these issues.

Installation

Example

Data Transforms

HELP!

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages