@tylergibson
Created October 19, 2020 20:28
Issues with COVID Datasets
Daily Cases (https://usafactsstatic.blob.core.windows.net/public/data/covid-19/covid_confirmed_usafacts.csv)
Data Entry Error in Glenn County CA on 7/2/2020 - 1160 should be 116
Major unexplained decline in Nueces County TX on 7/25/2020 - count falls 18.4%; should follow up with TX HHS
Major unexplained decline in Midland County TX on 7/21/2020 - count falls 13.5%; should follow up with TX HHS
Data Entry Error in Baltimore County MD on 7/17/2020 - 9141 should be 9411. Also, none of the Baltimore County MD numbers match the county's publicly reported data: https://bc-coronavirus-response-bc-gis.hub.arcgis.com/
Data Entry Error in Dixie County FL on 9/5/2020 - 368 should be 768
Data Entry Error in Jasper County MO on 7/3/2020 - 515 entered when the days on both sides are 620; it is the only outlier in the cumulative series.
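Issues like the ones above can be surfaced automatically, because a cumulative count should never fall from one day to the next. The following is a minimal sketch, not the method used to find these errors: the function names are hypothetical, and the only real values are the Jasper County 620 / 515 / 620 figures quoted above.

```python
def find_cumulative_drops(counts):
    """Return (index, previous, current, pct_change) for each day the count falls."""
    drops = []
    for i in range(1, len(counts)):
        prev, cur = counts[i - 1], counts[i]
        if cur < prev:  # prev > cur >= 0, so prev is never zero here
            pct = (cur - prev) / prev * 100
            drops.append((i, prev, cur, round(pct, 1)))
    return drops


def find_isolated_outliers(counts):
    """Return indices of single days that break an otherwise non-decreasing run.

    A day is flagged when its two neighbors are consistent with each other
    (right >= left) but the day itself lies outside the range they bound.
    """
    flagged = []
    for i in range(1, len(counts) - 1):
        left, mid, right = counts[i - 1], counts[i], counts[i + 1]
        if right >= left and (mid < left or mid > right):
            flagged.append(i)
    return flagged


# Jasper County MO around 7/3/2020, using the values quoted in the list above.
jasper = [620, 515, 620]
print(find_cumulative_drops(jasper))
print(find_isolated_outliers(jasper))
```

The first check would catch sustained declines such as the Nueces and Midland entries; the second targets single-day entry errors such as a transposed or extra digit.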
Daily Deaths (https://usafactsstatic.blob.core.windows.net/public/data/covid-19/covid_deaths_usafacts.csv)
Error in Glenn County CA on 7/2/2020 - should be zero, entered as 112
Unclear what the unallocated statewide numbers mean - they do not follow the same cumulative totaling as the county rows. Should these numbers be excluded from calculations?
Newport RI - 9/16 through 9/22 deaths are zeroed - data is missing/wrong (errors not present in RI state data or in JHU data)
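Until the meaning of the unallocated statewide rows is confirmed, one cautious option is to compute state totals both with and without them and compare. The sketch below assumes the file layout (a `countyFIPS` column that is `0` on unallocated rows, a `State` column, and one column per date); that layout is an assumption to check against the real file, the sample rows are invented for illustration, and `state_totals` is a hypothetical helper.

```python
import csv
import io

# Invented sample rows in the ASSUMED layout of the deaths file; not real data.
SAMPLE = """countyFIPS,County Name,State,9/15/20,9/16/20
0,Statewide Unallocated,RI,5,3
44005,Newport County,RI,10,0
44007,Providence County,RI,700,702
"""


def state_totals(csv_text, date_col, include_unallocated):
    """Sum one date column per state, optionally skipping unallocated rows."""
    totals = {}
    for row in csv.DictReader(io.StringIO(csv_text)):
        # Assumption: unallocated rows are identified by countyFIPS == 0.
        is_unallocated = row["countyFIPS"].strip() == "0"
        if is_unallocated and not include_unallocated:
            continue
        state = row["State"]
        totals[state] = totals.get(state, 0) + int(row[date_col])
    return totals


with_unallocated = state_totals(SAMPLE, "9/16/20", include_unallocated=True)
without_unallocated = state_totals(SAMPLE, "9/16/20", include_unallocated=False)
print(with_unallocated, without_unallocated)
```

Reporting both totals makes the size of the ambiguity visible without committing to either interpretation.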
@anthonyp-usafacts

Hi @tylergibson -- We've seen this and the team is working on it. Thank you!
