Created
October 19, 2020 20:28
Issues with COVID Datasets
Daily Cases (https://usafactsstatic.blob.core.windows.net/public/data/covid-19/covid_confirmed_usafacts.csv)
Data Entry Error in Glenn County CA on 7/2/2020 - 1160 should be 116
Major drop in cumulative count in Nueces County TX on 7/25/2020 - fall of 18.4%. Should follow up with TX HHS
Major drop in cumulative count in Midland County TX on 7/21/2020 - fall of 13.5%. Should follow up with TX HHS
Data Entry Error in Baltimore County MD on 7/17/2020 - 9141 should be 9411. Also, none of the Baltimore County MD numbers match the county's publicly reported data: https://bc-coronavirus-response-bc-gis.hub.arcgis.com/
Data Entry Error in Dixie County FL on 9/5/2020 - 368 should be 768
Data Entry Error in Jasper County MO on 7/3/2020 - 515 when the days on both sides are 620; it is the only outlier in the cumulative series.
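Most of the case errors above show up as a cumulative count that falls from one day to the next, which a cumulative series should not do. A minimal sketch of that check, assuming each county's row has already been read into a list of integers; the function name `find_cumulative_drops` is hypothetical and this is not necessarily how the errors above were found:

```python
def find_cumulative_drops(counts):
    """Return (index, percent_fall) for every day whose cumulative
    count is lower than the previous day's count."""
    drops = []
    for i in range(1, len(counts)):
        prev, cur = counts[i - 1], counts[i]
        if cur < prev:
            # prev > cur >= 0 here, so prev is never zero
            drops.append((i, round(100 * (prev - cur) / prev, 1)))
    return drops


# Jasper County MO around 7/3/2020, using the values reported above:
# 620 on both neighboring days, 515 on the day itself.
jasper = [620, 515, 620]
print(find_cumulative_drops(jasper))  # [(1, 16.9)]
```

An entry that is too high (such as 1160 entered for 116) would be flagged on the following day, when the series falls back to its real level.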
Daily Deaths (https://usafactsstatic.blob.core.windows.net/public/data/covid-19/covid_deaths_usafacts.csv)
Error in Glenn County CA on 7/2/2020 - should be zero, entered as 112
Unclear what the unallocated statewide numbers mean - they do not follow the same cumulative totaling. Should these numbers be excluded from calculations?
Newport RI - 9/16 through 9/22 deaths are zeroed - data is missing/wrong (errors not present in RI state data or in JHU data)
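Zeroed stretches like the Newport RI one can be located by looking for zeros that appear after a cumulative series has already been positive. A rough sketch under the same assumption (one county's cumulative deaths as a list of integers); `find_zeroed_runs` is a hypothetical helper and the sample values are made up for illustration, not taken from the dataset:

```python
def find_zeroed_runs(counts):
    """Return (start, end) index pairs, inclusive, for each run of zeros
    that occurs after the cumulative series has already been positive."""
    runs = []
    seen_positive = False
    start = None
    for i, c in enumerate(counts):
        if c > 0:
            seen_positive = True
            if start is not None:
                # A zero run just ended on the previous day
                runs.append((start, i - 1))
                start = None
        elif seen_positive and start is None:
            start = i
    if start is not None:
        # The series ends while still zeroed
        runs.append((start, len(counts) - 1))
    return runs


# Illustrative values only: leading zeros are legitimate, later ones are not.
sample = [0, 0, 3, 3, 0, 0, 0, 4]
print(find_zeroed_runs(sample))  # [(4, 6)]
```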
Hi @tylergibson -- We've seen this and the team is working on it. Thank you!