Unified dataset for a better understanding of COVID-19
The repository aims at developing a unified dataset by collecting worldwide fine-grained case data, merged with exogenous variables helpful for a better understanding of COVID-19. Available in:
R | Python | MATLAB | Scala | Julia | Node.js | Excel
The data are updated on an hourly basis. Read more
The dataset includes the time series of vaccines, tests, cases, deaths, recovered, hospitalizations, intensive therapy, policy measures and more. See the full dataset documentation.
The data are available at different levels of granularity:
The latest and vintage CSV data files are available here.
You are welcome to join and extend the number of supporting data sources as a joint effort against COVID-19. Join us on Slack to get help, add a new data source and earn a badge.
See the projects and publications that use COVID-19 Data Hub.
We have invested a lot of time and effort in creating COVID-19 Data Hub, please agree to the Terms of Use and cite the following reference when using it:
Guidotti, E., Ardia, D., (2020), "COVID-19 Data Hub", Journal of Open Source Software 5(51):2376, doi: 10.21105/joss.02376.
A BibTeX entry for LaTeX users is:
@Article{, title = {COVID-19 Data Hub}, year = {2020}, doi = {10.21105/joss.02376}, author = {Emanuele Guidotti and David Ardia}, journal = {Journal of Open Source Software}, volume = {5}, number = {51}, pages = {2376} }