Data Portal

Data Sources

A collection of global and international data sources available to The Trinity Challenge participants.

There's a wealth of data available

The Trinity Challenge’s data catalogue is collation of a broad range of global and local data sources which could serve as meaningful inputs or starting points for The Challenge.

These are intended as examples of available open-source datasets and not as an exhaustive list of all possible datasets, and hence a first attempt to bring together critical datasets for global public health from a data landscape that has historically been fragmented and unstructured.

Health

Health

Environmental & Animal

Environmental & Animal

Demographic

Demographic

Mobility & Social

Mobility & Social

Socio-economic

Socio-economic

Government & Policy

Government & Policy

Microbiology & viral ecology

Datasets provide overview of various genomic data, e.g. viral sequence data, human genetic variation, and other gene mapping data

The Broad Institute DNA/RNA sequencing
NCBI Viruses complete genomes
EBI data resources
Elixir Celular data catalogue

Symptoms

Data on tracked COVID-19 symptoms

Israel disease detection survey
COVID-19 Healthy vs patient chest X-ray
MIT aggregated COVID-19 related clinical studies

Cases

Data on COVID-19 cases by various factors, e.g. geography (state and sub-state), ethnicity, age; as well as adjacent data such as test-positivity rates and R0 levels

John Hopkins University Covid-19 cases
CDC Covid-19 case surveillance
Open COVID-19 Data Working Group: epidemiological data
US COVID-19 cases/deaths by ethnicity
France COVID-19 cases by age
US COVID-19 health outcomes
US COVID-19 Live Rt values per state

Hospital capacity

Data on healthcare system capacity, e.g. projections of available hospital beds under different COVID scenarios, or real-time data of available hospital staff

US hospital bed capacity
Germany hospital staff capacity
NPGEO Germany nursing home geographic distribution

Morbidity & mortality

Data on human morbidity and mortality from various diseases, and non-disease causes; as well as analysis of mortality fluctuations over time (e.g. COVID excess mortality)

The Economist's tracker for COVID-19 excess deaths
The Human Mortality Database

Health statistics

Data on how long and how healthy different populations live; as well as on health system metrics and performance

OECD Health statistics
Global Health Data Exchange
Tencent Health COVID-19 live updates

Environmental

Environmental datasets with relevance for infectious diseases modelling, including projective climate data, biodiversity, air and water quality, and data on waste

Environmental data to study infectious diseases
OECD environmental data
NASA Earth science datasets
The Knowledge network for Biocomplexity

Animal Reservoirs

Data on local biodiversity as relevant for predicting zoonotic risks, e.g. livestock density and potential for human-animal interaction in agricultural production systems

PREDICT project database
Agri-environmental indicator
EU animal production statistics

Consumer behavior

Data on spending and consumption patterns as affected by COVID-19

Germany road tolls for commercial vehicles

Employment

Data on employment, unemployment and changes over time

The World Bank database on employment

Education

Data on education access, completion, and disruption, including due to COVID

Unesco school closures due to COVID-19
National impact of COVID-19 on education and learning

Economic activity

Data on economic activity including growth, trade, labour, etc., by locality

EU Total wages and salaries
The World Bank global distribution of economic activity
Eurostat covid 19 datasets