Data Literacy
Kelly McConville
Coffee and Treats with L&IT | Fall 2024
In May of 2020, the Georgia Department of Public Health posted the following graph:
At a quick first glance, what story does the Georgia Department of Public Health graph appear to be telling?
What is misleading about the Georgia Department of Public Health graph? How could we fix this issue?
After public outcry, the Georgia Department of Public Health said they made a mistake and posted the following updated graph:
Alberto Cairo, a journalist and designer, created the second graph of the Georgia COVID-19 data:
A key principle of data visualization is to “help the viewer make meaningful comparisons”.
What comparisons are made easy by the lefthand graph? What about by the righthand graph?
From these graphs, can we get an accurate estimate of the COVID prevalence in these Georgian counties over this two week period?
What are the pros of using wastewater over nasal swabs to assess COVID prevalence? What are the cons?
The graph also incorporates uncertainty measures. Quantifying uncertainty is a key component of data literacy.
Context explains the Monday jumps in the COVID counts.
You have a lot of design choices and these choices can help or hinder the story-telling.
Voluntary COVID test results likely don’t provide good estimates of COVID prevalence.
They sampled the wastewater and then got a range of plausible values for the RNA copies each day.
Need to understand how “raw” data are processed into insights.
What choices were made at each step?
How do those choices impact the conclusions?
About developing reasoning
Requires judgment that takes time to develop
How can the Center support data literacy at Bucknell?
What is data literacy to you?
How do you see generative AI changing or impacting data literacy?