Big Data Are No Substitute for Personal Input in Surveys

Big Data Are No Substitute for Personal Input in Surveys

Despite today’s researchers in the social sciences having access to historically unparalleled amounts of data, many aspects of contemporary social developments have proven difficult to predict. National elections and the ongoing coronavirus pandemic are highly visible examples where current systems have been challenged to provide accurate forecasts.

When the analysis of digital data reaches its limits, methods that focus on observations made by individuals can be useful. In contexts such as the coronavirus pandemic, a method called human social sensing can elicit information that is difficult to obtain from digital trace data. Frauke Kreuter, professor of statistics and data science in the social sciences and the humanities at Ludwig-Maximilians-Universitat (LMU) in Munich, is now using this method with the global “COVID Trends & Impact Survey” to predict the course of the pandemic.

“We shouldn’t focus too narrowly on the analysis of digital trace data. It’s a mistake to ignore the fact that people are equipped with sensory capacities. Those can be particularly useful in areas that are difficult to capture with data from behavioral traces,” says Kreuter. Many social phenomena develop very rapidly—and often in unexpected directions. This makes it impossible to collect enough up-to-date passive data to enable researchers to trace the trajectories of such trends.

The individual as a human sensor plays a central role

In a research paper that has just appeared in the journal Nature, Kreuter and her international coauthors demonstrate that interviews with individual informants still have a crucial role to play, even in an era where big data generates the biggest headlines. “One should never lose sight of the fact that surveys can serve as a very valuable source of supplementary information,” she says.

An individual interviewee is in a position to supply information not only about her own situation, but also, “individual interviews allow one to ask individuals about themselves and also about things in their local environment. However, the latter possibility has long been overlooked.” And now, in this current context, it is beneficial to make use of the fact that the individual can be viewed as a potentially informative “social sensor” for what is happening around him.

The development of the coronavirus pandemic as a test case

As a statistician, Kreuter chose to employ together with her colleagues this approach in the current global “COVID Trends & Impact Survey,” which attempts to forecast the trajectory of the coronavirus pandemic. The survey began in April 2020, and over 55 million people globally have been interviewed. One of the queries asks interviewees whether they personally know anyone in their local community with COVID-19 symptoms.

This is of particular salience as, “the answers to this question turn out to be a strong predictor of the further development of the pandemic,” says Kreuter. Preliminary results of the study have already appeared as a preprint on MedRxiv. “The crucial prerequisite for the use of individuals as sensors in social science surveys is that the informants chosen must constitute a controlled and representative sample of the population of interest.”

– This press release was originally published on the Ludwig-Maximilians-Universität München website