Recommendation 5
Kelly McConville (chair), Ben Baumer, Nathan Tintle, Mine Dogucu, Patti Frazer Lock
Encourage multivariate thinking.
Introduction
Multivariable thinking (MVT) is critical for students to be able to navigate our complex, data-driven world. Phenomena in areas such as economics, medicine, social sciences, and natural sciences are influenced by a multitude of factors, and many rich data sets with lots of cases and a great number of variables are readily available. MVT is no longer a skill required only of statisticians and data scientists—everyone should be able to think critically about the relationships among many variables and draw conclusions from that investigation.
MVT is not new to statistics, or even new to the introductory statistics course. However, we must do more than sprinkle traces of MVT in our courses. Rather, we must embrace the inherent complexity of MVT and ensure that students get repeated practice disentangling the relationships in data. MVT is needed throughout the data analysis process and is necessary for understanding current research in applied disciplines such as medicine (NJ 2005) and countless others across the natural and social sciences and beyond. Multiple regression is just one technique for engaging with MVT. More broadly, MVT can span multiple units in introductory classes, including exploratory data analysis, statistical modeling, and statistical inference. In this section, we provide concrete suggestions on how to infuse MVT into introductory statistics and data science courses.
First steps towards MVT
Here are some starting points for incorporating MVT into your classes:
Even if you focus on one or two variables in a study in order to help students master key learning objectives, go beyond this and require students to reflect on the broader context in which the datasets exist. (Assessment examples/resources: Problem 1 of this document)
Regularly ask your students to create/find, explore and summarize datasets that have multiple variables. (Assessment examples/resources: First day survey example, student generated game data as explained in depth by Kuiper et al. (2025), problem 2 of this document)
Provide opportunities for students to interpret and create data visualizations that display more than two variables. This, for example, can be achieved simply by coloring points on a scatterplot to represent a third variable. (Assessment examples/resources: Problem 3 of this document, NYTimes “What’s going on in this graph?, Gapminder Tools, this applet)
Ask students to identify potential confounding variables and brainstorm how the impact of additional variables could possibly affect their conclusions. Exploring an example that exhibits Simpson’s Paradox can provide a compelling case for the importance of MVT. (Assessment examples/resources: How Much Do Minority Lives Matter?)
Additional Resources
Annotated bibliography
Assessments