
tutorial_effective_data_plots

Materials for WOMBAT 2024 tutorial

WOMBAT 2024 Tutorial: Creating data plots for effective decision-making using statistical inference with R

Website: https://dicook.github.io/tutorial_effective_data_plots/

This tutorial is for statisticians and data science practitioners who are interested in improving their data visualisation skills.

Presenter: Dianne Cook is Professor of Business Analytics at Monash University in Melbourne, Australia. She is a world leader in data visualisation, especially the visualisation of high-dimensional data using tours with low-dimensional projections, and projection pursuit. She is currently focusing on bridging the gap between exploratory graphics and statistical inference. Di is a Fellow of the American Statistical Association, past editor of the Journal of Computational and Graphical Statistics, current editor of the R Journal, elected Ordinary Member of the R Foundation, and elected member of the International Statistical Institute.

Structure of tutorial

Background: Participants should have a good working knowledge of R and the tidyverse, and some experience with ggplot2. Familiarity with the material in R4DS is helpful.

Course Schedule

time topic
1:30-1:45 Why, philosophy and benefits
1:45-2:05 Organising data to map variables to plots
2:05-2:35 Making a variety of plots
2:35-3:00 Do but don’t, and cognitive principles
2:30-3:00 BREAK
3:00-3:20 What is your plot testing?
3:20-3:35 Creating null samples
3:35-4:00 Conducting a lineup test
4:00-4:30 Testing for best plot design

Session 1 Slides

Session 2 Slides

Zip file of materials

Getting started

  1. You should have a reasonably up-to-date version of R and RStudio, e.g. RStudio 2024.09.0+375 and R version 4.4.1 (2024-06-14). Install the following packages and their dependencies:
install.packages(c("ggplot2", "tidyr", "dplyr", "readr", "stringr", "nullabor", "colorspace", "palmerpenguins", "broom", "ggbeeswarm", "vcd", "MASS", "conflicted"), dependencies=c("Depends", "Imports"))
  2. Download the Zip file of materials to your laptop, and unzip it.

  3. Download just the R scripts, slides1.R, slides2.R

  4. Open RStudio by clicking on tutorial.Rproj.
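
Once the steps above are done, a quick check from the R console can confirm that the packages installed correctly. This is a minimal sketch and not part of the tutorial materials: it assumes only the package names from the install command above, and the variable names `pkgs`, `installed`, and `missing` are illustrative.

```r
# Packages listed in the install command above
pkgs <- c("ggplot2", "tidyr", "dplyr", "readr", "stringr", "nullabor",
          "colorspace", "palmerpenguins", "broom", "ggbeeswarm", "vcd",
          "MASS", "conflicted")

# requireNamespace() returns FALSE, rather than raising an error,
# when a package is not installed
installed <- vapply(pkgs, requireNamespace, logical(1), quietly = TRUE)
missing <- pkgs[!installed]

if (length(missing) == 0) {
  message("All tutorial packages are installed.")
} else {
  message("Still missing: ", paste(missing, collapse = ", "))
}
```

If any names are reported as missing, rerunning `install.packages()` on just those names is usually enough.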

The GitHub repo with all materials is https://dicook.github.io/tutorial_effective_data_plots/.