Seeing More in Your Data

New Visual Tools for Uncertainty, High-Dimensional Exploration, and Model Diagnostics

Dianne Cook
Department of Econometrics & Business Statistics
Monash University

2026-06-05

Overview

Three new developments that make statistical visualisation tackle contemporary ideals.

Uncertainty — showing what we don’t know (ggdibbler)
Model diagnostics — automating the reading of residual plots (nullabor / autovi)
High dimensions — seeing beyond two variables (tourr / mulgar)

Preamble

Doing research reproducibly

literate programming, e.g. quarto documents, where code and text live together. Avoids cut and paste and mistakes made by forgetting to update if parameters or data change.
version control, e.g. GitHub, to fully track changes to the work, able to back-track, branch

See the link at the bottom of these slides for resources to reproduce this talk.

Part 1

Visualising Uncertainty with ggdibbler

Harriet Mason’s PhD research

The problem: the ways that we typically display uncertainty, or not

Code

library(tibble)
library(dplyr)
library(ggplot2)
library(scales)
library(colorspace)
library(sf)
library(ggthemes)
set.seed(416)
d <- tibble(x = rep(c("A1", "A2", "A3"), 4), 
            y = rep(c("B1", "B2", "B3", "B4"), c(3, 3, 3, 3)),
            v = sort(runif(12, 65, 68)))
d_plt1 <- ggplot(d, aes(x = x, y = y, fill = v)) +
  geom_tile() +
  scale_fill_continuous_divergingx(palette = "Zissou 1", mid=66.5) +
  xlab("") + ylab("") +
  ggtitle("No uncertainty shown") +
  theme_minimal() +
  theme(legend.position = "none", 
    axis.text = element_text(size = 20),
    title = element_text(size=20))
d_plt1 + geom_text(aes(x = x, y = y, label = number(v, accuracy = 0.1)), size=8)

When uncertainty is invisible, decisions look more confident than they are.

`ggdibbler`: distributions as data

Replace a fixed value with a distribution — the plot does the rest.

Code

library(ggdibbler)
library(distributional)
library(patchwork)

d_uncertain_lo <- d |>
  group_by(x, y) |>
  mutate(est = dist_normal(v, runif(1, 0.5, 1.5))) |>
  ungroup() 

d_plt2 <- ggplot(d_uncertain_lo, aes(x = x, y = y, fill = est)) +
    geom_tile_sample() + # same syntax, uncertain output
    scale_fill_continuous_divergingx(palette = "Zissou 1", mid=66.5) +
    xlab("") + ylab("") +
    ggtitle("Low uncertainty") +
    theme_minimal() +
    theme(legend.position = "none", 
    axis.text = element_text(size = 20),
    title = element_text(size=20))  

d_uncertain_hi <- d |>
  group_by(x, y) |>
  mutate(est = dist_normal(v, runif(1, 0.5, 10.5))) |>
  ungroup() 

d_plt3 <- ggplot(d_uncertain_hi, aes(x = x, y = y, fill = est)) +
    geom_tile_sample() + # same syntax, uncertain output
    scale_fill_continuous_divergingx(palette = "Zissou 1", mid=66.5) +
    xlab("") + ylab("") +
    ggtitle("High uncertainty") +
    theme_minimal() +
    theme(legend.position = "none", 
      axis.text = element_text(size = 20),
      title = element_text(size=20))

d_plt1 + d_plt2 + d_plt3

Works with any ggplot2 geom via geom_*_sample()
Accepts continuous, discrete, spatial (sf), and mixed distributions
Uses the distributional package — normal, empirical, truncated, mixed
No new syntax to learn beyond replacing the variable

Philosophical foundation

ggdibbler’s philosophy: uncertainty visualisation should

enhance statistically significant signals to reinforce confidence
suppress apparent signals that spurious.

We call this “signal modulation”.

Don’t treat noise as a separate signal (two variables, signal + noise) or a statistic. Visualise noise and signal together as a “single integrated uncertain value”.

The visualisation should express the uncertainty without additional cognitive load.

Software: `ggdibbler`

Drop-in replacement for ggplot2
Applicable to estimates, predictions, bounded measurements, and more

Stepping backwards: grammar of graphics

The ggplot2 package specifies a plot using a grammar, that maps variables from tidy data into plot elements.

This also provides a tight connection between the plot and statistics.

\[ \bar{x} = \sum_{i=1}^n x_i \]

\(\longrightarrow\)

ggplot(data, 
  aes(x = v1, 
      y = v2, 
      colour = cl)

We can swap out data for a distribution, or with null samples, to assess uncertainty or significance of structure.

Part 2

Plots as statistics, visual inference with the lineup protocol
Automating residual plot diagnostics with computer vision, Weihao (Patrick) Li’s thesis

Why residual plots matter — and why they’re hard

After fitting a regression model, we plot the residuals to check our assumptions.

The problem:

Numerical tests (Breusch-Pagan, Shapiro-Wilk) are either insensitive or over-sensitive
Visual inspection is effective but subjective and unscalable
Different analysts reading the same plot reach different conclusions

“Does this pattern look like a problem, or is it just noise?”

The lineup protocol: visual inference (1/2)

Example 1
without nulls
with nulls

Code

library(nullabor)

cars_lm <- lm(mpg ~ hp, data=mtcars)
cars_d <- mtcars |>
  mutate(.resid = residuals(cars_lm),
         .fitted = fitted(cars_lm))
# write_csv(cars_d, file="data/cars_d.csv")         

set.seed(320)
l1 <- lineup(null_lm(mpg ~ hp, method = "rotate"), 
       true = cars_d, n = 12)

cars_plt <- ggplot(l1, aes(x=hp, y=.resid)) +
  geom_point(alpha = 0.8) +
  facet_wrap(~.sample, ncol=4) +
  xlab("") +
  theme_few() +
  theme(axis.text = element_blank())

Which plot is the most different?

Code

set.seed(332)
d <- tibble(.fitted = -rexp(n=84*12),
            .resid = rnorm(n=84*12),
            .sample = rep(1:12, 84))
# write_csv(d, file="data/sim_exp.csv")            

d |>
  dplyr::filter(.sample == 1) |>
  ggplot(aes(x=.fitted, y=.resid)) +
    geom_hline(yintercept = 0, colour = "red") +
    geom_point(alpha = 0.8) +
    theme_bw() +
    theme(axis.text = element_blank())

Is there a problem with the model fit? Like heteroskedasticity?

Code

sk_l <- ggplot(d, aes(x=.fitted, y=.resid)) +
  geom_hline(yintercept = 0, colour = "red") +
  geom_point(alpha=0.8) +
  facet_wrap(~.sample, ncol=4, scales="free") +
  theme_bw() +
  theme(axis.text = element_blank(),
        axis.title = element_blank())

All of these are null samples. There is no relationship between residuals and fitted.

Which one did I show you?

The lineup protocol: visual inference (2/2)

The nullabor package provides a formal framework for reading plots.

Generate null plots by permuting or simulation under H₀
H₀ is specified by the plot mappings, when using the grammar of graphics
Embed the real plot in a lineup of decoys
Ask: which plot is the most different?
If it is the true data (\(p < 0.05\), if num plots \(= 20\)), the pattern is statistically detectable
The approach has been validated against conventional tests
Provides significance testing in problems where there are no existing tests

From human judgement to computer vision

Train a computer vision model to read residual plots the way a human would read lineups.

Generate thousands of residual plots under known conditions: bad (various model misspecifications) and good (no violation of error assumptions)
Train a convolutional neural network on these synthetic plots
Validate against human subject experiment results
The model gets closer to accuracy of human raters — but at scale
Provides additional feedback of where image departs from a good residual plot

Prior to training, a large scale human subjects experiment was conducted using lineups to compare human performance relative to classical statistical tests (Breusch-Pagan for heteroskedasticityt, Shapiro-Wilk for non-normality).

The `autovi` package

Code

library(autovi)

# Fit a model
mtcars_lm <- lm(mpg ~ wt + hp, data = mtcars)

# Automated residual plot assessment
result <- auto_vi(mtcars_lm)
result$p_value   # formal p-value from CV model
result$plot      # annotated residual plot

Returns a p-value analogous to a formal hypothesis test
Indicates which type of departure was detected (non-linearity, heteroscedasticity, etc.)
But it requires installing python and tensorflow, so there is also available as a Shiny web app for easy use.

Example 2: Revisited

Upload
Significance
Explanation

Key take-aways

Use the lineup protocol to assess your interpretation of structure in a plot: Look at plots in the context of null samples, what might this look like if a sample consistent with no structure were shown.
The computer vision model in the autovi software and associated shiny app can help evaluate patterns in residual plots. Especially useful for teaching introductory statistics.

Part 3

Exploring high-dimensional data with tours - you can see beyond 2D

Customer segmentation with dozens of behavioural variables
Financial risk models with many correlated inputs
Supply chain data with multiple performance metrics per node
Survey data with hundreds of items

What is a “tour”?

A tour is a continuous sequence of 2D projections of high-dimensional data — like slowly rotating a sculpture to see it from all angles.

Grand tour: random rotation through all projections
Guided tour: steered toward projections with interesting structure
Radial tour: to assess sensitivity of structure and variable importance
Slice tour: sections through the data (for concave or hollow structure)
Sage tour: reverses for the “crowding” effect of high dimensions projected to low dimensions (think central limit theorem, but in an undesirable way)

Code

library(tourr)
data(penguins)

f_std <- function(x) (x-mean(x))/sd(x)
pinguino <- penguins |>
  dplyr::filter(!is.na(bill_len)) |>
  rename(bl = bill_len,
         bd = bill_dep,
         fl = flipper_len,
         bm = body_mass) |>
  select(bl:bm, species) |>
  mutate_if(is.numeric, f_std)      
animate_xy(pinguino[, 1:4], 
  #guided_tour(lda_pp(pinguino$species)),
  col = pinguino$species)

render_gif(pinguino[, 1:4],
  grand_tour(), 
  display_xy(col = pinguino$species),
  gif_file = "images/penguins_tour.gif",
  apf = 1/20,
  frames = 500)

The `mulgar` book: a practical guide

Cook & Laa (2026) — Interactively Exploring High-Dimensional Data and Models in R: dicook.github.io/mulgar_book

Covers visualisation for:

Principal component analysis
Non-linear dimension reduction (UMAP, t-SNE)
Clustering (k-means, hierarchical, model-based, self-organised maps)

Supervised classification (linear discriminant analysis, trees, forests, SVMs, neural nets)
Diagnostics for model fit
Explainable AI (SHAP values)

Example: Risk Taking (1/2)

Survey of 563 Australian tourists, see Dolnicar S, Grün B, Leisch F (2018)
Six different types of risks: recreational, health, career, financial, social and safety
Rated on a scale from 1 (never) to 5 (very often)

Goal: Conduct market segmentation to group tourists into similar behaviour.

Step 1: understand the shape of the data

Code

# Step 1: get a sense of the data
library(lionfish)
data("risk")
colnames(risk) <- c("Rec", "Hea", "Car", "Fin", "Saf", "Soc")

animate_xy(risk)
set.seed(201)
render_gif(risk,
           grand_tour(),
           display_xy(col = "#6C26AC"),
           start = basis_random(6,2),
           gif_file = "images/risk_gt.gif",
           apf = 1/20,
           frames = 400,
           width = 400,
           height = 400)

shape
tour
images

Apple in two halves and knife on cutting board.

Banana cut into eight coin-shaped pieces and knife on cutting board.

Animation showing 2D projections of 6D data as scatterplots of purple points. There is a circle with line segments radiating from the centre which represent the projection coefficients of each 2D projection shown. The patterns that can be seen are circular in many projections, and sometimes elongated, almost elliptical with some higher density at one end and lower density at the other. We can also see discrete lines of points which is due to each variable being ordinal: which can be ignored because it is not important structure for understanding the association between variables.

A single 2D projection of 6D data shown as a scatterplot of purple points. A purple sketch roughs out the shape, which is like a pear. The variables are mostly contributing to this projection Soc, Rec and Hea.

A single 2D projection of 6D data shown as a scatterplot of purple points. A purple sketch roughs out the shape, which is like a rhombus. All six variables contribute to this projection in different directions.

Example: Risk Taking (2/2)

Code

risk <- readRDS("data/risk_MSA.rds")
colnames(risk) <- c("Rec", "Hea", "Car", "Fin", "Saf", "Soc")
risk <- as.data.frame(risk)

risk_d <- apply(risk, 2, function(x) (x - mean(x)) / sd(x))

# Clustering
nc <- 5
set.seed(1145)
r_km <- kmeans(risk_d, centers = nc, iter.max = 500, nstart = 5)

r_km_d <- risk_d |>
  as_tibble() |>
  mutate(cl = factor(r_km$cluster)) |>
  bind_cols(model.matrix(~ as.factor(r_km$cluster) - 1))
colnames(r_km_d)[(ncol(r_km_d) - nc + 1):ncol(r_km_d)] <- paste0(
  "cluster",
  1:nc
)
r_km_d <- r_km_d |>
  mutate_at(vars(contains("cluster")), function(x) x + 1)

#animate_xy(r_km_d[, 1:6], col = r_km_d$cl)
#animate_xy(r_km_d[, 1:6], guided_tour(lda_pp(r_km_d$cl)), col = r_km_d$cl)

render_gif(r_km_d[, 1:6],
           guided_tour(lda_pp(r_km_d$cl)),
           display_xy(col = r_km_d$cl),
           start = basis_random(6,2),
           gif_file = "images/risk_cl5.gif",
           apf = 1/20,
           frames = 400,
           width = 400,
           height = 400)

Number of clusters: 2
3
4
5

Clustering by \(k\)-means, and to visualise we use a guided tour steering towards the projection that best separates the clusters.

How is the clustering dividing this data?
How many clusters would you use?

Key takeaways: high-dimensional visualisation

Tours let you explore structure that PCA and t-SNE assume away
Connects directly to ML model diagnostics and XAI, making black boxes more transparent

Putting it together

Three new methodology directions, one theme

Better graphics → better decisions.

All three advances address a common problem: our standard visualisations hide things we need to know.

Software/methodology	What it reveals
`ggdibbler`	Uncertainty in estimates and predictions
`nullabor` / `autovi`	Whether model assumptions are actually met
`tourr` / `mulgar`	Structure hidden in high dimensions

Resources

ggdibbler
https://harriet-mason.github.io/ggdibbler/

mulgar book
https://dicook.github.io/mulgar_book/

nullabor
github.com/dicook/nullabor

autovi
https://autoviweb.netlify.app/

Slides made in Quarto, with code included.

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

Seeing More in Your Data

Overview

Preamble

Part 1

The problem: the ways that we typically display uncertainty, or not

ggdibbler: distributions as data

Philosophical foundation

Software: ggdibbler

Stepping backwards: grammar of graphics

Part 2

Why residual plots matter — and why they’re hard

The lineup protocol: visual inference (1/2)

The lineup protocol: visual inference (2/2)

From human judgement to computer vision

The autovi package

Example 1: Revisited

Example 2: Revisited

Key take-aways

Part 3

What is a “tour”?

The mulgar book: a practical guide

Example: Risk Taking (1/2)

Example: Risk Taking (2/2)

Key takeaways: high-dimensional visualisation

Putting it together

Three new methodology directions, one theme

Resources

`ggdibbler`: distributions as data

Software: `ggdibbler`

The `autovi` package

The `mulgar` book: a practical guide