SISBID 2025
https://github.com/dicook/SISBID
What patterns do you see?
\({\mathbf x}_i \in \mathbb{R}^p\), \(i^{th}\) data vector
\(F\) is a \(p\times d\) orthonormal basis
\(F'F=I_d\), where \(d\) is the projection dimension.
The projection of \({\mathbf x}_i\) onto \(F\) is \({\mathbf y}_i=F'{\mathbf x}_i\).
The tour is indexed by time, \(F(t)\), where \(t\in [a, z]\). The starting and target frames are denoted \(F_a = F(a)\) and \(F_z=F(z)\).
The animation of the projected data is given by a path \({\mathbf y}_i(t)=F'(t){\mathbf x}_i\).
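The notation above can be tried out in a few lines of base R. This is a minimal sketch, not the tourr implementation: the data are simulated, `orthonormalise()` and `F_t()` are hypothetical helper names, and the path simply blends the two frames and re-orthonormalises, whereas tourr uses geodesic interpolation between frames.

```r
set.seed(2025)
n <- 100; p <- 5; d <- 2

# Hypothetical data: n observations in p dimensions (row i is x_i')
X <- matrix(rnorm(n * p), nrow = n, ncol = p)

# Hypothetical helper: orthonormalise the columns of a p x d matrix via QR,
# fixing column signs so that R has a positive diagonal (makes Q unique)
orthonormalise <- function(M) {
  qrM <- qr(M)
  sweep(qr.Q(qrM), 2, sign(diag(qr.R(qrM))), `*`)
}

F_a <- orthonormalise(matrix(rnorm(p * d), p, d))  # starting frame
F_z <- orthonormalise(matrix(rnorm(p * d), p, d))  # target frame

# Check the orthonormality condition F'F = I_d
stopifnot(isTRUE(all.equal(crossprod(F_a), diag(d))))

# y_i = F'x_i for all i at once: row i of Y is y_i'
Y <- X %*% F_a

# Naive path F(t) for t in [a, z]: blend the frames, then re-orthonormalise.
# This is an assumption for illustration, NOT geodesic interpolation.
F_t <- function(t, a = 0, z = 1) {
  w <- (t - a) / (z - a)
  orthonormalise((1 - w) * F_a + w * F_z)
}

# Projected data half way along the path
Y_mid <- X %*% F_t(0.5)
```

Stepping `t` from `a` to `z` and redrawing `X %*% F_t(t)` at each step gives a crude version of the animation.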
Length and direction of axes relative to the pattern of interest
(spinifex package)
New target bases are chosen using a projection pursuit index function
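\[\mathop{\text{maximize}}_{F}~g({\mathbf X}F) ~~~\text{ subject to } F \text{ being orthonormal}\]
where \({\mathbf X}\) is the \(n\times p\) data matrix with rows \({\mathbf x}_i'\), so that \({\mathbf X}F\) has rows \({\mathbf y}_i'\).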
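The optimisation can be sketched in base R with a crude random search over orthonormal bases. The index used here is a version of the holes index, which rewards projections with little data near the centre. The formula follows the form commonly given in the projection pursuit literature and is an assumption here, so the holes index in tourr may differ in detail (for example, it expects sphered data). The simulated ring data and the helper names `holes_index()` and `random_basis()` are hypothetical.

```r
set.seed(1)

# Hypothetical data: a noisy ring in variables 1-2, pure noise in variables 3-4
n <- 300
theta <- runif(n, 0, 2 * pi)
X <- cbind(3 * cos(theta) + rnorm(n, sd = 0.1),
           3 * sin(theta) + rnorm(n, sd = 0.1),
           rnorm(n), rnorm(n))
X <- scale(X)  # standardise each variable (a rough stand-in for sphering)

# Holes-type index g(Y) for an n x d matrix of projected data:
# large when few points lie near the origin of the projection
holes_index <- function(Y) {
  d <- ncol(Y)
  (1 - mean(exp(-0.5 * rowSums(Y^2)))) / (1 - exp(-d / 2))
}

# Random p x d orthonormal basis via QR of a Gaussian matrix
random_basis <- function(p, d) qr.Q(qr(matrix(rnorm(p * d), p, d)))

# Two reference projections: the ring plane and the noise plane
F_ring  <- diag(4)[, 1:2]
F_noise <- diag(4)[, 3:4]
g_ring  <- holes_index(X %*% F_ring)
g_noise <- holes_index(X %*% F_noise)

# Crude random search: keep the basis with the largest index value
best <- list(value = -Inf, basis = NULL)
for (k in 1:500) {
  F_k <- random_basis(4, 2)
  v <- holes_index(X %*% F_k)
  if (v > best$value) best <- list(value = v, basis = F_k)
}
```

A guided tour replaces the random search with a proper optimiser and interpolates between successive bases, so the search itself can be watched as an animation.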
holes: This is an inverse Gaussian filter, which is optimised when there is not much data in the centre of the projection, i.e. a “hole” or donut shape in 2D.
central mass: The opposite of holes, high density in the centre of the projection, and often “outliers” on the edges.
LDA/PDA: An index based on the linear discriminant dimension reduction (and penalised), optimised by projections where the named classes are most separated.
bd contribution controlled
bd is removed from projection, Gentoo separation disappears
bd is important for distinguishing Gentoo
4D Torus
Compute PCA, reduce dimension, show original variable axes in the reduced space.
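These three steps can be sketched with base R. The example assumes the built-in `iris` data and two retained components purely for illustration; neither choice comes from the slides.

```r
# Assumed example data: the four numeric variables of iris
X <- iris[, 1:4]

# 1. Compute PCA on standardised variables
pc <- prcomp(X, scale. = TRUE)
var_explained <- pc$sdev^2 / sum(pc$sdev^2)

# 2. Reduce dimension: keep the first k principal components
k <- 2
scores <- pc$x[, 1:k]       # n x k coordinates of the observations
axes <- pc$rotation[, 1:k]  # p x k: each row is an original variable axis

# 3. Show the original variable axes in the reduced space
plot(scores, asp = 1, col = iris$Species, pch = 16,
     xlab = "PC1", ylab = "PC2")
arrows(0, 0, 2 * axes[, 1], 2 * axes[, 2], length = 0.1)
text(2.3 * axes[, 1], 2.3 * axes[, 2], labels = rownames(axes))
```

The columns of `axes` form an orthonormal basis, so this plot is one particular projection of the kind a tour moves through. The factor 2 only lengthens the arrows for readability.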
Using the sample code from the tourr package, check how many clusters are in the example data.
library(tourr)
data(flea)
?animate_xy
# On a Mac, start quartz window with: quartz()
# On Windows, start X11 window with: X11()
animate_xy(flea[, 1:6])
# RStudio graphics windows: may want to reduce frame rate
animate_xy(flea[, 1:6], fps=10)
# Also
animate_xy(flea[, -7], col = flea$species)
animate_xy(flea[, 1:6], tour_path = guided_tour(lda_pp(flea$species)), col=flea$species)
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.