Processing math: 100%
+ - 0:00:00
Notes for current slide
Notes for next slide


  Burning sage

Reversing the curse of dimensionality in the visualization of high-dimensional data

work in collaboration with Di Cook and Stuart Lee arXiv:2009.10979

Ursula Laa

Monash University

Department of Econometrics and Business Statistics & School of Physics and Astronomy

ursula.laa@monash.edu

6th November 2020


1

Open problem described in 2018 (Di Cook)



pD object
3D slice


pD slice



Picture sources: Samantha Cooper and CakesDekor

1

Curse of dimensionality paradox

  • Origin: Bellman (1961) described difficulty of optimization in high dimensions given exponential growth in space
  • Consequence: most points are far from the sample mean, near the edge of the sample space
  • Paradox: using dimension reduction we instead get an excessive amount of observations near the center of the distribution, most projections are approximately Gaussian

2

Concepts of projected volume

  • To understand the piling near the center of projections, we can think about the high-dimensional volume projected onto a 2D area
  • To impose rotation invariance and avoid edge effects, we start from a hypersphere in p dimensions


3

Concepts of projected volume

  • To understand the piling near the center of projections, we can think about the high-dimensional volume projected onto a 2D area
  • To impose rotation invariance and avoid edge effects, we start from a hypersphere in p dimensions


3

Concepts of projected volume

  • To understand the piling near the center of projections, we can think about the high-dimensional volume projected onto a 2D area
  • To impose rotation invariance and avoid edge effects, we start from a hypersphere in p dimensions


4

Burning sage transformation

We can define a radial transformation that will redistribute the projected volume such that equal pD volume is projected onto equal 2D area

ry=R1(1(ryR)2)p/2

5

Burning sage transformation

We can define a radial transformation that will redistribute the projected volume such that equal pD volume is projected onto equal 2D area

ry=R1(1(ryR)2)p/2

5

Burning sage transformation

We can define a radial transformation that will redistribute the projected volume such that equal pD volume is projected onto equal 2D area

ry=R1(1(ryR)2)p/2

6

Sage tour

The new transformation is especially useful combined with a tour display, showing sequences of low-dimensional projections: for each new view we project the data to 2D and then show the sage display of the projected data


7

Needle in a haystack: Pollen data



8

Clustering in high dimensions: Single Cell Mouse Retina Data


9

Clustering in high dimensions: Single Cell Mouse Retina Data

10

Discussion & Outlook


  • New display that reverses piling effects when visualizing high-dimensional data in low-dimensional projections
  • This is especially useful in combination with a tour, implemented as the sage tour display
  • Related approach: slice tour Laa, Cook, Valencia (2020)
  • These new displays are complementary to non-linear dimension reduction methods for visualization, e.g. t-SNE
  • Displays should be implemented in an interactive interface, for efficient tuning
  • Thinking about new transformations and slicing methods
11

Acknowledgements


My slides are made using RMarkdown, xaringan and the ninjutsu theme, and based on a Monash themed template from Emi Tanaka.


The main R packages used are tourr, tidyverse, plotly, geozoo.


This is joint work done in collaboration with Dianne Cook and Stuart Lee.

12



Thanks!


Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

Ursula Laa

Monash University

Department of Econometrics and Business Statistics & School of Physics and Astronomy

ursula.laa@monash.edu

12

Open problem described in 2018 (Di Cook)



pD object
3D slice


pD slice



Picture sources: Samantha Cooper and CakesDekor

1
Paused

Help

Keyboard shortcuts

, , Pg Up, k Go to previous slide
, , Pg Dn, Space, j Go to next slide
Home Go to first slide
End Go to last slide
Number + Return Go to specific slide
b / m / f Toggle blackout / mirrored / fullscreen mode
c Clone slideshow
p Toggle presenter mode
t Restart the presentation timer
?, h Toggle this help
Esc Back to slideshow