Kernelized diffusion maps and application
Séminaire Données et Aléatoire Théorie & Applications
10/04/2025 - 14:00 Loucas Pillaud-Vivien (ENPC) Salle 106
Spectral clustering and diffusion maps are celebrated dimensionality reduction algorithms built on eigen-elements related to the diffusive structure of the data. The core of these procedures is the approximation of a Laplacian through a graph kernel approach, however this local average construction is known to be cursed by the high-dimension. We show how to build a different estimator of the Laplacian, via a reproducing kernel Hilbert space method, which adapts naturally to the regularity of the problem. We provide non-asymptotic statistical rates proving that the kernel estimator we build can circumvent the curse of dimensionality. We discuss techniques (Nyström subsampling, Fourier features) that enable to reduce the computational cost of the estimator while not degrading its overall performance. Finally, we will also show that this technique enables to do semi-supervised learning.