Manifold learning

Dimensionality reduction

aka nonlinear dimensionality reduction

Nonlinear dimensionality reduction methods consist of two steps: first, they construct a representation of local affinity of the data points (typically, a sparsely connected graph). Second, the data points are embedded into a low-dimensional space, trying to preserve some criterion of the original affinity. For example, spectral embeddings tend to map points with many connections between them to nearby locations, and multidimensional scaling (MDS)-type methods try to preserve global information, such as graph geodesic distances. Examples of manifold learning include different flavors of MDS [26], locally linear embedding [27], stochastic neighbor embedding [28], spectral embeddings such as Laplacian eigenmaps [29] and diffusion maps [30], and deep models [31]. Instead of embedding the vertices, the graph structure can be processed by decomposing it into small subgraphs called motifs [36] or graphlets [37]. Finally, the most recent approaches [32]–[34] try to apply the successful word-embedding model [35] to graphs.
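A minimal sketch of that two-step recipe (not from the quoted survey; the function name, parameters, and weighting choices are my own), in the style of Laplacian eigenmaps: build a sparse kNN affinity graph, then embed with the bottom eigenvectors of the graph Laplacian.

```python
import numpy as np
from scipy.sparse import csgraph
from sklearn.neighbors import kneighbors_graph

def laplacian_eigenmap(X, n_components=2, n_neighbors=10):
    # Step 1: local affinity as a sparse, symmetrized kNN graph
    # (binary weights here; heat-kernel weights are a common alternative).
    W = kneighbors_graph(X, n_neighbors=n_neighbors, mode='connectivity')
    W = 0.5 * (W + W.T)

    # Step 2: embed with the eigenvectors of the (normalized) graph Laplacian
    # corresponding to the smallest nonzero eigenvalues.
    L = csgraph.laplacian(W, normed=True)
    eigvals, eigvecs = np.linalg.eigh(L.toarray())  # dense solve, for simplicity
    return eigvecs[:, 1:n_components + 1]  # skip the trivial constant eigenvector

# Example: a noisy circle embedded in 3-D; the embedding recovers its 2-D structure.
theta = np.random.rand(300) * 2 * np.pi
X = np.c_[np.cos(theta), np.sin(theta), 0.1 * np.random.randn(300)]
Y = laplacian_eigenmap(X, n_components=2)
```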

The input distribution lies on a manifold when there is structure/correlations in the high-dimensional space, as with Sloppy models.

https://en.wikipedia.org/wiki/Nonlinear_dimensionality_reduction#Laplacian_eigenmaps

Several efficient manifold learning techniques have been proposed; a short usage sketch follows the list below.

  • Isometric feature mapping (ISOMAP) (Balasubramanian et al., 2002) estimates the geodesic distances on the manifold and uses them for projection. Multidimensional scaling with geodesic distances, basically. Papers: A Global Geometric Framework for Nonlinear Dimensionality Reduction; The Isomap Algorithm and Topological Stability.
  • Locality Preserving Projections (LPP)
  • Locally linear embedding (LLE) (Roweis and Saul, 2000) projects data points to a low-dimensional space that preserves local geometric properties.
  • Laplacian eigenmaps (LE) (Belkin and Niyogi, 2003) builds a weighted neighborhood graph and minimizes the weighted squared distances between neighboring points as the loss function, so that nearby points stay close in the low-dimensional embedding.
  • Local tangent space alignment (LTSA) (Zhang and Zha, 2004) constructs a local tangent space for each point and obtains the global low-dimensional embedding results through affine transformation of the local tangent spaces.
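
A hedged usage sketch, assuming scikit-learn: most of the listed methods ship in sklearn.manifold (LPP does not; LTSA is exposed as a variant of LocallyLinearEmbedding). Parameter values are illustrative only.

```python
from sklearn.datasets import make_swiss_roll
from sklearn.manifold import Isomap, LocallyLinearEmbedding, SpectralEmbedding

X, _ = make_swiss_roll(n_samples=1000, noise=0.05)

# One estimator per method from the list above.
methods = {
    "ISOMAP": Isomap(n_neighbors=12, n_components=2),
    "LLE": LocallyLinearEmbedding(n_neighbors=12, n_components=2, method="standard"),
    "Laplacian eigenmaps": SpectralEmbedding(n_components=2, n_neighbors=12),
    "LTSA": LocallyLinearEmbedding(n_neighbors=12, n_components=2, method="ltsa"),
}
for name, model in methods.items():
    Y = model.fit_transform(X)  # maps the 3-D swiss roll to 2-D coordinates
    print(name, Y.shape)
```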

Yan et al. (2007) present a general formulation known as graph embedding to unify different dimensionality reduction algorithms within a common framework.
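
As a hedged sketch of that formulation (my paraphrase of Yan et al., 2007, not quoted in this note): each algorithm is cast as the same constrained minimization over an embedding vector $y$, differing only in the choice of the intrinsic affinity matrix $W$ and the constraint matrix $B$:

$$ y^* = \arg\min_{y^\top B y = c} \sum_{i \neq j} \|y_i - y_j\|^2 W_{ij} = \arg\min_{y^\top B y = c} 2\, y^\top L y, \qquad L = D - W,\quad D_{ii} = \sum_j W_{ij}, $$

so that particular choices of $W$ and $B$ recover PCA, LDA, ISOMAP, LLE, and Laplacian eigenmaps as special cases.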