0.6 Dimensionality reduction

The Johnson-Lindenstrauss lemma

Fundamentals

As with the above techniques in manifold learning, the Johnson-Lindenstrauss (JL) lemma [link] , [link] , [link] , [link] provides a method for dimensionality reduction of a set of data in $R^{N}$ . Unlike manifold-based methods, however, the JL lemma can be used for an arbitrary set $Q$ of points in $R^{N}$ ; the data set is not assumed to have any a priori structure.

Despite the apparent lack of structure in an arbitrary point cloud data set, the JL lemma shows that there exists a method for dimensionality reduction of that data set that preserves key information while mapping the data to a lower-dimensional space $R^{M}$ . In particular, the original formulation of the JL lemma [link] states that there exists a Lipschitz mapping $Φ : R^{N} \mapsto R^{M}$ with $M = O(\log(\# Q))$ such that all pairwise distances between points in $Q$ are approximately preserved. This fact is useful for solving problems such as Approximate Nearest Neighbor [link] , in which one desires the nearest point in $Q$ to some query point $y \in R^{N}$ (but a solution not much farther away than the optimal point is also acceptable). Such problems can be solved significantly more quickly in $R^{M}$ than in $R^{N}$ .
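As a rough illustration of the nearest-neighbor use case, the following sketch projects a point cloud with a random Gaussian matrix (the construction described later in this section, used here in place of the original Lipschitz mapping) and searches in $R^{M}$ . The sizes `num_points`, `N`, `M`, the index `planted`, and the noise level are arbitrary choices made for illustration, and the query is deliberately placed very close to one point so that the correct answer is unambiguous.

```python
import numpy as np

rng = np.random.default_rng(0)
num_points, N, M = 2000, 1000, 50   # illustrative sizes, not taken from the text

# Point cloud Q (one point per row) and a query y planted next to one of its points.
Q = rng.normal(size=(num_points, N))
planted = 123
y = Q[planted] + 0.01 * rng.normal(size=N)

# Random Gaussian map with i.i.d. N(0, 1/N) entries.
Phi = rng.normal(0.0, np.sqrt(1.0 / N), size=(M, N))
Q_low = Q @ Phi.T   # project the data set once, ahead of time
y_low = Phi @ y     # project each query as it arrives

# Brute-force search in the original space and in the reduced space.
exact = int(np.argmin(np.linalg.norm(Q - y, axis=1)))
approx = int(np.argmin(np.linalg.norm(Q_low - y_low, axis=1)))

print("nearest neighbor in R^N:", exact)
print("nearest neighbor in R^M:", approx)
```

Each distance computation in the reduced space touches $M$ coordinates instead of $N$ , which is where the speedup comes from; in general the reduced-space answer is only guaranteed to be approximately nearest.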

Recent reformulations of the JL lemma propose random linear operators that, with high probability, ensure a near-isometric embedding. These typically build on concentration of measure results such as the following.

Lemma

[link] , [link] Let $x \in R^{N}$ , fix $0 < ϵ < 1$ , and let $Φ$ be a matrix constructed in one of the following two manners:

$Φ$ is a random $M \times N$ matrix with i.i.d. $N (0, σ^{2})$ entries, where $σ^{2} = 1 / N$ , or
$Φ$ is a random orthoprojector from $R^{N}$ to $R^{M}$ .

Then with probability exceeding

1 - 2 \exp \left( - \frac{M (ϵ^{2} / 2 - ϵ^{3} / 3)}{2} \right),

the following holds:

(1 - ϵ) \sqrt{\frac{M}{N}} \leq \frac{{∥Φ x∥}_{2}}{{∥x∥}_{2}} \leq (1 + ϵ) \sqrt{\frac{M}{N}} .

The random orthoprojector referred to above is clearly related to the first case (simple matrix multiplication by a Gaussian $Φ$ ) but subtly different; one could think of constructing a random Gaussian $Φ$ , then using Gram-Schmidt to orthonormalize the rows before multiplying $x$ . We note also that a simple rescaling of $Φ$ (multiplying it by $\sqrt{\frac{N}{M}}$ ) can be used to eliminate the $\sqrt{\frac{M}{N}}$ in [link] ; however, we prefer this formulation for later reference.
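The concentration result can be checked numerically. The sketch below builds both kinds of matrix, using a QR factorization as a stand-in for Gram-Schmidt, and measures how often the rescaled ratio ${∥Φ x∥}_{2} / {∥x∥}_{2}$ lands within $1 \pm ϵ$ . The helper names (`gaussian_matrix`, `random_orthoprojector`, `norm_ratio`) and the values of `N`, `M`, `eps`, and `trials` are assumptions made for illustration.

```python
import numpy as np

def gaussian_matrix(M, N, rng):
    """M x N matrix with i.i.d. N(0, 1/N) entries."""
    return rng.normal(0.0, np.sqrt(1.0 / N), size=(M, N))

def random_orthoprojector(M, N, rng):
    """M x N matrix whose orthonormal rows span a random M-dimensional subspace.

    The reduced QR factorization of a Gaussian N x M matrix orthonormalizes
    its columns, playing the role of Gram-Schmidt here.
    """
    G = rng.normal(size=(N, M))
    Qmat, _ = np.linalg.qr(G)   # Qmat is N x M with orthonormal columns
    return Qmat.T

def norm_ratio(Phi, x):
    """||Phi x|| / ||x||, rescaled by sqrt(N / M) so that it concentrates near 1."""
    M, N = Phi.shape
    return np.linalg.norm(Phi @ x) / np.linalg.norm(x) * np.sqrt(N / M)

rng = np.random.default_rng(0)
N, M, eps, trials = 1000, 200, 0.3, 100
x = rng.normal(size=N)   # one fixed vector; Phi is redrawn on every trial

gauss = np.array([norm_ratio(gaussian_matrix(M, N, rng), x) for _ in range(trials)])
ortho = np.array([norm_ratio(random_orthoprojector(M, N, rng), x) for _ in range(trials)])

# Probability guaranteed by the lemma for these values of M and eps.
bound = 1 - 2 * np.exp(-M * (eps**2 / 2 - eps**3 / 3) / 2)

print("lemma's lower bound on success probability:", bound)
print("Gaussian, fraction within 1 +/- eps:", np.mean(np.abs(gauss - 1) <= eps))
print("orthoprojector, fraction within 1 +/- eps:", np.mean(np.abs(ortho - 1) <= eps))
```

The observed fractions should sit at or above the lemma's bound, which is a worst-case guarantee and is typically loose in practice.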

By using the union bound over all $\binom{\# Q}{2}$ pairs of distinct points in $Q$ , Lemma "The Johnson-Lindenstrauss lemma" can be used to prove a randomized version of the Johnson-Lindenstrauss lemma.
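The union-bound step can be sketched as follows. Because $Φ$ is linear, $Φ x - Φ y = Φ (x - y)$ , so the concentration result applies to each difference vector $x - y$ . The shorthand $c$ is introduced here only for compactness, and the final line recovers the requirement on $M$ that appears in the lemma that follows.

```latex
\begin{align*}
c &= \epsilon^{2}/2 - \epsilon^{3}/3, \\
\Pr[\text{some pair } x, y \in Q \text{ fails}]
  &\le \binom{\# Q}{2} \cdot 2 \exp\left(-\frac{M c}{2}\right)
  < (\# Q)^{2} \exp\left(-\frac{M c}{2}\right), \\
(\# Q)^{2} \exp\left(-\frac{M c}{2}\right) \le (\# Q)^{-\beta}
  &\iff \frac{M c}{2} \ge (2 + \beta) \ln(\# Q)
  \iff M \ge \frac{4 + 2\beta}{c} \ln(\# Q).
\end{align*}
```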

Lemma

Johnson-Lindenstrauss

Let $Q$ be a finite collection of points in $R^{N}$ . Fix $0 < ϵ < 1$ and $β > 0$ . Set

M \geq \left( \frac{4 + 2 β}{ϵ^{2} / 2 - ϵ^{3} / 3} \right) \ln (\# Q) .

Let $Φ$ be a matrix constructed in one of the following two manners:

$Φ$ is a random $M \times N$ matrix with i.i.d. $N (0, σ^{2})$ entries, where $σ^{2} = 1 / N$ , or
$Φ$ is a random orthoprojector from $R^{N}$ to $R^{M}$ .

Then with probability exceeding $1 - (\# Q)^{- β}$ , the following statement holds: for every pair of distinct points $x, y \in Q$ ,

(1 - ϵ) \sqrt{\frac{M}{N}} \leq \frac{{∥Φ x - Φ y∥}_{2}}{{∥x - y∥}_{2}} \leq (1 + ϵ) \sqrt{\frac{M}{N}} .
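To make the lemma concrete, the following sketch computes the smallest $M$ allowed by the bound for a small point cloud and then checks every pairwise distance under a Gaussian $Φ$ . The function name `jl_min_dim` and the values of `num_points`, `N`, `eps`, and `beta` are assumptions chosen for illustration.

```python
import numpy as np
from itertools import combinations

def jl_min_dim(num_points, eps, beta):
    """Smallest integer M satisfying M >= (4 + 2*beta) / (eps^2/2 - eps^3/3) * ln(#Q)."""
    c = eps**2 / 2 - eps**3 / 3
    return int(np.ceil((4 + 2 * beta) / c * np.log(num_points)))

rng = np.random.default_rng(0)
num_points, N, eps, beta = 50, 2000, 0.5, 1.0
M = jl_min_dim(num_points, eps, beta)

Q = rng.normal(size=(num_points, N))                   # arbitrary point cloud, one point per row
Phi = rng.normal(0.0, np.sqrt(1.0 / N), size=(M, N))   # i.i.d. N(0, 1/N) entries
PQ = Q @ Phi.T                                         # projected points in R^M

# Ratio of projected to original distance for every pair, divided by sqrt(M / N).
scale = np.sqrt(M / N)
ratios = np.array([
    np.linalg.norm(PQ[i] - PQ[j]) / np.linalg.norm(Q[i] - Q[j]) / scale
    for i, j in combinations(range(num_points), 2)
])

print("M required by the bound:", M)
print("number of pairs checked:", ratios.size)
print("smallest and largest rescaled ratio:", ratios.min(), ratios.max())
```

Note that $M$ depends only on the number of points, $ϵ$ , and $β$ , and not on the ambient dimension $N$ .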

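Indeed, [link] establishes that both [link] and [link] also hold when the elements of $Φ$ are chosen i.i.d. from a Rademacher distribution ( $\pm σ$ , each with probability $1 / 2$ ) or from a similar ternary distribution ( $\pm \sqrt{3} σ$ , each with probability $1 / 6$ ; 0 with probability $2 / 3$ ). Both distributions have mean zero and variance $σ^{2}$ , matching the Gaussian case. Because computing $Φ x$ with such matrices requires only additions and subtractions of entries of $x$ followed by a single scaling, and the ternary matrix is two-thirds zeros on average, these constructions can further improve the computational benefits of the JL lemma.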
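A minimal sketch of these two discrete constructions, assuming $σ^{2} = 1 / N$ as in the lemmas above, is given below. The function names `rademacher_matrix` and `ternary_matrix` and the test sizes are hypothetical choices made for illustration.

```python
import numpy as np

def rademacher_matrix(M, N, rng):
    """Entries are +sigma or -sigma, each with probability 1/2, where sigma^2 = 1/N."""
    sigma = np.sqrt(1.0 / N)
    return sigma * rng.choice([-1.0, 1.0], size=(M, N))

def ternary_matrix(M, N, rng):
    """Entries are +/- sqrt(3)*sigma with probability 1/6 each, and 0 with probability 2/3."""
    sigma = np.sqrt(1.0 / N)
    values = np.sqrt(3.0) * sigma * np.array([-1.0, 0.0, 1.0])
    return rng.choice(values, size=(M, N), p=[1 / 6, 2 / 3, 1 / 6])

rng = np.random.default_rng(0)
N, M, eps = 1000, 200, 0.3
x = rng.normal(size=N)

ratios = {}
for name, build in [("Rademacher", rademacher_matrix), ("ternary", ternary_matrix)]:
    Phi = build(M, N, rng)
    # Rescaled so that a value near 1 means the norm is preserved up to sqrt(M / N).
    ratios[name] = np.linalg.norm(Phi @ x) / np.linalg.norm(x) / np.sqrt(M / N)
    print(name, "rescaled norm ratio:", ratios[name])
    print(name, "fraction of zero entries:", np.mean(Phi == 0))
```

In a performance-sensitive setting one would likely store the ternary matrix in a sparse format and apply the scaling once at the end; the dense arrays here are only for clarity.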

Connections with compressed sensing

In the following module on Compressed Sensing we will discuss further topics in dimensionality reduction that relate to the JL lemma. In particular, as discussed in Connections with dimensionality reduction , the core mechanics of Compressed Sensing can be interpreted in terms of a stable embedding that arises for the family of $K$ -sparse signals when observed with random measurements, and this stable embedding can be proved using the JL lemma. Furthermore, as discussed in Stable embeddings of manifolds , one can ensure a stable embedding of families of signals obeying manifold models under a sufficient number of random projections, with the theory again following from the JL lemma.

Source: OpenStax, Concise signal models. OpenStax CNX. Sep 14, 2009 Download for free at http://cnx.org/content/col10635/1.4
