In random matrix decomposition, how does a semi-orthogonal matrix capture the range of an input matrix?

Question

I am reading this paper on probabilistic algorithms for matrix decomposition. I don't understand Section 1.2. Given a matrix $A \in \mathbb{R}^{m \times n}$, we want an "approximate basis for the range of $A$", $Q \in \mathbb{R}^{r \times n}$ such that:

$$ Q \text{ has orthonormal columns and } A \approx QQ^* A $$

I understand that the range of $A$ is the column space of $A$, but I don't understand how finding a matrix $Q$ s.t. $A \approx QQ^* A$ means that $Q$ is a good approximation of that column space.

score 0 · Answer 1 · 2018-11-05T20:18:17.313

There are a number of algorithms in the book for explicitly constructing this

On page $28$ you have algorithm $4.5$. Your question is how is finding matrix $Q$ a good approximaton of the column space. Given an $ m \times n$ matrix $A$ and an integer $l$ this computes an $m \times l$ orthonormal matrix $Q$ whose range approximates the range of $A$

Draw an $n \times l $ SRFT test matrix $\Omega $ which is defined in $4.6 $
Form the $m \times l$ matrix $Y = A \Omega $ using a (subsampled) FFT
Construct an $ m \times l$ matrix $Q$ whose columns form an orthonorml basis for the range $Y$ e.g. using the $QR$ factorization.

So you generate this SFRT matrix, which is partly a Fourier matrix. Then you sample the original matrix $A$. This is how you get the column space of $A$. Then you take the reduced $QR$ decomposition and return the $Q$ matrix. So you would have $Q_{m \times l}$

Why would this be a good approximation? Well $ Y = A \Omega $ and we took $QR = Y$ right. All we did is a take a random matrix and form a bunch of random projections into its subspace using $\Omega$ then take this matrix $Q$ by using Gram Schmidt.

This paper is like $80$ pages long but there is actually analytical bounds on how approximate you get. It also depends on the method.

My question is a little simpler. Suppose $A$ is a $2$-rank $3 \times 3$ matrix. I understand that $Q$ needs to be a $3 \times 2$ semi-orthogonal matrix, i.e. the two columns of $Q$ are an approximate basis for $A$. What I don't understand is why minimizing $\lVert A - QQ^*A \rVert$ finds a $Q$ that does this. — jds, Nov 03 '18 at 21:57
There wasn't really any need to downvote this...I was watching bohemian rhapsody. The analytical bounds are defined in the paper. The approximation are from $k$. If you look down at $1.9$ — , Nov 04 '18 at 03:17

score 0 · Accepted Answer · answered Nov 14 '18 at 20:56

I was confused because I wondered why we cared that $Q$ was invertible because

$$ A \approx QQ^*A \implies QQ^* \approx I $$

But I don't think that's the right intuition. For me, the key was to write the expression like this:

$$ A \approx Q(Q^*A) $$

What we want is a matrix $Q$ with as few orthogonal columns as possible such that, if we project $A$ into this $k$-dimensional subspace and then reconstruct $A$, our reconstruction of $A$ is within some error tolerance.

In random matrix decomposition, how does a semi-orthogonal matrix capture the range of an input matrix?

2 Answers2