Why the $C$ constant is defined like this in the Theorem:
There is the proof of Theorem provided by the book, i underlined what i do not understood.
The text where i found this Theorem was Numerical Linear Algebra , Trefethen and Bau.
Why the $C$ constant is defined like this in the Theorem:
There is the proof of Theorem provided by the book, i underlined what i do not understood.
The text where i found this Theorem was Numerical Linear Algebra , Trefethen and Bau.
Consider the ordinary power method. It looks like $x_k=\frac{A^k x_0}{\| A^k x_0 \|}=\frac{\sum_{i=1}^n \lambda_i^k c_i(x_0) v_i}{\left \| \sum_{i=1}^n \lambda_i^n c_i(x_0) v_i \right \|}$, where $c_i$ are scalars. If $c_1$ and $c_2$ are both nonzero, then we divide top and bottom by $\lambda_1^n c_1(x_0)$ to reveal the normalized version of $v_1 + \sum_{i=2}^n (\lambda_i/\lambda_1)^k (c_i/c_1) v_i$. This is $O(|\lambda_2/\lambda_1|^k)$ (why?).
Suppose instead we had $n-1$ vectors $x_0^{(j)}$ with $c_i(x_0^{(j)})=0$ for all $i<j$, $c_j(x_0^{(j)}) \neq 0,c_{j+1}(x_0^{(j)}) \neq 0$. Suppose we ran power iteration on each of those. Then the iteration of $x_0^{(j)}$ will give us $v_j$, and by an argument just like the one above, its error is $O(|\lambda_{j+1}/\lambda_j|^k)$.
This is not exactly how simultaneous iteration actually works: if it were then we would be able to immediately read off the eigenvectors from the initial conditions, in which case there would be no need to do any iteration. But the orthogonalization procedure used in simultaneous iteration essentially aims to make this be the situation as $k \to \infty$. In effect, once $x_k^{(1)}$ has nearly converged, $x_k^{(2)}$, being orthogonal to $x_k^{(1)}$, can't have much of a component in the direction of $v_1$, so its deviation from $v_2$ must arise primarily from the contribution of the component in the direction of $v_3$, etc.