Questions to the proof of the $L^p$ Ergodic Theorem of Von Neumann

Question

Before giving the Theorem of Von Neumann and asking my questions to its proof, I'll cite the Ergodic theorem of Birkhoff (out of Walters' "An Introduction to Ergodic Theory", p. 34) that is used in that proof of Von Neumann's Theorem:

Birkhoff Ergodic Theorem. Suppose $T\colon (X,\mathfrak{B},m)\to (X,\mathfrak{B},m)$ is measure-preserving (where we allow $(X,\mathfrak{B},m)$ to be $\sigma$-finite) and $f\in L^1(m)$. Then $\frac{1}{n}\sum_{i=0}^{n-1}f(T^i(x))$ converges a.e. to a function $f^*\in L^1(m)$. Also $f^*\circ T=f^*$ a.e. and if $m(X)<\infty$, then $\int f^*\, dm=\int f\, dm$.

Now to Von Neumann's Theorem (Walters, p. 36):

$L^p$ Ergodic Theorem of Von Neumann. Let $1\leq p<\infty$ and let $T$ be a measure-preserving transformation of the probability space $(X,\mathfrak{B},m)$. If $f\in L^p(m)$ there exists $f^*\in L^p(m)$ with $f^*\circ T=f^*$ a.e. and $\lVert (1/n)\sum_{i=0}^{n-1}f(T^ix)-f^*(x)\rVert_p\to 0$.

Here is the proof:

If $g$ is bounded and measurable then $g\in L^p$ and by the ergodic theorem we have that $$ \frac{1}{n}\sum_{i=0}^{n-1}g(T^ix)\to g^*(x)\text{ a.e.} $$ Clearly $g^*\in L^{\infty}(m)$ and hence $g^*\in L^p(m)$. Also $$ \lvert(1/n)\sum_{i=0}^{n-1}g(T^ix)-g^*(x)\rvert^p\to 0\text{ a.e.} $$ and by the bounded convergence theorem $$ \lVert (1/n)\sum_{i=0}^{n-1}g(T^ix)-g^*(x)\rVert_p\to 0. $$ If $\varepsilon > 0$ we can choose $N(\varepsilon,g)$ such that if $n>N(\varepsilon,g)$ and $k>0$ then $$ \left\lVert\frac{1}{n}\sum_{i=0}^{n-1}g(T^ix)-\frac{1}{n+k}\sum_{i=0}^{n+k-1}g(T^ix)\right\rVert_p <\varepsilon. $$

Let $f\in L^p(m)$ and $S_n(f)(x)=\frac{1}{n}\sum_{i=0}^{n-1}f(T^ix)$. We must show that $(S_n(f))_n$ is a Cauchy sequence in $L^p(m)$. Note that $\lVert S_n(f)\rVert_p\leq\lVert f\rVert_p$. Let $\varepsilon >0$ and choose $g\in L^{\infty}(m)$ such that $\lVert f-g\rVert_p < \varepsilon/4$. Then $$ \lVert S_nf-S_{n+k}f\rVert_p\leq\lVert S_nf-S_ng\rVert_p + \lVert S_ng-S_{n+k}g\rVert_p + \lVert S_{n+k}g-S_{n+k}f\rVert_p\\\leq \varepsilon/4 + \varepsilon/2 + \varepsilon/4 = \varepsilon $$ if $n> N(\varepsilon/2,g)$ and $k>0$. Therefore $(S_nf)_n$ is a Cauchy sequence in $L^p(m)$ and hence $\lVert S_f-f^*\rVert_p\to 0$ for some $f^*\in L^p(m)$.

We have $f^*\circ T=f^*$ a.e. because $$ \left(\frac{n+1}{n}\right)(S_{n+1}f)(x)-(S_nf)(Tx)=\frac{f(x)}{n}. $$

I have three questions concerning this proof.

1.) Why is (clearly) $g^*\in L^{\infty}(m)$?

I think it is because by the Birkhoff Ergodic Theorem it is $g^*\in L^1(m)$, i.e. $\int\lvert g^*\rvert\, dm<\infty$. From this it follows that $\lvert g^*\rvert < \infty$ a.e. and so it is $\text{ess}\sup_{x\in X}\lvert g^*(x)\rvert < \infty$.

2.) Why can we simply choose a function $g\in L^{\infty}(m)$ such that $\lVert f-g\rVert_p < \varepsilon/4$? I did understand that we do that in order to apply the first part of the proof but not why we can simply choose such a function. Maybe one may think of a simple function (which is bounded and measurable and therefore in $L^{\infty}$) which approximates $f$ good enough. Don't know.

3.) Why does the last identity show that $f^*\circ T=f^*$ a.e.? Do not see that. Especially why a.e.?

With greetings,

math12

Why this proof doesn't work for the case $p=\infty$ as well? — Odylo Abdalla Costa, Jun 26 '20 at 23:39

Ian · Answer 1 · 2014-09-01T18:45:27.817

3

If $|g| \leq M$ then $|g^*| \leq M$. When $n$ is finite this is just algebra; when you send $n \to \infty$ you have preservation of inequalities.
Given $\varepsilon > 0,f \in L^p$, there exists $M>0$ such that if $A=\{ x : |f(x)| \leq M \}$ then $\| f - f \chi_A \|_p < \varepsilon$. This follows from Chebyshev's inequality.
Take $n \to \infty$ on both sides, then wherever $(S_n f)(x) \to f^*(x)$, you get $f^*(x) - f^*(Tx) = 0$. Since the aforementioned convergence occurs a.e., $f^* = f^* \circ T$ a.e.

It would be instructive to prove this again using the Vitali convergence theorem. Proceeding that way, you need only show that $(S_n f)^p$ is uniformly integrable. I haven't worked out the details, but I think this exposes an interesting property of measure-preserving transformations. Specifically, I think if $T$ is a measure-preserving transformation and $f \in L^1$ has "modulus of integrability" $\delta(\varepsilon)$ (i.e. if $m(A) < \delta(\varepsilon)$ then $\int_A |f| dm < \varepsilon$), then $f \circ T$ has the same modulus of integrability. Then you inductively get that $f \circ T^n$ has that same modulus of integrability, thus $(S_n f)$ has the same modulus of integrability, and then the result follows.

edited Sep 01 '14 at 18:45

answered Sep 01 '14 at 18:37

Ian

101,645

1.) is clear now. 2.) Which version of the Tschebyscheff inequality are you meaing/ using= 3.) I see that for $n\to\infty$ it follows. But how do I get this equation at all? – Sep 01 '14 at 21:03
addition to 3.) And why is $(S_nf)(x)\to f^*(x)$ alomst surely? The convergence is in the p-Norm, ok. But why a.e.? – Sep 01 '14 at 21:19
1

For 3, just expand the definition: $\frac{n+1}{n} (S_{n+1} f)(x) - (S_n f)(Tx) = \frac{1}{n} \sum_{i=0}^n f \left ( T^i(x) \right ) - \frac{1}{n} \sum_{i=1}^n f \left ( T^i(x) \right ) = f(x)/n$. – Ian Sep 01 '14 at 21:20
For 2 I think I was mistaken. A different way of doing 2 is to prove that if $g$ is a nonnegative function then $\int_X g dm = \int_0^\infty m ( { x : g(x) \geq y } ) dy$. Then if $g$ is integrable then this integral converges, which means that the tail $\int_M^\infty m ( { x : g(x) \geq y } ) dy$ must go to zero as $M \to \infty$, and the rest should be pretty clear. – Ian Sep 01 '14 at 21:25
1

For your second commented question, just apply the Birkhoff ergodic theorem. – Ian Sep 01 '14 at 21:26
2.) is not clear to me. And with 3.) concerning the a.e.: From where do we know that $f^*$ is the same function to which (S_nf)_n$ converges in p-norm? – Sep 01 '14 at 21:33
For 3, there's lots of ways to establish uniqueness, Vitali's theorem is arguably the most straightforward since it says $f_n \to f$ a.e. and some conditions are equivalent to $f_n \to f$ in $L^p$. For 2, perhaps a shorter way to do it would be monotone convergence. Let $g_n = f(x) \chi_{ { y : n-1 \leq f(y) \leq n } }(x)$ and let $f_n(x) = \sum_{j=1}^n g_n(x)$. This clearly converges pointwise to $f$. By monotone convergence it also converges in $L^p$ to $f$. Therefore the tail can be made small in the sense of $L^p$. Now apply this to $|f|$ in your context. – Ian Sep 01 '14 at 21:50
Another way to do 3 is to prove that if $f_n \to f$ in $L^p$ then there is $f_{n_k}$ which converges to $f$ a.e. Therefore if $f_n$ in fact converges a.e. then it must converge a.e. to $f$. – Ian Sep 01 '14 at 21:51
Sorry, I misspoke in the comment before last. Vitali's theorem says that under some conditions $f_n \to f$ a.e. is equivalent to $f_n \to f$ in $L^p$. – Ian Sep 01 '14 at 22:01

Questions to the proof of the $L^p$ Ergodic Theorem of Von Neumann

1 Answers1

Linked