Central Limit Theorem and preventing the distribution from collapsing and degenerate

Question

I know question on central limit theorem has been asked numerous times and I have searched around the website but still couldn't get what I wanted to find out (probably most people who asked the question is stuck at different parts of the theory).

Attached in my question is a lecture slide for the search for a transformation of x̅ that has a limiting distribution. I do not understand why x̅ and µ is multiply by √n instead of n. Also, why when the statistic √n(x̅ - µ) is used, µ becomes zero?

Thank you in advance!

enter image description here

Suggestion to type in math mode and not use figures. – hola Nov 26 '19 at 19:44 — hola, Nov 26 '19 at 19:44

score 1 · Accepted Answer · answered Apr 03 '15 at 04:37

A previous slide defines $$ \bar{X} = \frac{X_1+\cdots+X_n}{\sqrt{n}}, $$ where the random variables $X_1,\ldots,X_n$ are independent copies of the same distribution, which has mean $\mathbb{E}[X_i] = \mu$ and variance $\mathbb{V}[X_i] = \sigma^2$. One calculates that the mean of $\bar{X}$ is also $\mu$, and its variance is $\sigma^2/n$.

We want to normalize $\bar{X}$ so that it has mean $0$ and variance $1$. If we subtract $\mu$ from $\bar{X}$ we obtain a random variable $\bar{X} - \mu$ with mean $0$ and the same variance. If we multiply the result by $\sqrt{n}/\sigma$ then the mean remains $0$ and the variance becomes $1$. The new random variable is $$ \tilde{X} = \frac{\sqrt{n}(\bar{X} - \mu)}{\sigma}. $$ It so happens that as $n\to\infty$, $\tilde{X}$ tends to a normal distribution $N(0,1)$. This is the central limit theorem.

In order for a theorem of this sort to hold, the mean and variance of $\tilde{X}$ have to be fixed. This is why we normalized it to have mean $0$ and variance $1$. We could have chosen other values and would have obtained an equivalent central limit theorem, but the choice of zero mean and unit variance is the conventional standard.

The slide tries to explain why other normalizations won't work:

$\bar{X}$ has fixed mean but the variance tends to $0$. Therefore $\bar{X}$ tends to the constant $\mu$ distribution. This is the law of large numbers. The law tells us what $\bar{X}$ tends to, but not how the deviation from the mean behaves.
$\sqrt{n} \bar{X}$ has fixed variance $\sigma^2$ but the mean tends to infinity (or minus infinity). So $\sqrt{n} \bar{X}$ cannot converge to any fixed distribution, unless $\mu = 0$. This is because if $\sqrt{n} \bar{X}$ tends to a distribution with mean $m$, then the mean of $\sqrt{n} \bar{X}$ must tend to $m$.
$\sqrt{n}(\bar{X} - \mu)$ has mean $0$ and variance $\sigma^2$, and we can hope for a limit theorem. Indeed, the central limit theorem shows that $\sqrt{n}(\bar{X} - \mu)$ tends to $N(0,\sigma^2)$, a normal distribution with mean $0$ and variance $\sigma^2$. This is equivalent to the central limit theorem stated above.

Thank you. Your explanation is very thorough! :) I still have some questions:
in 2) we can basically choose any combination value of µ and σ^2 but it's meaningless, hence we choose the standard normal values of mean o and variance 1 so that we can use the standard normal statistical table, right?

also, I do not understand why x̅ = (X1+⋯+Xn) / √n and consequently, why σ^2/n can just become σ^2 when everything else is multiplied by √n? does my question make sense? why √n is used instead of just n? — Ken, Apr 03 '15 at 05:29
We divide by $\sqrt{n}$ rather than by $n$ since we want the variance to be constant. If we divided by $n$, the variance would tend to zero and we wouldn't be able to see the errors. You can try it – add 100 numbers, each chosen uniformly at random among $1,-1$. If you divide by 100 and do a histogram (i.e., repeat the experiment many times and plot how many times each value was obtained) then the histogram will be very close to 0. Replace 100 by 1000, and it becomes even closer to 0. — Yuval Filmus, Apr 03 '15 at 06:17
In contrast, if you divide by 10 (that is, $\sqrt{100}$) then you should see a Gaussian. You should see a very similar Gaussian if you replace 100 with 1000 and 10 with $\sqrt{1000}$. I encourage you to do this experiment and see for yourself. — Yuval Filmus, Apr 03 '15 at 06:18

Central Limit Theorem and preventing the distribution from collapsing and degenerate

1 Answers1