Q. A researcher wants a single estimate of the probability of monkeys having a disease. Using a state-of-the-art blood scanner, she determines if the monkey has a disease or not. Based on her data, she obtains an estimate, $\widehat{p}$, for the actual probability, $p$, of monkeys having the disease. What is the minimum sample size needed to ensure a $\geq 99\%$ certainty that the difference between $\widehat{p}$ and $p$ is $\leq 10\%$?
Now, I have seen a similar question with a worked solution:
In August 2013, the New York Times reported that a recent poll indicated that 52 percent of the population was in favor of the job performance of President Obama, with a margin of error of $\pm 4$ percent. What does this mean? Can we infer how many people were questioned?
Solution. It has become common practice for the news media to present 95 percent confidence intervals. Since $z_{.025} = 1.96$, a 95 percent confidence interval for $p$, the percentage of the population that is in favor of President Obama’s job performance, is given by:
$$\widehat{p} \pm 1.96\sqrt{(\hat{p}(1-\hat{p})/n} = .52 \pm 1.96\sqrt{52(.48)/n} \tag{1}$$ where $n$ is the size of the sample. Since the “margin of error” is $\pm 4$ percent, it follows that $$1.96\sqrt{.52(.48)/n} = .04 \tag{2}$$ or $$ n = \frac{1.96^2(0.52)(0.48)}{(0.04)^2}=599.29 \tag{3}$$
My questions:
It seems like the only difference in this question is that I don’t actually have the estimate $\hat{p}$ to plug into $(1)$, so can I still solve this question using the approach above?
What is the intuition behind equation $(1)$ and how does this tie into C.L.T.? Yes, I can see that $z_{0.025} = 1.96$ is being used, but where are the remaining pieces, namely $\sqrt{(\hat{p}(1-\hat{p})/n}$, of the puzzle coming from?