Let's say I have data points $0.1$, $0.2$, $0.3$ coming from a normal distribution with mean $\mu$ and standard deviation $1$.
If I want to test the hypotheses $H_0: \mu = 0.15$ vs. $H_1: \mu > 0.15$, then the test statistic is
$$T = \frac{\hat{\mu} - \mu}{1/\sqrt{n}}
$$
where $\hat{\mu}$ is the sample mean. Under the null hypothesis,
$$T = \frac{\hat{\mu} - 1.5}{1/\sqrt{3}} \sim \mathcal{N}(0,1).
$$
With my data, I know my observed test statistic is $T = \sqrt{3}\times 0.05$.
Now my confusion here is the rationale for why we do the following:
The p-value is $p = \mathbb{P}(Z > \sqrt{3}\times 0.05)$ where $Z$ is standard normal.
Why do we look at the probability that the theoretical statistic exceeds the observed test statistic? Why not when it doesn't exceed?
A similar question with the p-value is when we do the two-sided test ($H_1: \mu \neq 0.15$). The p-value will evaluate to
$$p = 2\mathbb{P}(Z > \sqrt{3}\times 0.05)$$
but if the two-sided alternative is more likely than the one-sided alternative (It's either $>$ or $<$ opposed to $\neq$), shouldn't we intuitively be more likely to reject in the two-sided test? But our p-value seems to give the opposite result with the factor of 2, as we reject if p is small and this factor of 2 makes it harder to reject. Why is there a discrepancy in my intuition?
Throughout university, I have always been taught the process: Pick a significance level $\alpha$ (0.05 in this case) -> Choose a test statistic, then calculate it's p-value -> compare the p-value with the level. If $p$ is lower than the level then we reject null, else accept the null. Is this what you mean by doing it backwards? – OneGapLater Aug 05 '19 at 00:39