In Farb and Margalit's "A Primer on Mapping Class Groups" we have the following
Proposition 1.7: Two transverse simple closed curves in a surface $S$ are in minimal position iff they do not form a bigon.
The proof for the reverse direction for the case $\chi(S) < 0$ starts as follows:
Assume simple closed curves $\alpha, \beta$ form no bigon. Let $\tilde{\alpha}, \tilde{\beta}$ be nondisjoint lifts of $\alpha, \beta$. By a previous Lemma, $\tilde{\alpha}$ intersects $\tilde{\beta}$ in exactly one point $x$. It cannot be that the axes of the hyperbolic isometries corresponding to $\tilde{\alpha}$ and $\tilde{\beta}$ share exactly one endpoint at $\partial \mathbb{H}^2$, because this would violate discreteness of the action of $\pi_1(S)$ on $\mathbb{H}^2$; indeed, in this case the commutator of these isometries is parabolic and the conjugates of this parabolic isometry by either of the original hyperbolic isometries have arbitrarily small translation length.
Question 1: How do we know the isometries corresponding to the lifts are hyperbolic and not parabolic? Earlier in the text, it is mentioned that if a closed curve is homotopic into a neighbourhood of a puncture, then the corresponding isometry is parabolic.
Question 2: What precisely is meant by translation length here? It seems to me like the conjugate of a parabolic isometry by some hyperbolic isometry (sharing an endpoint) will again be parabolic, so $\inf d(x, f(x))$ would be $0$. Is a different translation length meant or is there a reason why the conjugate ends up being hyperbolic rather than parabolic?
Question 3: Why does the conjugate (regardless of being parabolic or hyperbolic) end up being of arbitrarily small translation length?