Help me understand this part in proving a theorem relating to Rademacher complexity. For simplicity, I am going to avoid all extraneous variables (Eq (4)). Random variable $\psi(S)$ is written as $\psi$ for simplicity. Here $S$ is a sample set, $t$ and $\delta$ are small real positive values. $D$ is the data distribution.
Given, $$Pr[\psi \leq E[\psi] + t] \leq \delta $$
Further, complementing the probability we get
$$Pr[\psi \leq E[\psi] + t] = 1- Pr[\psi -E[\psi] \geq t] \leq \delta$$
So, we can write it as
$$ Pr[\psi -E[\psi] \geq t] \geq 1-\delta$$
Finally, we can say that $ \psi\geq t + E[\psi] $ with atleast probability $1-\delta$.
Now, here is the confusing part, the authors are replacing the inequality in another inequality given as
$$ E_D[f] \leq E_S[f] + \psi $$
implies
$$E_D[f] \leq E_S[f] + E[\psi] + t $$ with probability atleast $1-\delta$.
As $\psi$ is larger than $E[\psi] + t$ this substitution doesn't work correct? Where am I going wrong?
TIA