For any random $m \times n$ matrix $X$ and $m \times k$ matrix $Y$,
$$ E[\| X^T Y \|] \leq E[ \|X\|^2 ]^{1/2} E[ \|Y\|^2 ]^{1/2}, $$ where $E[\| X^T Y \|]$ can be interpreted as the absolute value of inner product, $|\langle X,Y \rangle|$, and for a $m \times n$ matrix $A$, $\|A\| = \left(\sum_{i,j}a_{ij}^2\right)^{1/2} = trace(A^T A)^{1/2}$.
I don't how can I view $E[\| X^T Y \|]$ as $|\langle X,Y \rangle|$. Could anyone explain this for me?