$$\min_{\theta \in \Theta} \sup_{P: D(P, P_0) \leq \rho} \mathbb{E}_P\left[ \ell(\theta;(X,Y)) \right]$$
The above equation is from this paper in ML: (https://arxiv.org/pdf/1805.12018.pdf) on the top of page 2. I'm having a hard time understanding what that equation is even saying though, specifically the part in the middle. So, is it correct to say the equation is saying there is a distribution P, which is within a distance of row from P0, and we want to minimize the expected value given that distribution? Where does the support come into play though?