I'm reading Boyd's Convex Optimization textbook. In particular, I'm currently focusing on Chapter 5 (Duality). There is a frequent recurrence of two examples:
Minimum volume covering ellipsoid \begin{align*} minimize & \quad log \; det \; X^{-1}\\ subject \; to & \quad a_i^T X a_i \leq 1 \\ \end{align*}
Entropy Maximization: \begin{align*} minimize & \quad \displaystyle\sum_{i=1}^n x_i \;\;log \;x_i \\ subject \; to & \quad Ax \leq b \\ & \quad {\bf 1}^Tx=1 \end{align*}
I understand the interpretation of (1) as the minimum volume covering ellipsoid, but when would you ever want to solve this? E.g., if doing machine learning, you might want to do something like this to training data for the purpose of outlier detection, but such a model would surely be overfit, and it would seem better to incorporate a more graceful probabilistic decay from the boundary (you might consider setting the discovered ellipsoid to be equal to, say, the 95th percentile probability contour in a multivariate Gaussian model; but instead of doing this, it would seem wiser to just maximize a multivariate Gaussian likelihood directly). So when might one want to solve this problem?
As for (2), I can vaguely imagine situations in which one might want to find a maximum entropy probability distribution (as suggested by the cost function and second constraint) which satisfy some constraints, but what is a realistic example where one might want to impose the linear inequality constraint $Ax \leq b$?