This might be a question with a very simple solution but I really don't see where this comes from. In the book on Convex optimization by Boyd and Vandenberghe in the section of examples on duality, the author compute a norm approximation problem. Particularly, they write the norm minimization problem $$ \text{minimize } \|Ax - b\|$$ as \begin{equation} \begin{aligned} & \text{minimize} & & \|y\|\\ & \text{subject to} & & Ax - b = y \end{aligned} \end{equation}
From here, I know the dual would have the form $L(x,y,\nu) = \|y\| + \nu^T(Ax - b -y) = \|y\| + \nu^T(Ax - b) - \nu^T y$. Now, the authors claim that the dual problem ends up being
\begin{equation} \begin{aligned} & \text{minimize} & & b^T\nu\\ & \text{subject to} & & \|\nu\|_* \leq 1 \\ & & A^T\nu = 0 \end{aligned} \end{equation}
I thought the objective function in the dual would be $-\nu^T b$ and the second constraint would be $\nu^TA$. Can anyone give me a mathematical reason of why they take the transpose of these expressions? Also, why is the sign for the objective function positive and not negative. Thanks for your help!