Suppose $\mu$ is $m \times 1 $, $A$ is $m \times m$, $B$ is always $m \times n$ and $\Sigma$ is $n \times n$. Note that $\Sigma$ is symmetric.
I need to differentiate the follow form:
$$\ell = -\log( \det[B \Sigma B^T]) - \operatorname{tr}([B \Sigma B^T]^{-1} [\mu\mu^T - \mu\mu^T A^T - A(\mu\mu^T)^T + A \mu\mu^T A^T])$$
Now I would like to know how can I obtain the following:
$$\frac{\partial \ell }{\partial A} = \text{?}$$ $$\frac{\partial \ell }{\partial \Sigma} = \text{?}$$ $$\frac{\partial \ell }{\partial B} = \text{?}$$
And What would the optimal $A$ , $\Sigma$ and $B$ be after differentiating and rearranging the terms to one side ?
Update:
Through its differential,
I have taken an attempt and obtained the following: Let $Z = [\mu\mu^T - \mu\mu^T A^T - A(\mu\mu^T)^T + A \mu\mu^T A^T]$
$$d \ell = -tr\Big(\big[2 B^T \Sigma(B\Sigma B^T)^{-1})\big]dB + \big[ B^T(B\Sigma B^T)^{-1}B\big] d\Sigma + \big[ (B\Sigma B^T)^{-1}Z(B\Sigma B^T)^{-1} B\Sigma +\big((B\Sigma B^T)^{-1}Z(B \Sigma B^T)^{-1}B \Sigma\big)^T \big]dB + \big[ B^T(B\Sigma B^T)^{-1}Z^T (B\Sigma B^T)^{-1}B\big]d \Sigma\Big) - tr\Big(\Big(\big[B\Sigma B^T\big]^{-1}\big[ 2\mu^T\mu - 2\mu\mu^TA^T\big]\Big)dA\Big)$$
Please kindly verify if it is correct.
The problem that remains is how to rearrange the terms such that the optimal $A, B, \Sigma$ will be on one side.