I am trying to derive the following expression:
$\frac{\partial} {\partial \theta_i}Tr(A(\theta)^{-1}y (A(\theta)^{-1}y)^{T}B(\theta))$.
what I did is :
$\frac{\partial} {\partial \theta_i}Tr(A(\theta)^{-1}y (A(\theta)^{-1}y)^{T}B(\theta))= Tr \left(\frac{\partial (A(\theta)^{-1})}{\partial \theta_i} y (A(\theta)^{-1}y)^{T}B(\theta) + A(\theta)^{-1}y y^{T}\frac{\partial (A(\theta)^{-1})}{\partial \theta_i}B(\theta) +A(\theta)^{-1}y (A(\theta)^{-1}y)^{T} \frac{\partial B(\theta) }{\partial \theta_i} \right)$.
I already know that the $\frac{\partial (A(\theta)^{-1})}{\partial \theta_i} = -A(\theta)^{-1}\frac{\partial A(\theta)}{\partial \theta_i} A(\theta)^{-1}$.
However, I am not sure that I applied correctly the chain rule inside the trace.