Suppose $m > 1$. Let $f: \mathbb{R}^n \rightarrow \mathbb{R}^m$ be a smooth map. Consider $f + Ax$ for $A \in \mathrm{Mat}_{m\times n}$. Define $F: \mathbb{R}^n \times \mathrm{Mat}_{m\times n} \rightarrow \mathrm{Mat}_{m\times n}$ by $F(x,A) = df_x + A$.
I am really grateful to @Ross B. 's answer on the question The derivative of a linear transformation, $DF$ would be a rank-3 tensor with elements
$$ (DF)_{i,j,k} = \frac{\partial^2 f_i}{\partial x_j \partial x_k} $$
Some authors also define matrix-by-vector and matrix-by-matrix derivatives differently be considering $m \times n$ matricies as vectors in $\mathbb{R}^{mn}$ and "stacking" the resulting partial derivatives.
$dF$ maps from $m \times n \times n$ to $m \times n$?
Thank you very much.