This is essentially a follow-up to this question: Differences between a matrix and a tensor
I think I have a good intuition/idea for the change of basis for a rank-(1,1) tensor ($A\vec{v} = \vec{w}$) via diagonalization/similarity transformations/Jordan normal form, etc.
Now that I am studying the definition of symplectic and Lorentz transformations, which involve transformations of rank-(2,2) tensors (quadratic forms, $\vec{u}^T A \vec{v}$), I don't really understand the geometric intuition for why the basis of a quadratic form changes as $T^TAT$ instead of $T^{-1}AT$ like for a rank-(1,1) tensor. This is making it more difficult for me to understand the type of "invariance" that the definitions of symplectic and Lorentz transformations are supposed to imply.
Essentially: why is the change of basis for a quadratic form have the shape $T^T A T$? Is there a good geometric intuition for this/picture to have in mind?
Does this have something to do with Sylvester's Law of Inertia or the relationship between quadratic forms and symmetric bilinear forms? http://www.ucl.ac.uk/~ucahaya/ChapterV.pdf
I guess another way to phrase the question is: what is the difference in the geometric intuitions for similarity and congruence (of matrices)? http://www.maths.qmul.ac.uk/~twm/MTH6140/la26.pdf
This is especially confusing considering I was taught that they meant the same thing in 9th grade geometry -- clearly I know very little about this. Admittedly when the matrices $T$ are orthogonal (unitary), the definitions do coincide, but they don't coincide always -- for instance symplectic matrices are NOT always orthogonal, even though they are defined as changes of basis for the matrix $J$: $A^T J A = J$.