I came across a similarity/metric learning method that takes in the form of $x^TWy = z$, where $ x $ and $y$ are real valued vectors. For example, two images.
Breaking it into a more familiar form: $ x^TWy = \sum_{ij}w_{ij} x_{i}y_{j} = z $
This essentially looks very similar to polynomial regression with only interactions between features (without the polynomials). i.e. $z = f_w(x)= \sum_{i} w_ix_i + \sum _i\sum _{j=i+1} w_{ij}x_ix_j$
I was curious to see if the optimization for the matrix W is the same as doing optimization for multivariate linear/polynomial regression, since $x$ and $y$ are fixed, and the only variate is the weight matrix $W$?
And if it is a form of linear regression, is the optimization then convex?