I am reading up on AI and now read https://en.wikipedia.org/wiki/Kernel_perceptron
It says:
To derive a kernelized version of the perceptron algorithm, we must first formulate it in dual form, starting from the observation that the weight vector w can be expressed as a linear combination of the n training samples.
But what is "dual form"?