I'm currently following the MIT Single Variable lectures online and the professor states that the binomial theorem for the expansion
$(x + \Delta x)^{n} = x^{n} + nx^{n-1}\Delta x + O((\Delta x)^{2})$
How is this derived, and what does the big O term of the expansion represent in terms of the binomial theorem. Just to put this in context, this expression was used when computing the derivative of $x^{n}$ using the limit definition.