I know this method works by finding the root of the tangent line to the initial point, which is the best linear approximation of the function at that point. And I also know the closer the initial point to the root, the better the approximation, since a tangent line approximation is more accurate for nearby values.
But how do we know the root of the tangent line always produces a value that is closer the the root than the initial point? I haven't seen any author provide an explanation on this other than providing some diagrams. Is it possible that sometimes it gives an value that is slightly further away from the initial point (provided the initial guess is fair)?
