I am slightly ashamed to be asking this, but I have been recently reflecting on changing variables in very simple problems. If I missed a question that already discusses this please point it out to me and I will delete this one. Anyhow writing this will probably be a learning experience.
Directly from the Wikipedia page on the argument I take as an example the equation:
$$x^6 - 9 x^3 + 8 = 0. \, $$
I quickly recognize this as a high school problem and use the methods that were taught to me, namely I set $x^3 = u$ so $x = u^{1/3}$.
Then I proceed to solve quadratic equation that results from this substitution, and only at the end I apply the reverse transformation $x^3 = u$ to get an answer for my starting variable. With not much imagination I always thought that the function used when changing variables (in the above case $f(x) = x^3)$ should be bijective in the domain of interest of the starting equation. This is because I need the inverse to return to my "starting variable".
But I notice on Wikipedia that a bit more is required; the change of variable function should be a diffeomorphism, we need differentiability (and even smooth manifolds for the domain and the image).
This is where I realized that I was never taught a proof of why the change of variables method work or how it works but I was just applying these substitutions blindly.
So could someone kindly point me to a source where I can improve my understanding on this very powerful method by adding rigour to what I am doing and possibly even a geometric interpretation.