There is another (not much more rigorous) explanation.
Assume that we have to calculate the integral
$$\int_a^bf(g(x))dx$$
about whose existence (in the Riemannian sense) we are convinced. Also, assume that the inverse of $g(x)$ exists over $[a,b]$.
Suppose that the substitution: $$u=g(x)$$ would help to evaluate the integral. So, let's do it. (Accept without any explanation that the new limits will be $g(a)$ and $g(b)$.)
The real question is the transformation of $dx$.
The integral above exists by hypothesis, so we can approximate it by the following Riemann sum:
$$\int_a^bf(g(x))dx\approx \sum_{i=1}^{n+1}f(g(\tilde{x_i}))(x_{i+1}-x_i).$$
Let's divide and multiply every term in the sum by
$$g(x_{i+1})-g(x_i)$$
the result is
$$\sum_{i=1}^{n+1}f(g(\tilde{x_i}))\frac{x_{i+1}-x_i}{g(x_{i+1})-g(x_i)}(g(x_{i+1})-g(x_i)).$$
The expression
$$\frac{x_{i+1}-x_i}{g(x_{i+1})-g(x_i)}$$
reminds me to the reciprocal of the derivative of $g$ at $\tilde x_i$ being the the representative point between $x_{i+1}$ and $x_i$. The shorter the interval the better the approximation is. So, let's use this intuition and change the Riemann sum to the following sum which looks to be still a valid approximation of our Riemann integral
$$\int_a^bf(g(x))dx\approx\sum_{i=1}^{n+1}f(g(\tilde{x_i}))\frac1{g'(\tilde x_i)}(g(x_{i+1})-g(x_i)).$$
Now, what is $\frac1{g'(\tilde x_i)}$? It is the derivative of the inverse of $g$ at $g(\tilde {x_i})$:
$$\frac1{g'(\tilde x_i)}=[g^{-1}]'(g(\tilde {x_i})).$$
With this invention our Riemann approximation becomes
$$\int_a^bf(g(x))dx\approx\sum_{i=1}^{n+1}f(g(\tilde{x_i}))[g^{-1}]'(g(\tilde {x_i}))(g(x_{i+1})-g(x_i)).$$
Introduce the following notations now: $u_{i+1}=g(x_{i+1})$, $u_i=g(x_i)$, and $\tilde{u_i}=g(\tilde{x_i}).$
With this we have
$$\int_a^bf(g(x))dx\approx\sum_{i=1}^{n+1}f(\tilde{u_i})[g^{-1}]'(\tilde{u_i})(u_{i+1}-u_i).$$
We recognize that the Riemann sum above is the approximation of the following integral
$$\int_{g(a)}^{g(b)}f(u)[g^{-1}](u)du.$$
That is, we have the following approximation
$$\int_a^bf(g(x))dx\approx\int_{g(a)}^{g(b)}f(u)[g^{-1}]'(u)du.$$
It is only a technical problem to show that this is not an approximation (given the conditions formulated at the beginning) but an actual equation.
Let's compare this result with the rule of integration by substitution. Again, we have
$$\int_a^bf(g(x))dx$$
and we want to introduce the substitution $u=g(x)$. By the rule we take the inverse of $g$ so that we have $x=g^{-1}(u)$. Then, as the rule says we take
$$\frac{d x}{du}=\frac{d [g^{-1}](u)}{du}=[g^{-1}]'(u)$$
from where, as if it was legal, we declare that
$$dx=[g^{-1}]'(u)du$$
and we get formally the result that we gained before.