The formulation of the GAN in Goodfellow et al's 2014 paper are as variational optimisatons (i.e. over spaces of functions) and their practical implementation via parametric optimisation. The theory in the paper is developed for optimisaton over functions D (discriminator) and G (generator). In the implementation the functions D and G are realised by multilatyer or deep neural networks, so this is a parametric optimisation. Thus the variational optimisation indicated in the theory part of the paper is never actually carried out.
Although not stated in the 2014 paper, the theoretical results on GAN convergence and Nash equilibria require dim(z)$\geq$dim(x), where dim(x) is the latent space dimension and dim(z) the latent space dimension, for validity of Proposition 1 concerning the optimal detector. All the other results in the paper assume the optimal detector result.
When dim(z)<dim(x), which generally happens in practice, Proposition 1 is false since the variational calculus used in the proof (after the change of variables x=G(z)) cannot be applied to non continuous/non differentiable integrands. It is demonstrated in the paper: https://www.researchgate.net/publication/356815736_Convergence_and_Optimality_Analysis_of_Low-Dimensional_Generative_Adversarial_Networks_using_Error_Function_Integrals
that $p_g(x)$, the PDF of the generator output, is degenerate whenever dim(z)<dim(x), i.e. it contains delta functions. Thus when writing the optimisation problem for $V[D,G]$ for the function D as:
$\int\left( {\rm p}_d(x)\log(D(x))+{\rm p}_g(x)\log[1-D(x)]\right)dx$,
we cannot apply calculus of variations because the integrand contains delta functions, which are discontinuous, via the term $p_g(x)$. This has been demonstrated in practice by the results in C. Qin et al. (2020) 1