2

On the one hand, I read that the derivative of the complex conjugate $C[z]=\overline{z}$ is not differentiable anywhere (for instance see here). (see 1, below)

On the other hand, I see in physics taking the derivative of a complex scalar field to obtain the equation of motion using the Euler-Lagrange method (for instance see enter link description here (see 2, below)

So which is it, can we or can we not take the derivative?


  1. For case 1, the reference states that a complex function is differentiable if and only if it satisfies the Cauchy-Riemann equations:

$$ f[z]=f[x+iy]=u[x,y]+iv[x,y] $$

Then f is differentiable if

$$ \frac{\partial u}{\partial x} =\frac{\partial v}{\partial y} \\ \frac{\partial u}{\partial y} =-\frac{\partial v}{\partial x} $$

Then for the complex conjugate $C[x+iy]=x-iy$ then $\partial u/\partial x =1$ and $\partial v/\partial y=-1$. Consequently $C[z]=\overline{z}$ is not differentiable anywhere in the complex plane.

  1. For case 2, the physics paper defines the Lagrangian of a complex scalar free field as follows:

$$ \mathcal{L}=(\partial \phi^*)(\partial \phi) $$

Then they claim that

$$ \frac{\partial \mathcal{L}}{\partial (\partial \phi)}=\partial \phi^*\\ \frac{\partial \mathcal{L}}{\partial (\partial \phi^*)}=\partial \phi $$


To obtain these results I assume they apply the chain rule

$$ \frac{\partial }{\partial (\partial \phi)}(\partial \phi^* \partial \phi)=\partial \phi\frac{\partial }{\partial (\partial \phi)}(\partial \phi^* )+\partial \phi^* \frac{\partial }{\partial (\partial \phi)}(\partial \phi) $$

Is the following term not an 'illegal' derivative of a complex conjugate function?

$$ \partial \phi\frac{\partial }{\partial (\partial \phi)}(\partial \phi^* ) $$

Why are they allowed to pose it equal to 0?

Anon21
  • 2,581
  • In (1) you are confusing "if" with "only if". Not that it matters for the question you are asking. – ancient mathematician Jun 27 '20 at 15:58
  • For case 2, are you sure it isn't $\mathcal{L}=\partial\phi^\ast\partial\phi,,\frac{\partial\mathcal{L}}{\partial\partial\phi}=\phi^\ast,,\frac{\partial\mathcal{L}}{\partial\partial\phi^\ast}=\phi$? – J.G. Jun 27 '20 at 15:58
  • It's been a while since I did field theory, but is it possible that they consider $\partial\phi^\ast$ to be an independent variable, so its partial derivative with respect to $\partial\phi$ is 0? – Vercassivelaunos Jun 27 '20 at 16:00
  • Further to @Vercassivelaunos's suggestion, read this. – J.G. Jun 27 '20 at 16:00
  • @AlexandreH.Tremblay I've corrected two typos in my comment; re-read it. – J.G. Jun 27 '20 at 16:02
  • @J.G. I re-read your correct comment. The reference does contain the dot above the $\phi$ consistent with the notation for the derivative previously used in the paper. – Anon21 Jun 27 '20 at 16:06
  • The paper defines $\mathcal{L}$ to be the integrand in its equation (1), which is not $\partial \phi^* \partial \phi^*$. And, although I think this has been addressed by you above in comments, nothing has changed in the Question: you seem to have discarded all the time derivatives in the paper's equation (3). As mentioned in comments, the paper's equation (3) differs from your version -- your version is two copies of the same equation. – Eric Towers Jun 27 '20 at 16:09
  • OK, let me rephrase my point: your asterisks are sometimes in the wrong place, leading you to state two identical-looking equations. (Edit: @EricTowers has also already explained my point.) I know about $\dot{\phi}$ etc. I also repeat my suggestion that another question I linked to makes sense of this issue. – J.G. Jun 27 '20 at 16:10
  • @J.G. Oh okay, I just understood what you mean. I have edited my post. – Anon21 Jun 27 '20 at 16:12

1 Answers1

1

I've already recommended a discussion here, but it may be worth rewriting to tweak its emphasis. I'll compare three contexts in which one would want to define "derivatives"; in each case, I'll consider a function $f(x)$, rather than switching from the label $x$ to $z$ for the complex case.

  • A function $f:\,\Bbb R\mapsto\Bbb R$ differentiable at $x$ satisfies $f(x+h)\in f(x)+[(Df)(x)]h+o(h)$, with $Df(x)\in\Bbb R$ denoting the derivative of $f$ at $x$.
  • For an integer $n\ge2$, A function $f:\,\Bbb R^n\mapsto\Bbb R^n$ differentiable at $x$ satisfies what looks like the same condition, only now $Df(x)\in\Bbb R^{n\times n}$.
  • But if $S$ is a number system $n$-dimensional over $\Bbb R$, a function $f:\,S\mapsto S$ might satisfy a stronger condition, that we can take $Df(x)\in S$ where, as in the original example, the $O(h)$ term multiplies a number, in this case one in $S$, by another such number, $h$. The Cauchy-Riemann equations characterize which functions $f:\,\Bbb C\mapsto\Bbb C$ succumb to this treatment.

The trick here is respecting the multiplication defined on $\Bbb C$. If we denote the real and imaginary parts of a compex number $w$ as $w_0$ and $w_1$, we cannot in general change$$f_j(x+h)\in f_j(x)+\sum_{k=0}^1[(Df)(x)]_{jk}h_k+o(h)$$to $f_j(x+h)\in f_j(x)+(yh)_j+o(h)$ for some $y\in\Bbb C$. But the general case succumbs to another treatment. Since any $w\in\Bbb C$ satisfies $w_0=(w+w^\ast)/2,\,w_1=(w-w^\ast)/(2i)$,$$\begin{align}f_j(x+h)- f_j(x)&\in[(Df)(x)]_{j0}(h+h^\ast)/2+[(Df)(x)]_{j1}(h-h^\ast)/2i+o(h)\\&=\frac12\left\{[(Df)(x)]_{j0}-i[(Df)(x)]_{j1}\right\}h\\&+\frac12\left\{[(Df)(x)]_{j0}+i[(Df)(x)]_{j1}\right\}h^\ast+o(h).\end{align}$$So now, instead of only having a rate of change against small $h$, we also have a separate one against $h^\ast$. And although $h,\,h^\ast$ are "dependent" in the sense either value determines the other, these two rates of change are uniquely defined. The abbreviation $x:=\partial\phi$ lets us work with the given example from physics:$$\mathcal{L}(x)=x^\ast x\implies\mathcal{L}(x+h)-\mathcal{L}(x)=(x^\ast+h^\ast)(x+h)-x^\ast x\in x^\ast h+xh^\ast+o(h).$$

J.G.
  • 115,835