Gibbs' phenomenon is the fact that there is NO number of terms large enough to guarantee that the distance from the partial sum to the square wave is less than $\delta$: for any partial sum, there's always a value around 1.09, I believe (assuming a square wave that goes from $-1$ to $1$). The point is that the location of this "overshoot" moves closer and closer to the discontinuity, so that you get pointwise convergence, but not $L^\infty$ convergence.
Dym and McKean's book on Fourier Series and Integrals explains this in some detail, but leaves many things to the reader -- it's not for the faint of heart.
I seem to recall some story about some famous mathematician or physicist buying a "harmonic analyzer" (a mechanical Fourier-series finder) and complaining that the machine didn't work because it didn't really converge to a square wave, and Gibbs perhaps resolving this by explaining that pointwise and $L^\infty$ convergence were distinct notions.
Post-comment addition:
I suspect that messing with the convolution-multiplication theorem might get you a reasonable estimate of the rate of convergence.
From some rough scratchwork based on that idea, my suspicion is that the $n$th partial sum at any point differs from the true value by some error $E(n)$, and that $|E(n)|$ ends up bounded by a constant times $\frac{1}{n}$, with the constant depending on the point at which you're examining the convergence.