There are some nice answers already, but I would like to highlight some other aspect. You say that continuity can be intuitively described by
I have a pointed pencil and any curve I can draw w/o lifting the pencil from the paper is continuous.
Your intuition is not wrong, but it's not a good enough (in fact you are saying that if the graph of a function is path-connected, then that function is continuous, see this question). It misses the main point of continuity (assuming $\mathbb{R}^n$ space), which is
$$\text{if the input changes a little, the output cannot change too much.} \tag{$\spadesuit$}$$
There is a big problem with the above statement, i.e. what do "little" and "too much" mean? If we would fix some relation between these changes at one point $x_0$, what about other places? The different concepts of continuity reflect different takes on the matter, and the standard $\varepsilon$-$\delta$ definition is just one of them.
Another way of saying $(\spadesuit)$ would be big change in output implies big change in input, or to make it more precise a change bigger than $\varepsilon$ in output implies a change in input bigger than $\delta$. Transposing the implication, a change bounded by $\delta$ in input implies change bounded by $\epsilon$ in output. Looking familiar?
By relating $\varepsilon$ and $\delta$ in different ways we can get different versions of continuity. If $f$ is continuous on $\mathbb{R}$, it means that, at each point there is some relation between input change and output change (input change bounds and output change bounds) and the most basic case is that (for each point) for each $\varepsilon$ there is some $\delta$.
To give you more examples, if that $\varepsilon$-$\delta$ relation is the same at all the points, we get uniform continuity. If we strengthen it further, e.g. if that $\varepsilon$-$\delta$ relation is linear, we get Lipschitz continuity (or α-Hölder continuity if the relation is like function $x \mapsto x^\alpha$). On the other hand, continuously differentiable functions are these, which change in a way so that we are able to tell, in a consistent manner, what that change is (note that $x\mapsto x^2$ on $\mathbb{R}$ continuously differentiable, but not uniformly continuous or Lipschitz).
Finally, if you get out of $\mathbb{R}^n$ into more complex spaces, there are yet different notions and definitions of continuity (the one I like the most is via nets, you can find some more info about it here), but that's not the subject of this post.
I hope that helps $\ddot\smile$