I have a discrete stochastic function $f(t_k)$ (in my case, it's a time signal related to atmospheric parameters). I know this function to have a certain smoothness. One way I can specify its smoothness is using its auto-correlation function, or equivalently, its power spectral density which can be estimated by $S_f(\omega_k) = \mathcal{F}(f(t_k)) \overline{\mathcal{F}(f(t_k))}$, where here $\mathcal{F}$ is the discrete Fourier transform over some arbitrary window of data and $\bar{}$ means complex conjugate. Let's assume that the SDF is the a-priori information I have available for defining the signal smoothness.
I want to be able to define sequential or batch estimation algorithms to estimate this signal given noisy measurements. For simplicity, let's say:
$$ y(t_k) = f(t_k) + w(t_k) $$
is our measurement, where $E[w(t_k)w(t_l)] = \sigma^2\delta(k-l)$, where $\sigma^2$ is the noise variance and $\delta$ is the Dirac delta function.
If I want to predict/estimate $\hat{f}(t_k)$ given past estimates $\{\hat{f}(t_{k-1}), \hat{f}(t_{k-2}), \dots \}$, one way to do this is using an auto-regressive (AR) model:
$$ \hat{f}(t_{k}) = c + \sum_{i=1}^P a_i \hat{f}(t_{k-i}) $$
where $c$ and $\{a_i\}$ are model coefficients. (There is a direct relationship between an AR model and its corresponding power spectral density, correct?)
Another way is by estimating and using derivatives of $f(t_{k-1})$:
$$ \hat{f}(t_k) = \hat{f}(t_{k-1}) + \Delta t \hat{\dot{f}}(t_{k-1}) + \frac{\Delta t}{2} \hat{\ddot{f}}(t_{k-1}) + \cdots $$
where $\Delta t = t_k - t_{k-1}$, $\dot{f}$ denotes the first time derivative (i.e. rate) of $f$, and $\ddot{f}$ denotes the second time derivative (i.e. acceleration) of $f$, etc. (I don't at all understand how this method of predicting $f$ relates to its power spectral density.)
Intuitively, I feel there must be a relationship between these two ways of predicting/estimating a smooth function, but I can't quite pin it down, nor find any references that do. So...
What is the relationship between using an auto-regressive model and using function derivatives to predict a smooth signal $f(t_k)$, particularly in regard to knowing a-priori its power spectral density $S_f(\omega_k)$.
Thank you for any help/guidance/direction. If this question belongs in another community, please let me know.