The GP kernel defines a continuous-time linear SDE (a special case of the continuous-discrete state space model). The SDE is then exactly discretised via matrix exponentials to obtain the discrete-time transition matrices and noise covariance . See also this paper on discretisation of continuous-time linear systems.
Each Mátern kernel () corresponds to a linear SDE (a continuous-discrete state space model) with state dimension . We need to train lengthscale (), variance (), and likelihood noise ().
Mátern-5/2 (, )
A typical choice for spatial and temporal problems. The continuous-time SDE is:
The Brownian motion is scalar (), entering only the third state dimension. The observation model picks out the first component of the state:
For Mátern-1/2 this simplifies to and . For higher-order Mátern, and are dense matrices computed numerically. Inference then uses kalman filtering and smoothing.