Moving from nonlinear gaussian ssm, we relax the assumption that the noise is Gaussian. The dynamics may remain Gaussian, but the observation model has a non-Gaussian likelihood:

where can be any distribution, e.g.:

  • Poisson: for count data
  • Bernoulli: for classification
  • Student-t: for heavy-tailed noise

This is used in online learning using ssm for neural network classification.

Inference

Exact Kalman filtering is not possible — the non-Gaussian likelihood means the posterior is no longer Gaussian after the update step. See inference methods.