Nonlinear dynamical factor analysis (NDFA) [1] is
a variational Bayesian method for learning nonlinear state-space models.
The mappings
and
in
Eqs. (1) and (2) are modelled with
multilayer perceptron (MLP) networks whose parameters can be learned from
the data.
The parameter vector
include network weigths, noise levels, and
hierarchical priors for them.
The posterior distribution over the sources
and the parameters
is approximated by a Gaussian distribution
with some
further independency assumptions.
Both learning and inference are based on minimising a
cost function
The variational Bayesian inference algorithm in [1] uses the gradient of the cost function w.r.t. state in a heuristic manner. We propose an algorithm that differs from it in three ways. Firstly, the heuristic updates are replaced by a standard conjugate gradient algorithm [11]. Secondly, the linearisation method from [7] is applied. Thirdly, the gradient is replaced by a vector of approximated total derivatives, as described in the following section.