The posterior distributions of the sources are effectively approximated by mixtures of Gaussians which means that the above update rules are not directly applicable for them. The new values for discrete posterior probabilities of the source indices and the posterior means and variances of the Gaussians corresponding to different source are most easily solved by making a quadratic approximation for Cf based on the derivatives and , where Cf denotes the sum of terms of the form (27).
The usefulness of this approximation is based on the fact that the cost function tries to minimise the misfit between the approximation and the actual posterior. If we can solve the actual posterior and show that it can be described by our parametric approximation, we know that the cost function will be minimised by setting the parameters of the approximation to values corresponding to the actual posterior.
In this case, making a second order approximation for Cf is equivalent to approximating p(X | si(t)) by an unnormalised Gaussian distribution. Since the prior p(si(t)) is a mixture of Gaussians and the posterior will be given by p(si(t) | X) = p(X | si(t)) p(si(t)) / p(X), we notice that the posterior is also a mixture of Gaussians since a Gaussian multiplied with an unnormalised Gaussian will produce another unnormalised Gaussian and the normalising factor p(X) will then make sure that the posterior is a normalised mixture of Gaussians. From the resulting mixture of Gaussians one can then determine the values for , and . In [1], a similar method without approximations was used for a linear model. Due to the linearity of the mapping, is quadratic and no approximations are needed.