For efficiency reasons in Matlab code, the data elements, that are missing, are set to any value (e.g. zero) and the fact that they are missing is otherwise taken account of.
The approximation
is assumed to be a
Gaussian density with a diagonal covariance matrix. This simplifies
the cost function
CKL given by (7) into
expectations of many simple terms. The terms concerning the
reconstruction error of the missing values are not included in the
cost function.
The partial derivatives of
CKL with respect to parameters
are used for adaptation of
.
The effect of
i(t) on
CKL is transferred through
the noise vectors
e(t) which are also the reconstruction
errors as seen in formula (3). When the variance of
noise for the missing elements is set to infinity, the algorithm
effectively ignores the reconstruction errors of the missing values.
The reconstruction of data is obtained by the mapping f from the estimated factors. There is an analogy with the SOM. Factor vector scorresponds to the winning map unit in the SOM and f(s)corresponds to the model vector of the winner.
![]() |