next up previous contents
Next: Nonlinear factor analysis Up: Variational Bayesian methods Previous: Missing values   Contents


Partially observed values

A single value in the data can be somewhere between being observed and missing. So-called coarse data means that we only know that a data point belongs to a certain subset of all possibilities. So-called soft or fuzzy data generalises this further by giving weights to the possibilities. Pearl (1988) handles the issue with so called virtual evidence. The partially observed node itself is set as missing, but a new node is added as a child for it. Observing the new node produces a wanted likelihood term for the partially observed node.

In Publication III, different ways of handling partially observed values are studied in context of variational Bayesian learning. A simple example comparing two different approaches is given in Figure 4.1. The virtual evidence approach is recommended based on both theory and experimentation in which independent factor analysis (see Section 3.1.4) was applied to real image data.

Figure 4.1: Some x-values of the data are observed only partially. They are marked with dotted lines representing their confidence intervals. Top: A simple data set for a factor analysis problem. Bottom left: In a compared approach, where a distribution is fixed over the values, the model (a Gaussian shown as the ellipse) needs to adjust to cover the distributions. Bottom right: In the virtual evidence approach, the partially observed values are reconstructed based on the model.
\begin{figure}\begin{center}
\epsfig{file=bayes_evidencenodes.eps,width=0.95\textwidth}
\end{center}
\end{figure}

Green et al. (2001) and Seltzer et al. (2004) study missing and unreliable values framework to improve speech recognition in noisy environment. The recognition accuracy is greatly improved in noisy environments by first identifying components in the spectrographic representation that are corrupt.


next up previous contents
Next: Nonlinear factor analysis Up: Variational Bayesian methods Previous: Missing values   Contents
Tapani Raiko 2006-11-21