 
 
 
 
 
 
 
  
A Gaussian variable s has two inputs m and v and prior
probability 
 .
The variance is
parameterised this way because then the mean and expected exponential
of v suffice for computing the cost function.  In Appendix A, it is
shown that when s, m and v are mutually independent, i.e. 
q(s,
m, v) = q(s)q(m)q(v),
.
The variance is
parameterised this way because then the mean and expected exponential
of v suffice for computing the cost function.  In Appendix A, it is
shown that when s, m and v are mutually independent, i.e. 
q(s,
m, v) = q(s)q(m)q(v), 
 yields
yields
 .
The posterior approximation q(s) is defined to
be Gaussian with mean
.
The posterior approximation q(s) is defined to
be Gaussian with mean 
 and variance
and variance 
 :
:
 .
This yields
.
This yields
 .
The parameters
.
The parameters 
 and
and 
 are to be optimised
during learning.
are to be optimised
during learning.
The output of a latent Gaussian node trivially provides expectation
and variance: 
 and
and 
 .
The
expected exponential is
.
The
expected exponential is
|  | = |  | (4.3) | 
| = | ![$\displaystyle \int (2\pi \widetilde{s})^{-1/2}\exp\left[\frac{-(s-\overline{s})^2}{2\widetilde{s}}+s\right]ds$](img83.gif) | (4.4) | |
| = | ![$\displaystyle \int (2\pi \widetilde{s})^{-1/2}\exp\left[\frac{-(s-\overline{s}-\widetilde{s})^2}{2\widetilde{s}}+\overline{s}+\frac{\widetilde{s}}{2}\right]ds$](img84.gif) | (4.5) | |
| = |  | (4.6) | 
 ,
,
 and
and
 .
.
 
 
 
 
 
 
