Finally, the priors of the quantities and have flat
Gaussian distributions
(the constants depending
on the scale of the data). When going up in the hierarchy, we use
the same distribution for each column of a matrix and for each
component of a vector. On the top, the number of required constant
priors is small. Thus very little information is provided and needed a
priori. This kind of hierarchical priors are used in the experiments
later on this paper.