next up previous
Next: 1. Independent Variable Group Up: Independent Variable Group Analysis Previous: Independent Variable Group Analysis

   
1. Introduction

The goal of unsupervised learning is to extract an efficient representation of the statistical structure implicit in the observations. A good model is both accurate and simple in terms of model complexity, i.e., it forms a compact representation of the input data. Sparse coding, independent components, etc., can all be justified from the point of view of constructing compact representations. Learning amounts to searching in a model space by optimization of some cost function that measures both the accuracy of representation and--ideally at least--the model complexity.

In problems with a large number of diverse observations there are often groups of variables which have strong mutual dependences within the group but which can be considered practically independent of the variables in other groups. It can be expected that the larger the problem domain, the more independent groups there are[*]. Estimating a model for each group separately produces a more compact representation than applying the model to the whole set of variables. Compact representations are computationally beneficial and, moreover, offer better generalization.



 

Krista Lagus
2001-08-28