Next: Regularization Up: Principal Component Analysis for Previous: Adapting the Subspace Learning

Overfitting

A trained PCA model can be used for reconstructing missing values:

$\displaystyle \hat{x}_{ij} = \sum_{k=1}^c a_{ik} s_{kj} \, , \quad (i,j) \notin O \, .$

Although PCA performs a linear transformation of data, overfitting is a serious problem for large-scale problems with lots of missing values. This happens when the value of the cost function

in Eq. (10) is small for training data, but the quality of prediction (16) is poor for new data. For further details, see [12].

Subsections

- Regularization
- Variational Bayesian Learning

Tapani Raiko 2007-09-11