Abstract
Statistical shape modelling(SSM) is a popular technique in computer vision applications, where the variation of shape of a given structure is modelled by principal component analysis (PCA) on a set of training samples. The issue of sample size sufficiency is not generally considered. In this paper, we propose a framework to investigate the sources of SSM inaccuracy. Based on this framework, we propose a procedure to determine sample size sufficiency by testing whether the training data stabilises the SSM. Also, the number of principal modes to retain (PCA dimension) is usually chosen using rules that aim to cover a percentage of the total variance or to limit the residual to a threshold. However, an ideal rule should retain modes that correspond to real structural variation and discard those that are dominated by noise. We show that these commonly used rules are not reliable, and we propose a new rule that uses bootstrap stability analysis on mode directions to determine the PCA dimension.
For validation we use synthetic 3D face datasets generated using a known number of structural modes with added noise. A 4-way ANOVA is applied for the model reconstruction accuracy on sample size, shape vector dimension, PCA dimension, and the noise level. It shows that there is no universal sample size guideline for SSM, nor is there a simple relationship to the shape vector dimension (with p-Value=0.2932). Validation of our rule for retaining structural modes showed it detected the correct number of modes to retain where the conventional methods failed. The methods were also tested on real 2D (22 points) and 3D (500 points) face data, retaining 24 and 70 modes with sample sufficiency being reached at approximately 50 and 150 samples respectively. We provide a foundation for appropriate selection of PCA dimension and determination of sample size sufficiency in statistical shape modelling.
Chapter PDF
Similar content being viewed by others
Keywords
- Principal Component Analysis
- Synthetic Dataset
- Principal Component Analysis Model
- Principal Mode
- Active Appearance Model
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Cootes, F., Hill, A., Taylor, C., Haslam, J.: The use of active shape models for locating structures in medical images. In: Proc. IPMI, pp. 33–47 (1993)
Cootes, T., Taylor, C., Cooper, D., Graham, J.: Active shape models and their training and application. Comput. Vis. Image Underst. 61(1), 38–59 (1995)
Sukno, F.M., Ordas, S., Butakoff, C., Cruz, S.: Active shape models with invariant optimal features: Application to facial analysis. IEEE Trans. Pattern Anal. Mach. Intell. 29(7), 1105–1117 (2007) (Senior Member-Alejandro F. Frangi)
Cootes, T., Edwards, G., Taylor, C.: Active appearance models. IEEE Transactions on Pattern Analysis and Machine Intelligence 23(6), 681–685 (2001)
Blanz, V., Vetter, T.: Face recognition based on fitting a 3D morphable model. IEEE Transactions On Pattern Analysis And Machine Intelligence 25, 1063–1074 (2003)
Osborne, J., Costello, A.: Sample size and subject to item ratio in principal components analysis. Practical Assessment, Research and Evaluation 9(11) (2004)
Guadagnoli, E., Velicer, W.: Relation of sample size to the stability of component patterns. Psychological Bulletin 103, 265–275 (1988)
Barrett, P., Kline, P.: The observation to variable ratio in factor analysis. Personality Study and Group Behavior 1, 23–33 (1981)
Arrindell, W., van der Ende, J.: An empirical test of the utility of the observations-to-variables ratio in factor and components analysis. Applied Psychological Measurement 9(2), 165–178 (1985)
Velicer, W., Peacock, A., Jackson, D.: A comparison of component and factor patterns: A monte carlo approach. Multivariate Behavioral Research 17(3), 371–388 (1982)
Aleamoni, L.: Effects of size of sample on eigenvalues, observed communalities, and factor loadings. Journal of Applied Psychology 58(2), 266–269 (1973)
Comfrey, A., Lee, H.: A First Course in Factor Analysis. Lawrence Erlbaum, Hillsdale (1992)
MacCallum, R., Widaman, K., Zhang, S., Hong, S.: Sample size in factor analysis. Psychological Methods 4, 84–99 (1999)
MacCallum, R., Widaman, K., Hong, K.P.S.: Sample size in factor analysis: The role of model error. Multivariate Behavioral Research 36, 611–637 (2001)
Jackson, D.: Stopping rules in principal components analysis: a comparison of heuristical and statistical approaches. Ecology 74, 2204–2214 (1993)
Jolliffe, I.: Principal Component Analysis, 2nd edn. Springer, Heidelberg (2002)
Sinha, A., Buchanan, B.: Assessing the stability of principal components using regression. Psychometrika 60(3), 355–369 (2006)
Daudin, J., Duby, C., Trecourt, P.: Stability of principal component analysis studied by the bootstrap method. Statistics 19, 341–358 (1988)
Besse, P.: PCA stability and choice of dimensionality. Statistics& Probability 13, 405–410 (1992)
Babalola, K., Cootes, T., Patenaude, B., Rao, A., Jenkinson, M.: Comparing the similarity of statistical shape models using the bhattacharya metric. In: Larsen, R., Nielsen, M., Sporring, J. (eds.) MICCAI 2006. LNCS, vol. 4190, pp. 142–150. Springer, Heidelberg (2006)
Besse, P., de Falguerolles, A.: Application of resampling methods to the choice of dimension in PCA. In: Hardle, W., Simar, L. (eds.) Computer Intensive Methods in Statistics, pp. 167–176. Physica-Verlag, Heidelberg (1993)
University of Notre Dame Computer Vision Research Laboratory: Biometrics database distribution (2007), http://www.nd.edu/~cvrl/UNDBiometricsDatabase.html
Papatheodorou, T.: 3D Face Recognition Using Rigid and Non-Rigid Surface Registration. PhD thesis, VIP Group, Department of Computing, Imperial College, London University (2006)
Cootes, T.: The AR face database 22 point markup (N/A), http://www.isbe.man.ac.uk/~bim/data/tarfd_markup/tarfd_markup.html
Martinez, A., Benavente, R.: The AR face database (2007), http://cobweb.ecn.purdue.edu/~aleix/aleix_face_DB.html
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Mei, L., Figl, M., Darzi, A., Rueckert, D., Edwards, P. (2008). Sample Sufficiency and PCA Dimension for Statistical Shape Models. In: Forsyth, D., Torr, P., Zisserman, A. (eds) Computer Vision – ECCV 2008. ECCV 2008. Lecture Notes in Computer Science, vol 5305. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-88693-8_36
Download citation
DOI: https://doi.org/10.1007/978-3-540-88693-8_36
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-88692-1
Online ISBN: 978-3-540-88693-8
eBook Packages: Computer ScienceComputer Science (R0)