arXiv Analytics

Sign in

arXiv:1708.04981 [stat.ME]AbstractReferencesReviewsResources

On the number of principal components in high dimensions

Sungkyu Jung, Myung Hee Lee, Jeongyoun Ahn

Published 2017-08-16Version 1

We consider the problem of how many components to retain in the application of principal component analysis when the dimension is much higher than the number of observations. To estimate the number of components, we propose to sequentially test skewness of the squared lengths of residual scores that are obtained by removing leading principal components. The residual lengths are asymptotically left-skewed if all principal components with diverging variances are removed, and right-skewed if not. The proposed estimator is shown to be consistent, performs well in high-dimensional simulation studies, and provides reasonable estimates in a number of real data examples.

Related articles: Most relevant | Search more
arXiv:2104.03087 [stat.ME] (Published 2021-04-07)
Dynamic Principal Subspaces with Sparsity in High Dimensions
arXiv:1212.6006 [stat.ME] (Published 2012-12-25)
Principal Component Analysis for Experiments
arXiv:1510.03671 [stat.ME] (Published 2015-10-13)
Model distances for vine copulas in high dimensions with application to testing the simplifying assumption