SciELO - Scientific Electronic Library Online

 
vol.44 número1On Some Statistical Properties of the Spatio-Temporal Product DensityA Reparameterized Weighted Lindley Distribution: Properties, Estimation and Applications índice de autoresíndice de assuntospesquisa de artigos
Home Pagelista alfabética de periódicos  

Serviços Personalizados

Journal

Artigo

Indicadores

Links relacionados

  • Em processo de indexaçãoCitado por Google
  • Não possue artigos similaresSimilares em SciELO
  • Em processo de indexaçãoSimilares em Google

Compartilhar


Revista Colombiana de Estadística

versão impressa ISSN 0120-1751

Resumo

GANAN-CARDENAS, Eduard  e  CORREA-MORALES, Juan Carlos. Comparison of Correction Factors and Sample Size Required to Test the Equality of the Smallest Eigenvalues in Principal Component Analysis. Rev.Colomb.Estad. [online]. 2021, vol.44, n.1, pp.43-64.  Epub 25-Fev-2021. ISSN 0120-1751.  https://doi.org/10.15446/rce.v44n1.83987.

In the inferential process of Principal Component Analysis (PCA), one of the main challenges for researchers is establishing the correct number of components to represent the sample. For that purpose, heuristic and statistical strategies have been proposed. One statistical approach consists in testing the hypothesis of the equality of the smallest eigenvalues in the covariance or correlation matrix using a Likelihood-Ratio Test (LRT) that follows a x2 limit distribution. Different correction factors have been proposed to improve the approximation of the sampling distribution of the statistic. We use simulation to study the significance level and power of the test under the use of these different factors and analyze the sample size required for an adequate approximation. The results indicate that for covariance matrix, the factor proposed by Bartlett offers the best balance between the objectives of low probability of Type I Error and high Power. If the correlation matrix is used, the factors W * B and cχ 2 D are the most recommended. Empirically, we can observe that most factors require sample sizes 10 or 20 times the number of variables if covariance or correlation matrices, respectively, are implemented.

Palavras-chave : Chi-square distribution; Likelihood ratio test; Power comparisons; Principal components analysis; Sphericity test.

        · resumo em Espanhol     · texto em Inglês     · Inglês ( pdf )