Robust Dimensionality Reduction: A Bootstrap-Based Evaluation of PCA with Applications in Nutritional and Environmental Sciences

Authors

  • Zakiah I. Kalantan Department of Statistics, Faculty of Sciences, King Abdulaziz University, Jeddah, 21589, Saudi Arabia https://orcid.org/0000-0002-7040-5623
  • Lujain S. Alharbi Department of Statistics, Faculty of Sciences, King Abdulaziz University, Jeddah, 21589, Saudi Arabia
  • Maryam H. Al-Zahrani Department of Biochemistry, Faculty of Sciences, King Abdulaziz University, Jeddah, 21589, Saudi Arabia
  • Sulafah M. Saleh Binhimd Department of Statistics, Faculty of Sciences, King Abdulaziz University, Jeddah, 21589, Saudi Arabia

DOI:

https://doi.org/10.37256/cm.6120256016

Keywords:

big data, data visualization, principal component analysis, bootstrap, stability

Abstract

The complex structures of vast amounts of data provide a considerable challenge to researchers. Dimensional reduction methods are reliable and transform high-dimensional data into lower-dimensional representations while preserving most original information. Principal Component Analysis (PCA) is a commonly used approach for dimensionality reduction that transforms data into a lower-dimensional space while preserving important information. The variability within the sample may affect the stability and reliability of PCA results. The constraint of existing approaches compromises the accuracy of PCA stability assessments in practical data scenarios. These methodologies frequently depend on linear assumptions and encounter difficulties when addressing high-dimensional data. This study used the bootstrap method to assess the stability of PCA by assessing the variability of eigenvalues and principal components over several bootstrap iterations. We evaluate how stability metrics, particularly confidence intervals for eigenvalues and the proportion of variance clarified, can assist in determining the optimal number of principal components. The results indicate that the bootstrap provides a helpful framework for evaluating the robustness of PCA and guiding informed decisions on dimensionality reduction in many applications, including data compression, visualization, and classification. Moreover, the results illustrate the efficacy of this method in enhancing the reliability and interpretability of PCA findings among distinct data-driven research endeavors. This study enhances understanding of how principal component analysis (PCA) tackles data unpredictability while delivering valuable insights for professionals in several disciplines.

Downloads

Published

2025-01-25

How to Cite

1.
Kalantan ZI, Alharbi LS, Al-Zahrani MH, Binhimd SMS. Robust Dimensionality Reduction: A Bootstrap-Based Evaluation of PCA with Applications in Nutritional and Environmental Sciences. Contemp. Math. [Internet]. 2025 Jan. 25 [cited 2025 Feb. 23];6(1):923-42. Available from: https://ojs.wiserpub.com/index.php/CM/article/view/6016