Loading...
Detecting outlying variables in multigroup data: A comparison of different loading similarity coefficients
Gvaladze,Sopiko ; De Roover,Kim ; Tuerlinckx,Francis ; Ceulemans,Eva
Gvaladze,Sopiko
De Roover,Kim
Tuerlinckx,Francis
Ceulemans,Eva
Abstract
Multivariate multigroup data are collected in many fields of science, where the so-called groups pertain to, for instance, experimental groups or countries the participants are nested in. To summarize the main information in such data, principal component analysis (PCA) is highly popular. PCA reduces the variables to a few components that are linear combinations of the original variables. Researchers usually assume those components to be the same across the groups and aim to apply a simultaneous component analysis. To investigate whether this assumption is reasonable, one often analyzes the groups separately and computes a similarity index between the group-specific component loadings of the variables. In many cases, however, most variables have highly similar loadings across the groups, but a few variables, which we will call "outlying variables," behave differently, indicating that a simultaneous analysis is not warranted. In such cases, the outlying variables should be removed before proceeding with the simultaneous analysis. To do so, the variables are ranked according to their relative outlyingness. Although some procedures have been proposed that yield such an outlyingness ranking, they might not be optimal, because they all rely on the same choice of similarity coefficient without evaluating other alternatives. In this paper, we give an overview of other options and report extensive simulations in which we investigate how this choice affects the correctness of the outlyingness ranking. We also illustrate the added value of the outlying variable approach by means of sensometric data on different bread samples.
Description
Funding Information: The research leading to the results reported in this paper was supported by the Research Fund of KU Leuven (C14/19/054). For the simulations, we used the infrastructure of the VSC?Flemish Supercomputer Center, funded by the Hercules Foundation and the Flemish Government?department EWI.
Date
2021
Journal Title
Journal ISSN
Volume Title
Publisher
Research Projects
Organizational Units
Journal Issue
Keywords
component similarity, multivariate multigroup data, PCA, SCA, Tucker's congruence, SIMULTANEOUS COMPONENT ANALYSIS, CONGRUENCE COEFFICIENTS, ROTATION, MULTIBLOCK, MODELS, NUMBER, COMMON
Citation
Gvaladze, S, De Roover, K, Tuerlinckx, F & Ceulemans, E 2021, 'Detecting outlying variables in multigroup data : A comparison of different loading similarity coefficients', Journal of Chemometrics, vol. 35, no. 2, e3233. https://doi.org/10.1002/cem.3233
